Context Navigation

RelationExtraction

Timestamp:: Dec 7, 2022, 5:11:37 PM (3 years ago)
Author:: xrambous
Comment:: --

Legend:

: Unmodified
: Added
: Removed
: Modified

private/NlpInPracticeCourse/RelationExtraction

-                      v15
+                      v16
 == Practical Session ==
+Enhance hypernym detection to provide better results.
+=== Technical Requirements ===
+ * Download [[htdocs:bigdata/ia161-hyper.zip|prepared scripts and data]]:
+ {{{
+wget https://nlp.fi.muni.cz/trac/research/chrome/site/bigdata/ia161-hyper.zip
+}}}
+ * `pip install majka`
+ * Unzip, `cd ia161-hyper` and run {{{./hyper.py}}}
+ * The script reads file {{{vstup.txt}}} (each line is word|definition) and outputs hypernym for each word.
+The task will proceed using Python notebook run in web browser in the [https://colab.research.google.com/ Google Colaboratory] environment
+with the MU G-Suite disk access.
+In case of running the codes in a local environment, the requirements are
+Python 3, and NLTK module.
+ * Access the [https://colab.research.google.com/drive/1kQdFno7kDalQkGSFgSYXT6EDbNSgPruP Python notebook in the Google Colab environment] and make your own copy. Do not forget to save your work if you want to see your changes later, leaving the browser will throw away all changes!
+ * The script reads file {{{input.txt}}} (each line is word|definition) and outputs hypernym for each word.
  * Default approach is naive: ''first noun in definition is hypernym''
+ * majka gives ''noun'' to some ''adjectives'', deal with this to improve results
+ * Update the {{{find_hyper()}}} function in `hyper.py` to provide better results.
+ * Update the {{{find_hyper()}}} function  to provide better results.
  * Upload updated script plus the output.
  * Gold standard to evaluate your result: [[raw-attachment:gold.txt|gold.txt]]
+ * Gold standard to evaluate your result: [[raw-attachment:gold_en.txt|gold_en.txt]]