Version 1 (modified by 4 years ago) (diff) | ,
---|
Automatic relation extraction
IA161 Advanced NLP Course?, Course Guarantee: Aleš Horák
Prepared by: Adam Rambousek
State of the Art
References
- Lefever, Els, Marjan Van de Kauter, and Véronique Hoste. "Evaluation of Automatic Hypernym Extraction from Technical Corpora in English and Dutch." Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014). 2014.
- Wang, Tong, and Graeme Hirst. "Exploring patterns in dictionary definitions for synonym extraction." Natural Language Engineering 18.03 (2012): 313-342.
- Schropp, Gwendolijn, Els Lefever, and Véronique Hoste. "A Combined Pattern-based and Distributional Approach for Automatic Hypernym Detection in Dutch." RANLP. 2013.
- Grefenstette, Gregory. "INRIASAC: Simple Hypernym Extraction Methods." arXiv preprint arXiv:1502.01271 (2015).
- Shen, Yatian, and Xuan-Jing Huang. "Attention-based convolutional neural network for semantic relation extraction." Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers. 2016.
- Li, Qing, et al. "A comprehensive exploration of semantic relation extraction via pre-trained CNNs." Knowledge-Based Systems (2020): 105488.
Practical Session
Enhance hypernym detection to provide better results.
- Download prepared scripts and data:
wget https://nlp.fi.muni.cz/trac/research/chrome/site/bigdata/ia161-hyper.zip
pip install majka
- Unzip,
cd ia161-hyper
and run./hyper.py
- The script reads file
vstup.txt
(each line is word|definition) and outputs hypernym for each word. - Default approach is naive: first noun in definition is hypernym
- majka gives noun to some adjectives, deal with this to improve results
- Update the
find_hyper()
function inhyper.py
to provide better results. - Upload updated script plus the output.
- Gold standard to evaluate your result: gold.txt
Attachments (2)
- gold.txt (512 bytes) - added by 4 years ago.
- ia161-hyper.zip (2.8 MB) - added by 4 years ago.