Changes between Version 26 and Version 27 of private/NlpInPracticeCourse/InformationExtraction
- Timestamp:
- Oct 9, 2024, 11:19:58 AM (9 months ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
private/NlpInPracticeCourse/InformationExtraction
v26 v27 34 34 Python 3, and NLTK module. 35 35 36 1. Create {{{<YOUR_FILE>}}}, a text file named {{{ia161-UCO-08.txt}}} where '''UCO''' is your university ID. 36 The tagset of the NLTK POS tagger is based on Penn Treebank, you can check the meaning of the [https://www.ling.upenn.edu/courses/Fall_2003/ling001/penn_treebank_pos.html POS tags]. 37 38 1. Create {{{<YOUR_FILE>}}}, a text file named {{{ia161-UCO.txt}}} where '''UCO''' is your university ID. 37 39 1. Access the [https://colab.research.google.com/drive/1KSfOy8KwKQ6De45ah3JMxP0BfQa-80RD?usp=sharing Python notebook in the Google Colab environment] and make your own copy. Do not forget to save your work if you want to see your changes later, leaving the browser will throw away all changes! 38 40 1. The colab reads file {{{input.txt}}} (each line is word|definition) and outputs hypernym for each word.