Changes between Version 17 and Version 18 of private/NlpInPracticeCourse/InformationExtraction
- Timestamp:
- Dec 15, 2020, 5:33:00 PM (3 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
private/NlpInPracticeCourse/InformationExtraction
v17 v18 31 31 32 32 1. Create {{{<YOUR_FILE>}}}, a text file named {{{ia161-UCO-08.txt}}} where '''UCO''' is your university ID. 33 1. Download and install GATE (Java 8 is necessary) from https://gate.ac.uk/download/ 33 1. Download and install GATE (Java 8 is necessary) from https://gate.ac.uk/download/. Either run the MS installer or the Java installer, install and run as app or in the command line: 34 34 {{{ 35 35 java -jar gate-<VERSION>-installer.jar … … 53 53 1. Observe the annotated results, click on a document, then `Annotation Sets` and/or `Annotation List`. 54 54 55 So far, GATE did not much more than Stanford NER in lecture 04. Note, however, that all tokens are annotated and POS-tagged. Also note the annotation type Lookup.55 So far, GATE did not much more than Stanford NER. Note, however, that all tokens are annotated and POS-tagged. Also note the annotation type Lookup. 56 56 57 57 We add rules for extracting ''job titles'' and the respective ''person names''. The rules are defined in the grammars [raw-attachment:jobtitle.jape] and [raw-attachment:jobtitleperson.jape]