Changes between Version 12 and Version 13 of private/NlpInPracticeCourse/InformationExtraction


Ignore:
Timestamp:
Nov 15, 2017, 7:15:05 PM (6 years ago)
Author:
Zuzana Nevěřilová
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • private/NlpInPracticeCourse/InformationExtraction

    v12 v13  
    33[[https://is.muni.cz/auth/predmet/fi/ia161|IA161]] [[en/AdvancedNlpCourse|Advanced NLP Course]], Course Guarantee: Aleš Horák
    44
    5 Prepared by: Vojtěch Kovář
     5Prepared by: Zuzana Nevěřilová
    66
    77
     
    3838   * change {{{markupAware}}} to {{{false}}}
    3939   * change {{{sourceUrl}}} to {{{stringContent}}} and paste some news text
    40    * you can find three sample texts here:
     40   * you can find three sample texts here: [raw-attachment:text1.txt text1.txt], [raw-attachment:text2.txt text2.txt], [raw-attachment:text3.txt text3.txt]
    4141 1. Create corpus:
    4242   * right click on Language !Resources/New/GATE Corpus in the left menu
     
    4747So far, GATE did not much more than Stanford NER in lecture 04. Note, however, that all tokens are annotated and POS-tagged.
    4848
    49 We add rules for extracting job titles and the respective person names:
     49We add rules for extracting job titles and the respective person names. The rules are defined in the grammars [raw-attachment:jobtitle.jape] and [raw-attachment:jobtitleperson.jape]
    5050
    5151 1. Right click Processing !Resources/New/JAPE Transducer in the left menu
     
    5656 1. Observer the grammars {{{jobtitle.jape}}} and {{{jobtitleperson.jape}}}
    5757
    58 Add new grammar {{{jobtitleperson.jape}}} and observe the results.
     58Add new transducer with the grammar {{{jobtitleperson.jape}}} and observe the results.
    5959
    6060Optionally, you can add further documents and observe how universal the {{{jobtitleperson.jape}}} grammar is.
    6161
    6262Write your observations to {{{<YOUR_FILE>}}}.
    63 
    64 
    65 You may modify or draw inspiration from [raw-attachment:demo.py this demo script].