Changes between Version 19 and Version 20 of private/NlpInPracticeCourse/NamedEntityRecognition


Ignore:
Timestamp:
Oct 2, 2019, 10:30:46 AM (5 years ago)
Author:
Zuzana Nevěřilová
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • private/NlpInPracticeCourse/NamedEntityRecognition

    v19 v20  
    7575 You should see results like:
    7676{{{
    77 CRFClassifier tagged 12120 words in 441 documents at 8145.16 words per second.
     77CRFClassifier tagged 19993 words in 900 documents at 2388.94 words per second.
    7878         Entity P       R       F1      TP      FP      FN
    79        LOCATION 0.7962  0.7849  0.7905  332     85      91
    80    ORGANIZATION 0.7059  0.6019  0.6497  192     80      127
    81          PERSON 0.8062  0.8592  0.8319  470     113     77
    82          Totals 0.7814  0.7711  0.7763  994     278     295
     79            LOC 0.7064  0.7586  0.7316  308     128     98
     80            ORG 0.6943  0.5576  0.6185  184     81      146
     81          OTHER 0.6224  0.6498  0.6358  590     358     318
     82            PER 0.7727  0.8236  0.7974  425     125     91
     83         Totals 0.6853  0.6977  0.6914  1507    692     653
    8384}}}
    8485 In the output, the first column is the input tokens, the second column is the correct (gold) answers. Observe the differences. Copy the training result to `<YOUR_FILE>`. Try to estimate in how many cases the model missed an entity, detected incorrectly the boundaries, or classified an entity incorrectly.