Context Navigation

NamedEntityRecognition

-                      v19
+                      v20
  You should see results like:
 {{{
 CRFClassifier tagged 12120 words in 441 documents at 8145.16 words per second.
+CRFClassifier tagged 19993 words in 900 documents at 2388.94 words per second.
          Entity P       R       F1      TP      FP      FN
+       LOCATION 0.7962  0.7849  0.7905  332     85      91
+   ORGANIZATION 0.7059  0.6019  0.6497  192     80      127
+         PERSON 0.8062  0.8592  0.8319  470     113     77
+         Totals 0.7814  0.7711  0.7763  994     278     295
+            LOC 0.7064  0.7586  0.7316  308     128     98
+            ORG 0.6943  0.5576  0.6185  184     81      146
+          OTHER 0.6224  0.6498  0.6358  590     358     318
+            PER 0.7727  0.8236  0.7974  425     125     91
+         Totals 0.6853  0.6977  0.6914  1507    692     653
 }}}
  In the output, the first column is the input tokens, the second column is the correct (gold) answers. Observe the differences. Copy the training result to `<YOUR_FILE>`. Try to estimate in how many cases the model missed an entity, detected incorrectly the boundaries, or classified an entity incorrectly.