Changes between Version 15 and Version 16 of private/NlpInPracticeCourse/NamedEntityRecognition
- Timestamp: Oct 9, 2017, 11:37:57 AM (7 years ago)
private/NlpInPracticeCourse/NamedEntityRecognition
--- v15
+++ v16
@@ -48,5 +48,6 @@
 1. convert the train data to the Stanford NER format:
 {{{
-python convert_cnec_stanford.py cnec2.0/data/xml/named_ent_train.xml > named_ent_train.tsv
+python convert_cnec_stanford.py cnec2.0/data/xml/named_ent_train.xml \
+    > named_ent_train.tsv
 }}}
 
@@ -55,14 +56,17 @@
 1. train the model using the default settings (cnec.prop), N.B. that the `convert_cnec_stanford.py` only recognizes PERSON, LOCATION and ORGANIZATION, you can extend the markup conversion later:
 {{{
-java -cp stanford-ner.jar edu.stanford.nlp.ie.crf.CRFClassifier -prop cnec.prop
+java -cp stanford-ner.jar edu.stanford.nlp.ie.crf.CRFClassifier \
+    -prop cnec.prop
 }}}
 1. convert the test data to the Stanford NER format:
 {{{
-python convert_cnec_stanford.py named_ent_dtest.xml > named_ent_dtest.tsv
+python convert_cnec_stanford.py named_ent_dtest.xml \
+    > named_ent_dtest.tsv
 }}}
 1. evaluate the model on `dtest`:
 {{{
 java -cp stanford-ner.jar edu.stanford.nlp.ie.crf.CRFClassifier \
-    -loadClassifier cnec-3class-model.ser.gz -testFile named_ent_dtest.tsv
+    -loadClassifier cnec-3class-model.ser.gz \
+    -testFile named_ent_dtest.tsv
 }}}
 
@@ -80,5 +84,6 @@
 {{{
 java -cp stanford-ner.jar edu.stanford.nlp.ie.crf.CRFClassifier \
-    -loadClassifier cnec-3class-model.ser.gz -testFile named_ent_dtest_unknown.tsv
+    -loadClassifier cnec-3class-model.ser.gz \
+    -testFile named_ent_dtest_unknown.tsv
 }}}
 
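
For orientation, the `.tsv` files passed to `-testFile` in the steps above are tab-separated token/label files. The following is a minimal sketch of writing such a file, assuming the common Stanford NER column layout (one token per line, a tab, then its label, with `O` for tokens outside any entity, and a blank line between sentences). The function name `write_stanford_tsv` and the sample sentence are hypothetical; this is not the course's `convert_cnec_stanford.py`, and the actual column mapping is whatever `cnec.prop` defines.

```python
import io


def write_stanford_tsv(sentences, out):
    """Write (token, label) pairs as token<TAB>label lines.

    Hypothetical helper for illustration only. Assumes one token per
    line and a blank line after each sentence.
    """
    for sentence in sentences:
        for token, label in sentence:
            out.write(f"{token}\t{label}\n")
        out.write("\n")  # blank line closes the sentence


# Illustrative data only, restricted to the three classes the conversion
# script is said to recognize (PERSON, LOCATION, ORGANIZATION) plus O.
example = [[
    ("Václav", "PERSON"),
    ("Havel", "PERSON"),
    ("navštívil", "O"),
    ("Brno", "LOCATION"),
    (".", "O"),
]]

buf = io.StringIO()
write_stanford_tsv(example, buf)
print(buf.getvalue(), end="")
```

Writing to a file-like object rather than a path keeps the sketch usable with stdout redirection, which matches how the conversion commands above use `>`.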