Changes between version 38 and version 39 of NerDataset
- Timestamp:
- 25 May 2023, 11:43:03 (14 months ago)
NerDataset
Line 147 (unchanged): == Corpus ==

Line 148 (v38): The file [https://nlp.fi.muni.cz/projekty/ahisto/ner-dataset/corpus.vert.gz corpus.vert.gz] (1.3G compressed) contains [https://www.sketchengine.eu/my_keywords/vertical/ a vertical file] with the results of optical character recognition, named entity recognition, language identification, and lemmatization .[[BR]]See also[https://gitlab.fi.muni.cz/nlp/ahisto/-/blob/master/corpus/make_corpus/ahisto_bbox the schema of the vertical file]. ''(Warning: The corpus is a work in progress and may change. Last modified: 2023-05-25)''[=#corpus.vert]

Line 148 (v39): The file [https://nlp.fi.muni.cz/projekty/ahisto/ner-dataset/corpus.vert.gz corpus.vert.gz] (1.3G compressed) contains [https://www.sketchengine.eu/my_keywords/vertical/ a vertical file] with the results of optical character recognition, named entity recognition, language identification, and lemmatization on all books in the AHISTO project database. See also [https://gitlab.fi.muni.cz/nlp/ahisto/-/blob/master/corpus/make_corpus/ahisto_bbox the schema of the vertical file]. ''(Warning: The corpus is a work in progress and may change. Last modified: 2023-05-25)''[=#corpus.vert]

Line 150 (unchanged): == Citing ==