Změny mezi verzí 37 a verzí 38 u NerDataset
- Časová značka:
- 25. 5. 2023 11:40:43 (před 14 měsíci)
Vysvětlivky:
- Nezměněno
- Přidáno
- Odstraněno
- Změněno
-
NerDataset
v37 v38 146 146 147 147 == Corpus == 148 The file [https://nlp.fi.muni.cz/projekty/ahisto/ner-dataset/corpus.vert.gz corpus.vert.gz] (1.3G compressed) contains [https://www.sketchengine.eu/my_keywords/vertical/ a vertical file] with the results of optical character recognition, named entity recognition, language identification, and lemmatization.[[BR]]See also [https://nlp.fi.muni.cz/projekty/ahisto/ner-dataset/corpus.schema the schema of the vertical file]. ''(Warning: The corpus is a work in progress and may change. Last modified: 2023-03-09)''[=#corpus.vert]148 The file [https://nlp.fi.muni.cz/projekty/ahisto/ner-dataset/corpus.vert.gz corpus.vert.gz] (1.3G compressed) contains [https://www.sketchengine.eu/my_keywords/vertical/ a vertical file] with the results of optical character recognition, named entity recognition, language identification, and lemmatization.[[BR]]See also [https://gitlab.fi.muni.cz/nlp/ahisto/-/blob/master/corpus/make_corpus/ahisto_bbox the schema of the vertical file]. ''(Warning: The corpus is a work in progress and may change. Last modified: 2023-05-25)''[=#corpus.vert] 149 149 150 150 == Citing == 151 An article describing our dataset is currently under review. Preprint is available 151 An article describing our dataset is currently under review. Preprint is available [mailto:witiko@mail.muni.cz on request].