Změny mezi verzí 37 a verzí 38 u NerDataset


Ignorovat:
Časová značka:
25. 5. 2023 11:40:43 (před 14 měsíci)
Autor:
xnovot32@fi.muni.cz
Komentář:

--

Vysvětlivky:

Nezměněno
Přidáno
Odstraněno
Změněno
  • NerDataset

    v37 v38  
    146146
    147147== Corpus ==
    148 The file [https://nlp.fi.muni.cz/projekty/ahisto/ner-dataset/corpus.vert.gz corpus.vert.gz] (1.3G compressed) contains [https://www.sketchengine.eu/my_keywords/vertical/ a vertical file] with the results of optical character recognition, named entity recognition, language identification, and lemmatization.[[BR]]See also [https://nlp.fi.muni.cz/projekty/ahisto/ner-dataset/corpus.schema the schema of the vertical file]. ''(Warning: The corpus is a work in progress and may change. Last modified: 2023-03-09)''[=#corpus.vert]
     148The file [https://nlp.fi.muni.cz/projekty/ahisto/ner-dataset/corpus.vert.gz corpus.vert.gz] (1.3G compressed) contains [https://www.sketchengine.eu/my_keywords/vertical/ a vertical file] with the results of optical character recognition, named entity recognition, language identification, and lemmatization.[[BR]]See also [https://gitlab.fi.muni.cz/nlp/ahisto/-/blob/master/corpus/make_corpus/ahisto_bbox the schema of the vertical file]. ''(Warning: The corpus is a work in progress and may change. Last modified: 2023-05-25)''[=#corpus.vert]
    149149
    150150== Citing ==
    151 An article describing our dataset is currently under review. Preprint is available [mailto:witiko@mail.muni.cz on request].
     151An article describing our dataset is currently under review. Preprint is available [mailto:witiko@mail.muni.cz on request].