Changes between Version 32 and Version 33 of OcrDataset


Timestamp:
2023-05-25 23:22:26 (15 months ago)
Author:
xnovot32@fi.muni.cz
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • OcrDataset

    v32 v33
    2   2   This is an open dataset of scanned images and OCR texts from 19th and 20th century letterpress reprints of documents from the Hussite era. The dataset contains human annotations for layout analysis, OCR evaluation, and language identification.
    3   3
    4       You can download [https://hdl.handle.net/11234/1-4615 the dataset from 2021] and [https://hdl.handle.net/11234/1-4935 supplementary materials from 2022] from the LINDAT/CLARIAH-CZ repository.
        4   You can download [https://hdl.handle.net/11234/1-4615 the dataset from 2021] and [https://hdl.handle.net/11234/1-4935 supplementary materials from 2022] from the LINDAT/CLARIAH-CZ repository.
    5   5
    6   6   == Contents ==
     
    32  32  If you use our dataset in your work, please cite the following articles:
    33  33
    34        Novotný, V., Seidlová, K., Vrabcová, T., Horák, A.: When Tesseract Brings Friends: Layout Analysis, Language Identification, and Super-Resolution in the Optical Character Recognition of Medieval Texts. In: Horák, A., Rychlý, P., Rambousek, A. (eds.) ''Proceedings of Recent Advances in Slavonic Natural Language Processing, RASLAN 2021''. pp. 91–100. ISSN 2336-4289. ISBN 978-80-263-1600-8. Tribun EU (2021). Available also from WWW: https://nlp.fi.muni.cz/raslan/2021/paper10.pdf
        34    Novotný, V., Seidlová, K., Vrabcová, T., Horák, A.: When Tesseract Brings Friends: Layout Analysis, Language Identification, and Super-Resolution in the Optical Character Recognition of Medieval Texts. In: Horák, A., Rychlý, P., Rambousek, A. (eds.) ''Proceedings of Recent Advances in Slavonic Natural Language Processing, RASLAN 2021''. pp. 91–100. ISSN 2336-4289. ISBN 978-80-263-1600-8. Tribun EU (2021). Available also from WWW: https://nlp.fi.muni.cz/raslan/2021/paper10.pdf
    35  35
    36        Novotný, V., Horák, A.: When Tesseract Meets PERO: Open-Source Optical Character Recognition of Medieval Texts. In: Horák, A., Rychlý, P., Rambousek, A. (eds.) ''Proceedings of Recent Advances in Slavonic Natural Language Processing, RASLAN 2022''. pp. 157–160. ISSN 2336-4289. ISBN 978-80-263-1752-4. Tribun EU (2022). Available also from WWW: https://nlp.fi.muni.cz/raslan/2022/paper12.pdf
        36    Novotný, V., Horák, A.: When Tesseract Meets PERO: Open-Source Optical Character Recognition of Medieval Texts. In: Horák, A., Rychlý, P., Rambousek, A. (eds.) ''Proceedings of Recent Advances in Slavonic Natural Language Processing, RASLAN 2022''. pp. 157–160. ISSN 2336-4289. ISBN 978-80-263-1752-4. Tribun EU (2022). Available also from WWW: https://nlp.fi.muni.cz/raslan/2022/paper12.pdf
    37  37
    38  38  If you use LaTeX, you can use the following BibTeX entries:
     
    52  52    year = {2021},
    53  53    issn = {2336-4289},
    54        isbn = {978-80-263-1600-8},
        54    isbn = {978-80-263-1670-1},
    55  55    url = {https://nlp.fi.muni.cz/raslan/2021/paper10.pdf},
    56  56  }