Changes between Version 8 and Version 9 of OcrDataset
- Timestamp:
- 30 Nov 2021, 10:29:44 (3 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
OcrDataset
v8  v9
1   1   = A Human-Annotated Dataset of Scanned Images and OCR Texts from Medieval Documents =
2       This is an open dataset of scanned images and OCR texts from 19th and 20th century letterpress reprints of documents from the Hussite era. The dataset contains human annotations for layout analysis, OCR evaluation, and language identification. Y iu can [https://nlp.fi.muni.cz/projekty/ahisto/dataset.zip download the dataset here].
    2   This is an open dataset of scanned images and OCR texts from 19th and 20th century letterpress reprints of documents from the Hussite era. The dataset contains human annotations for layout analysis, OCR evaluation, and language identification. You can [https://nlp.fi.muni.cz/projekty/ahisto/dataset.zip download the dataset here].
3   3
4   4   == Contents ==
…   …
18  18  If you use our dataset in your work, please cite the following article:
19  19
20      Novotný, V., Seidlová, K., Vrabcová, T., Horák, A.: When Tesseract Brings Friends: Layout Analysis, Language Identification, and Super-Resolution in the Optical Character Recognition of Medieval Texts. In: Horák, A., Rychlý, P., Rambousek, A. (eds.) '' Proceedings of Recent Advances in Slavonic Natural Language Processing, RASLAN 2021'' . pp. 91–100. ISSN 2336-4289. ISBN 978-80-263-1600-8. Tribun EU (2021). Available also from WWW: https://nlp.fi.muni.cz/raslan/2021/paper10.pdf.
    20  Novotný, V., Seidlová, K., Vrabcová, T., Horák, A.: When Tesseract Brings Friends: Layout Analysis, Language Identification, and Super-Resolution in the Optical Character Recognition of Medieval Texts. In: Horák, A., Rychlý, P., Rambousek, A. (eds.) '' Proceedings of Recent Advances in Slavonic Natural Language Processing, RASLAN 2021'' . pp. 91–100. ISSN 2336-4289. ISBN 978-80-263-1600-8. Tribun EU (2021). Available also from WWW: https://nlp.fi.muni.cz/raslan/2021/paper10.pdf .
21  21
22  22  If you use LaTeX, you can use the following BibTeX entry: