2 | | This is an open dataset of scanned images and OCR texts from 19th and 20th century letterpress reprints of documents from the Hussite era. The dataset contains human annotations for layout analysis, OCR evaluation, and language identification. You can [http://hdl.handle.net/11234/1-4615 download the dataset in the LINDAT/CLARIAH-CZ repository]. |
| 2 | This is an open dataset of scanned images and OCR texts from 19th and 20th century letterpress reprints of documents from the Hussite era. The dataset contains human annotations for layout analysis, OCR evaluation, and language identification. |
| 3 | |
| 4 | You can [http://hdl.handle.net/11234/1-4615 download the dataset in the LINDAT/CLARIAH-CZ repository]. |