1 | | = AHISTO NER dataset = |
| 1 | = A Human-Annotated Dataset for Language Modeling and Named Entity Recognition in Medieval Documents = |
| 2 | This is an open dataset of sentences from 19th and 20th century letterpress reprints of documents from the Hussite era. The dataset contains human annotations for named entity recognition (NER). |
| 3 | |
| 4 | You can download the dataset in the LINDAT/CLARIAH-CZ repository. |
| 5 | |
| 6 | == Contents == |
| 7 | == Citing == |
| 8 | If you use our dataset in your work, please cite the following article: |
| 9 | |
| 10 | TODO |
| 11 | |
| 12 | If you use LaTeX, you can use the following BibTeX entry: |
| 13 | |
| 14 | {{{ |
| 15 | TODO |
| 16 | }}} |
| 17 | |
| 18 | == Acknowledgements == |
| 19 | This work was funded by TAČR Éta, [https://starfos.tacr.cz/en/project/TL03000365 project number TL03000365]. |