Změny mezi verzí 13 a verzí 14 u NerDataset
- Časová značka:
- 30. 11. 2022 14:31:30 (před 20 měsíci)
Vysvětlivky:
- Nezměněno
- Přidáno
- Odstraněno
- Změněno
-
NerDataset
v13 v14 18 18 1. We split the sentences roughly into 80% for training (`training`), 10% for validation (`validation`), and 10% for testing (`testing`).[[BR]]For repeated testing, we subdivide the testing split (`testing_001-400` and `testing_401-500`). 19 19 20 ^1^The extra details include nested entities such as locations in person names (e.g. “Blažek z Kralup”) and people in location names (e.g. “Kostel sv. Martina”).[[BR]]Use the ` search.TaggedSentence.load()` function from [https://gitlab.fi.muni.cz/nlp/ahisto-modules/named-entity-search the ahisto_named_entity_search software tool] to load the `.docx` files togetherwith the extra details.20 ^1^The extra details include nested entities such as locations in person names (e.g. “Blažek z Kralup”) and people in location names (e.g. “Kostel sv. Martina”).[[BR]]Use the `ahisto_named_entity_search.search.TaggedSentence.load()` function from [https://gitlab.fi.muni.cz/nlp/ahisto-modules/named-entity-search the ahisto-named-entity-search software tool] to load the `.docx` files with the extra details. 21 21 22 22 == Citing ==