Změny mezi verzí 12 a verzí 13 u NerDataset
- Časová značka:
- 30. 11. 2022 14:29:50 (před 20 měsíci)
Vysvětlivky:
- Nezměněno
- Přidáno
- Odstraněno
- Změněno
-
NerDataset
v12 v13 18 18 1. We split the sentences roughly into 80% for training (`training`), 10% for validation (`validation`), and 10% for testing (`testing`).[[BR]]For repeated testing, we subdivide the testing split (`testing_001-400` and `testing_401-500`). 19 19 20 ^1^The extra details include nested entities such as locations in person names (e.g. “Blažek z Kralup”) and people in location names (e.g. “Kostel sv. Martina”). 21 22 Use the `search.TaggedSentence.load()` function from [https://gitlab.fi.muni.cz/nlp/ahisto-modules/named-entity-search the ahisto_named_entity_search software tool] to load the `.docx` files together with the extra details. 20 ^1^The extra details include nested entities such as locations in person names (e.g. “Blažek z Kralup”) and people in location names (e.g. “Kostel sv. Martina”).[[BR]]Use the `search.TaggedSentence.load()` function from [https://gitlab.fi.muni.cz/nlp/ahisto-modules/named-entity-search the ahisto_named_entity_search software tool] to load the `.docx` files together with the extra details. 23 21 24 22 == Citing ==