List of Czech multi-word expressions (MWEs)

Description

The dataset contains 4,731 frozen continuous Czech multi-word expressions (MWEs). Inflectional word forms are generated for those MWEs where applicable. In total, the dataset contains 24,807 MWE forms.
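Because the expressions are continuous (their tokens are adjacent in text), a list of MWE forms can be matched against tokenized text with a simple longest-match lookup. The following is only a minimal sketch under stated assumptions: the dataset's file format is not described here, so the forms are passed in as plain strings; the function names `build_index` and `find_mwes` are hypothetical; and the three sample forms are illustrative and not necessarily present in the dataset.

```python
from typing import Dict, List, Tuple

Form = Tuple[str, ...]


def build_index(forms: List[str]) -> Dict[str, List[Form]]:
    """Index MWE forms by their lowercased first token, longest forms first."""
    index: Dict[str, List[Form]] = {}
    for form in forms:
        tokens = tuple(form.lower().split())
        if len(tokens) < 2:
            continue  # a single token is not a multi-word expression
        index.setdefault(tokens[0], []).append(tokens)
    for candidates in index.values():
        candidates.sort(key=len, reverse=True)  # prefer the longest match
    return index


def find_mwes(tokens: List[str], index: Dict[str, List[Form]]) -> List[Tuple[int, int, str]]:
    """Return (start, end, surface text) for each non-overlapping continuous match."""
    lowered = [t.lower() for t in tokens]
    matches = []
    i = 0
    while i < len(lowered):
        for candidate in index.get(lowered[i], []):
            end = i + len(candidate)
            if tuple(lowered[i:end]) == candidate:
                matches.append((i, end, " ".join(tokens[i:end])))
                i = end  # continue after the matched expression
                break
        else:
            i += 1  # no MWE form starts at this token
    return matches


# Illustrative forms only (assumption: not taken from the dataset itself).
sample_forms = ["na rozdíl od", "Česká republika", "České republiky"]
sample_tokens = "Na rozdíl od České republiky je Norsko monarchie".split()
print(find_mwes(sample_tokens, build_index(sample_forms)))
```

Listing every inflected form explicitly, as the dataset does, lets a lookup like this work on raw word forms without a lemmatizer; the sketch compares tokens case-insensitively and assumes whitespace tokenization.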

More information about the list can be found in: NEVĚŘILOVÁ, Zuzana. Annotation of Multi-Word Expressions in Czech Texts. In Horák, Aleš; Rychlý, Pavel; Rambousek, Adam (eds.). Ninth Workshop on Recent Advances in Slavonic Natural Language Processing. Brno: Tribun EU, 2015, pp. 103-112. ISBN 978-80-263-0974-1.

LINDAT handle

http://hdl.handle.net/11234/1-2427

Acknowledgements

This work has been partly supported by Masaryk University within the project Čeština v jednotě synchronie a diachronie – 2015 (MUNI/A/1165/2014) and by the Ministry of Education of the Czech Republic within the Czech-Norwegian Research Programme in the HaBiT Project 7F14047.

If you use the dataset, please cite the related publication as well as the LINDAT/CLARIAH infrastructure: http://hdl.handle.net/11234/1-2427.

Publication info

https://www.muni.cz/vyzkum/publikace/1320593

@inproceedings{1320593,
   author = {Nevěřilová, Zuzana},
   address = {Brno},
   booktitle = {Ninth Workshop on Recent Advances in Slavonic Natural Language Processing},
   editor = {Horák, Aleš and Rychlý, Pavel and Rambousek, Adam},
   keywords = {multi-word expressions; corpus; orthographical variants},
   howpublished = {printed version "print"},
   language = {eng},
   location = {Brno},
   isbn = {978-80-263-0974-1},
   pages = {103--112},
   publisher = {Tribun EU},
   title = {Annotation of Multi-Word Expressions in Czech Texts},
   url = {https://nlp.fi.muni.cz/raslan/2015/paper02-Neverilova.pdf},
   year = {2015}
}

License

Public Domain Mark (PD)