Natural Language Processing Centre
Faculty of Informatics, Masaryk University
Brno, Czech Republic
Brighton, UK & Brno, Czech Republic
I am a researcher in computational linguistics, especially interested in
corpus processing, language modelling and syntactic analysis (mainly
English and Czech), and their real-world applications in language technology.
Since 2006, I have worked as a specialist at the Natural Language
Processing Centre, Faculty of Informatics, Masaryk University, in Brno.
At the university,
I was involved in a collaboration with Lexical Computing, a research company founded
by Adam Kilgarriff that
develops the Sketch Engine corpus
querying and management system. Since 2012, I have also worked part-time for the company.
In 2014 I defended my PhD thesis on
application-driven syntactic analysis, also in relation to corpus analysis.
My other publications are mainly on methods of corpus analysis, parsing
and its applications.
I am also a musician, playing violin in Musica Moravica.
Kilgarriff, A., Baisa, V., Bušta, J., Jakubíček, M., Kovář, V., Michelfeit, J., Rychlý, P. and Suchomel, V. The Sketch Engine: ten years on.
Lexicography, Springer Berlin/Heidelberg, 2014, vol. 1, No 1, p. 7-36.
Kovář, V. Partial Grammar Checking for Czech Using the SET Parser.
In 17th International Conference, TSD 2014. 1st ed. Berlin/Heidelberg: Springer Verlag, 2014. p. 308-314.
Jakubíček, M., Kilgarriff, A., Kovář, V., Rychlý, P. and Suchomel, V. The TenTen Corpus Family.
In 7th International Corpus Linguistics Conference CL 2013. Lancaster, 2013. p. 125-127.
Kovář, V., Horák, A. and Jakubíček, M. Syntactic Analysis Using Finite Patterns: A New Parsing System for Czech.
In Human Language Technology. Challenges for Computer Science and Linguistics. Berlin/Heidelberg: Springer, 2011. p. 161-171.
Jakubíček, M., Kovář, V. and Grác, M. Through Low-Cost Annotation to Reliable Parsing Evaluation.
In PACLIC 24 Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation. Tokyo: Waseda University, 2010. p. 555-562.
Past and current research projects
- LINDAT/Clarin project (research team member since 2013)
- co-organizing workshops in computer lexicography and corpus linguistics: Lexicom, SKEW (since 2010)
- consultancy in corpus linguistics and corpus technology on behalf of Lexical Computing for major UK publishing houses (Macmillan, Oxford University Press, Cambridge University Press, HarperCollins)
- PRESEMT project (EU FP7/2007-2013, research team member)
- Temporal aspects of logic and information projects (research team member, 2008-2013)