Natural Language Processing Centre
The research in Natural Language Processing Centre at the Faculty of Informatics, Masaryk University, Brno, is devoted to the following areas:
Lexical databases with relation to knowledge representation (Czech WordNet and Czech Lexical Database). Tools for viewing and editing dictionaries represented in XML format.
- Improvement and further development of the tools for morphological analysis of Czech, solving the word derivation problems.
- Development of syntactic parsers (partial and general) for Czech and their exploitation as disambiguators.
- The problems of syntactic and semantic analysis based on Transparent Intensional Logic.
- Building corpora, tagging and disambiguating corpus text, tools for corpus modification, maintenance, corpus managers and graphical interface for corpora.
- Exploring techniques and methods of machine learning for the purpose of disambiguation of corpus data.
Essential educational goal of the NLP Centre. is to offer interesting research opportunities both for pregradual and postgradual students and train them within the new field of language engineering.
Recently (2013), MIT Press published a book by Patrick Hanks with the title Lexical Analysis: Norms and Exploitations. Patrick Hanks worked in NLP Centre at FI MU in the past (2006-2008) under the grant by the Czech Academy of Sciences. In his book, Patrick Hanks offers a wide-ranging empirical investigation of word use and meaning in language. The book fills the need for a lexically based, corpus-driven theoretical approach that will help people understand how words go together in collocational patterns and constructions to make meanings. Lexical Analysis: Norms and Exploitations