== Language Resources (work in progress) ==

Language resources are needed for further NLP applications:

 * synonym dictionary - fuzzy searching
   * over 23000 entries, with over 56000 synonyms
   * Czech Wordnet - 85592 words organized in 40919 synonym sets
 * translation dictionary - multilingual searching
   * Czech-English dictionary - 54000 entries
   * interconnected wordnets (!EuroWordnet, Balkanet) - Czech, English, Dutch, Italian, Spanish, French, Greek, Polish, Romanian, Turkish (at least 8500 common synonym sets)
 * vulgar words dictionary - detection of inappropriate behavior in discussions
 * other: dictionary of toponyms? ancient surnames, genealogy? gestures, artworks...?
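
As a rough illustration of how a synonym dictionary could support fuzzy searching, the sketch below expands each query term with its synonyms before matching. It is only a minimal sketch under stated assumptions: the `SYNONYMS` table is hypothetical toy data in English, and `expand_query` and `search` are illustrative helper names, not part of any of the resources listed above.

```python
from typing import Dict, Iterable, List, Set

# Hypothetical toy data; a real table would be loaded from a synonym dictionary.
SYNONYMS: Dict[str, Set[str]] = {
    "car": {"automobile", "auto"},
    "fast": {"quick", "rapid"},
}


def expand_query(terms: Iterable[str], synonyms: Dict[str, Set[str]]) -> List[str]:
    """Return the lowercased terms followed by their synonyms, without duplicates."""
    expanded: List[str] = []
    seen: Set[str] = set()
    for term in terms:
        key = term.lower()
        # The original term comes first, then its synonyms in alphabetical order.
        for candidate in [key, *sorted(synonyms.get(key, ()))]:
            if candidate not in seen:
                seen.add(candidate)
                expanded.append(candidate)
    return expanded


def search(documents: Iterable[str], terms: Iterable[str],
           synonyms: Dict[str, Set[str]]) -> List[str]:
    """Return documents containing at least one expanded query term as a whole word."""
    wanted = set(expand_query(terms, synonyms))
    return [doc for doc in documents if wanted & set(doc.lower().split())]
```

For example, `search(["A quick automobile", "A slow train"], ["car"], SYNONYMS)` matches the first document even though it never contains the word "car". A real deployment for Czech would presumably also need lemmatization, since whitespace tokenization alone does not handle inflected word forms.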