wiki:en/LanguageResources

Version 3 (modified by xrambous, 8 years ago) (diff)

--

Language Resources (work in progress)

Language resources are needed for further NLP applications:

  • synonym dictionary - fuzzy searching
    • over 23000 entries, with over 56000 synonyms
    • Czech Wordnet - 85592 words organized in 40919 synonym sets
  • translation dictionary - multilingual searching
    • Czech-English dictionary - 54000 entries
    • interconnected wordnets (EuroWordnet?, Balkanet) - Czech, English, French, Greek, Polish, Romanian, Turkish
  • vulgar words dictionary - detection of inappropriate behavior in discussions
  • other: dictionary of toponyms? ancient surnames, genealogy? gestures, artworks...?

Attachments (8)

Download all attachments as: .zip