| 1 | == Language Technologies For Business == |
| 2 | |
| 3 | * '''automatic machine translation''' for specific domains |
| 4 | * description of products, manuals, warranties ... |
| 5 | * '''knowledge mining''', especially from internet text documents |
| 6 | * information analysis of texts |
| 7 | * knowledge indexing |
| 8 | * fuzzy searching (I look for a mid-size bike) |
| 9 | * clustering search results on people, institution, places |
| 10 | * clustering search results on products and their attributes |
| 11 | * '''effective indexing and searching in large data''' |
| 12 | * parallel processing |
| 13 | * continous web crawling |
| 14 | * information extraction from user behavior |
| 15 | * '''intelligent processing of web pages''' |
| 16 | * data cleaning, spam detection, plagiarism detection |
| 17 | * detection of generated content |
| 18 | * innapropriate discussion posts detection |
| 19 | * web pages classification |
| 20 | * keyword extraction |
| 21 | * text summarization |
| 22 | * opinion mining |
| 23 | * '''error correction''' |
| 24 | * spell checkers, grammar checkers, style checkers |
| 25 | * diacritics restoration |
| 26 | * '''language tools''' |
| 27 | * lemmatization, stemming |
| 28 | * parsing, part of speech tagging |
| 29 | * named entities recognition, abbreviation meaning expansion |
| 30 | * anaphora and co-reference resolution |
| 31 | * automatically generated thesauri, synonym dictionaries |
| 32 | * definition extraction, examples of use |
| 33 | |
| 34 | |