Čeština
English
  • Vítejte na stránkách NLP Centra!
  • Zapojte se do vývoje softwarových nástrojů!
  • Analýza přirozeného jazyka
  • Vyzkoušejte si korpusy o velikosti knihoven online!
  • Studujte jednu ze specializací!
  • Členové laboratoře

Text Characteristics

Keyword extraction

Definition Words used to characterise the contents of a document.

Method Select words that appear with statistically unusual frequency in a text

Applications

  • Text classification (topic, spam)
  • Search Engine Optimisation (SEO)
  • Text filtering (job advertising, RSS)
  • Text summarization
  • Text clustering and reorganization

Communication Pattern Analysis

Motivation

  • Analysis of personality traits using author’s verbal style
  • Optimize communication strategies
  • Behaviour prediction

Author’s traits

Problem Definition

Author Writeprint/Stylom

Authorship Verification

Machine learning approach

Accuracy

Conclusions

Keyword Extraction A Brief representation of the content of a document.

Communication Pattern Analysis An analysis of personality traits.

Authorship Recognition An uncovering authorship of anonymous texts.