= Text Characteristics =

== Keyword extraction ==

[[Image(example.png)]]

'''Definition'''

Words used to characterise the contents of a document.

'''Method'''

Select words that appear with statistically unusual frequency in a text.

'''Applications'''

 * Text classification (topic, spam)
 * Search Engine Optimisation (SEO)
 * Text filtering (job advertising, RSS)
 * Text summarization
 * Text clustering and reorganization

[[Image(seo.png)]]

== Communication Pattern Analysis ==

[[Image(text_characteristics.png)]]

'''Motivation'''

 * Analysis of personality traits from an author’s verbal style
 * Optimize communication strategies
 * Behaviour prediction

[[Image(applications.png)]]

== Author’s traits ==

[[Image(vocabulary.png)]]

== Problem Definition ==

[[Image(auth_ver.png)]]

[[Image(auth_att.png)]]

[[Image(auth_clus.png)]]

== Author !Writeprint/Stylom ==

[[Image(collection.png)]]

== Authorship Verification ==

[[Image(stylometry.png)]]

== Machine learning approach ==

[[Image(simML.png)]]

== Accuracy ==

[[Image(verification.png)]]

== Conclusions ==

'''Keyword Extraction'''

A brief representation of the content of a document.

'''Communication Pattern Analysis'''

An analysis of personality traits.

'''Authorship Recognition'''

Uncovering the authorship of anonymous texts.
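
'''Example (sketch)'''

The keyword extraction method described earlier, selecting words that appear with statistically unusual frequency, can be illustrated with a minimal sketch. It rests on assumptions not stated in this page: "unusual" is measured as a smoothed log-ratio of a word's relative frequency in the document against a reference text, and the names `tokenize`, `extract_keywords`, `top_n` and `min_count` are hypothetical. Other statistics, such as TF-IDF or a chi-squared test, would fit the same description.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def extract_keywords(document, reference, top_n=5, min_count=2):
    """Rank words by how much more frequent they are in `document`
    than in `reference` (smoothed log-ratio of relative frequencies).

    Illustrative assumption: the page does not say which statistic is used.
    """
    doc_counts = Counter(tokenize(document))
    ref_counts = Counter(tokenize(reference))
    doc_total = sum(doc_counts.values())
    ref_total = sum(ref_counts.values())
    vocab_size = len(set(doc_counts) | set(ref_counts))

    scores = {}
    for word, count in doc_counts.items():
        if count < min_count:
            continue  # too rare in the document to judge reliably
        # Add-one smoothing keeps the ratio finite for words
        # that never occur in the reference text.
        p_doc = (count + 1) / (doc_total + vocab_size)
        p_ref = (ref_counts[word] + 1) / (ref_total + vocab_size)
        scores[word] = math.log(p_doc / p_ref)

    # Highest scores first: words most over-represented in the document.
    return sorted(scores, key=scores.get, reverse=True)[:top_n]


document = "the spam filter marks spam mail and the filter learns from spam"
reference = "the cat sat on the mat and the dog sat on the rug and the bird sang"
keywords = extract_keywords(document, reference, top_n=2)
print(keywords)
```

Common words such as "the" are frequent in both texts, so their ratio stays low and they drop out of the ranking without a separate stop-word list. In practice the reference would be a large general corpus, not a single sentence.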