wiki:en/TextCharacteristics

Version 6 (modified by Ales Horak, 7 months ago) (diff)

edited by hales in edit_page_in_vim.py

Text Characteristics

Keyword extraction

Definition Words used to characterise the contents of a document.

Method Select words that appear with statistically unusual frequency in a text

Applications

  • Text classification (topic, spam)
  • Search Engine Optimisation (SEO)
  • Text filtering (job advertising, RSS)
  • Text summarization
  • Text clustering and reorganization

Communication Pattern Analysis

Motivation

  • Analysis of personality traits using author’s verbal style
  • Optimize communication strategies
  • Behaviour prediction

Author’s traits

Problem Definition

Author Writeprint/Stylom

Authorship Verification

Machine learning approach

Accuracy

Conclusions

Keyword Extraction A Brief representation of the content of a document.

Communication Pattern Analysis An analysis of personality traits.

Authorship Recognition An uncovering authorship of anonymous texts.

Attachments (12)

Download all attachments as: .zip