Changes between Version 6 and Version 7 of private/NlpInPracticeCourse/Stylometry
- Timestamp:
- Dec 6, 2015, 9:03:17 PM (8 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
private/NlpInPracticeCourse/Stylometry
v6 v7 7 7 == State of the Art == 8 8 9 The analysis of author's characteristic 10 writing style and vocabulary has been used to uncover author's traits such as authorship, age, or gender 11 documents by both manual linguistic approaches and automatic algorithmic methods. 12 13 The most common approach to stylometry problems 14 is to combine stylistic analysis with machine learning techniques: 15 1) specific style markers are extracted, 16 2) a classification procedure is applied to extracted markers 17 18 9 19 === References === 10 11 Approx 3 current papers (preferably from best NLP conferences/journals, eg. [[https://www.aclweb.org/anthology/|ACL Anthology]]) that will be used as a source for the one-hour lecture:12 20 13 21 1. Stamatatos, E. (2009), A Survey of Modern Authorship Attribution Methods (2009), Journal of the American Society for Information Science and Technology, 60(3), 538-556. [[http://www.clips.ua.ac.be/~walter/educational/material/Stamatatos_survey2009.pdf | pdf]] … … 17 25 == Practical Session == 18 26 27 Student will have to 19 28 Concrete description of work assignment for students for the second one-hour part of the lecture. The work will consist of tasks connected with practical implementations of algorithms connected with the current topic (probably not the state-of-the-art algorithms mentioned in the first part) and with real data. Students can test the algorithms, evaluate them and possibly try some short adaptations for various subtasks. 20 29