Changes between Version 4 and Version 5 of private/NlpInPracticeCourse/LanguageResourcesFromWeb
- Timestamp:
- Oct 19, 2015, 1:26:42 PM (9 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
private/NlpInPracticeCourse/LanguageResourcesFromWeb
v4 v5 11 11 12 12 === References === 13 14 Approx 3 current papers (preferably from best NLP conferences/journals, eg. [[https://www.aclweb.org/anthology/|ACL Anthology]]) that will be used as a source for the one-hour lecture:15 16 13 1. Chapters 19 and 20 from C. D. Manning et al. "Introduction to Information Retrieval". Cambridge University Press, 2008. 17 14 1. Pomikálek, Jan. "Removing boilerplate and duplicate content from web corpora." Dissertation thesis. Masaryk University, 2011. 18 1. HaCohen-Kerner, Yaakov, Aharon Tayeb, and Natan Ben-Dror. "Detection of simple plagiarism in computer science papers." Coling, 2010. 19 1. !TODO another plagiarism detection paper 15 1. Hacohen-Kerner, Yaakov, Aharon Tayeb, and Natan Ben-Dror. "Detection of simple plagiarism in computer science papers." Coling, 2010. 16 1. Potthast et al. "Overview of the 6th International Competition on Plagiarism Detection." CLEF, 2014. 17 1. [http://www.uni-weimar.de/medien/webis/events/pan-15/pan15-web/ 13th evaluation lab on uncovering plagiarism, authorship, and social software misuse] 20 18 21 19 == Practical Session == … … 35 33 - Evaluation: precision, recall, F1 (the calculaton will be a part of the frame script). 36 34 37