Changes between Version 37 and Version 38 of private/NlpInPracticeCourse/LanguageResourcesFromWeb
- Timestamp:
- Oct 16, 2023, 11:48:55 AM (7 months ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
private/NlpInPracticeCourse/LanguageResourcesFromWeb
v37 v38 9 9 10 10 11 == State of the Art==11 == References == 12 12 13 === References === 14 1. Chapters 19 and 20 from C. D. Manning et al. "Introduction to Information Retrieval". Cambridge University Press, 2008. 15 1. Pomikálek, Jan. "Removing boilerplate and duplicate content from web corpora." Dissertation thesis. Masaryk University, 2011. 13 === State of the Art === 16 14 1. Suchomel, Vít. "Better Web Corpora For Corpus Linguistics And NLP." Dissertation thesis. Masaryk University, 2020. 17 15 1. Janek Bevendorff, BERTa Chulvi, Gretel Liz De La Peña Sarracén, Mike Kestemont, Enrique Manjavacas, Ilia Markov, Maximilian Mayerl, Martin Potthast, Francisco Rangel, Paolo Rosso, Efstathios Stamatatos, Benno Stein, Matti Wiegmann, Magdalena Wolska, and Eva Zangerle. Overview of PAN 2021: Authorship Verification, Profiling Hate Speech Spreaders on Twitter, and Style Change Detection. In D. Hiemstra, MF. Moens, J. Mothe, R. Perego, M. Potthast, F. Sebastiani, editors, Advances in Information Retrieval (ECIR 2021), March 2021. Springer. [https://pan.webis.de/clef21/pan21-web/] … … 23 21 1. Potthast et al. "Overview of the 6th International Competition on Plagiarism Detection." CLEF, 2014. 24 22 }}} 23 24 === Other references === 25 3. Chapters 19 and 20 from C. D. Manning et al. "Introduction to Information Retrieval". Cambridge University Press, 2008. 26 1. Pomikálek, Jan. "Removing boilerplate and duplicate content from web corpora." Dissertation thesis. Masaryk University, 2011. 25 27 26 28