Changes between Version 37 and Version 38 of private/NlpInPracticeCourse/LanguageResourcesFromWeb


Ignore:
Timestamp:
Oct 16, 2023, 11:48:55 AM (7 months ago)
Author:
xsuchom2
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • private/NlpInPracticeCourse/LanguageResourcesFromWeb

    v37 v38  
    99
    1010
    11 == State of the Art ==
     11== References ==
    1212
    13 === References ===
    14  1. Chapters 19 and 20 from C. D. Manning et al. "Introduction to Information Retrieval". Cambridge University Press, 2008.
    15  1. Pomikálek, Jan. "Removing boilerplate and duplicate content from web corpora." Dissertation thesis. Masaryk University, 2011.
     13=== State of the Art ===
    1614 1. Suchomel, Vít. "Better Web Corpora For Corpus Linguistics And NLP." Dissertation thesis. Masaryk University, 2020.
    1715 1. Janek Bevendorff, BERTa Chulvi, Gretel Liz De La Peña Sarracén, Mike Kestemont, Enrique Manjavacas, Ilia Markov, Maximilian Mayerl, Martin Potthast, Francisco Rangel, Paolo Rosso, Efstathios Stamatatos, Benno Stein, Matti Wiegmann, Magdalena Wolska, and Eva Zangerle. Overview of PAN 2021: Authorship Verification, Profiling Hate Speech Spreaders on Twitter, and Style Change Detection. In D. Hiemstra, MF. Moens, J. Mothe, R. Perego, M. Potthast, F. Sebastiani, editors, Advances in Information Retrieval (ECIR 2021), March 2021. Springer. [https://pan.webis.de/clef21/pan21-web/]
     
    2321 1. Potthast et al. "Overview of the 6th International Competition on Plagiarism Detection." CLEF, 2014.
    2422}}}
     23
     24=== Other references ===
     25 3. Chapters 19 and 20 from C. D. Manning et al. "Introduction to Information Retrieval". Cambridge University Press, 2008.
     26 1. Pomikálek, Jan. "Removing boilerplate and duplicate content from web corpora." Dissertation thesis. Masaryk University, 2011.
    2527
    2628