Changes between Version 38 and Version 39 of private/NlpInPracticeCourse/LanguageResourcesFromWeb


Ignore:
Timestamp:
Oct 16, 2023, 4:11:04 PM (7 months ago)
Author:
xsuchom2
Comment:

references

Legend:

Unmodified
Added
Removed
Modified
  • private/NlpInPracticeCourse/LanguageResourcesFromWeb

    v38 v39  
    1212
    1313=== State of the Art ===
    14  1. Suchomel, Vít. "Better Web Corpora For Corpus Linguistics And NLP." Dissertation thesis. Masaryk University, 2020.
     14 1. Suchomel, Vít. "[https://is.muni.cz/th/u4rmz/Better_Web_Corpora_For_Corpus_Linguistics_And_NLP.pdf Better Web Corpora For Corpus Linguistics And NLP]." Dissertation thesis. Masaryk University, 2020.
     15 1. Jauhiainen, Tommi, Heidi Jauhiainen, and Krister Lindén. "[https://helda.helsinki.fi/bitstream/handle/10138/350001/2022.lrec_1.416.pdf?sequence=1 HeLI-OTS, Off-the-shelf Language Identifier for Text]." In Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022). European Language Resources Association (ELRA), 2022.
    1516 1. Janek Bevendorff, BERTa Chulvi, Gretel Liz De La Peña Sarracén, Mike Kestemont, Enrique Manjavacas, Ilia Markov, Maximilian Mayerl, Martin Potthast, Francisco Rangel, Paolo Rosso, Efstathios Stamatatos, Benno Stein, Matti Wiegmann, Magdalena Wolska, and Eva Zangerle. Overview of PAN 2021: Authorship Verification, Profiling Hate Speech Spreaders on Twitter, and Style Change Detection. In D. Hiemstra, MF. Moens, J. Mothe, R. Perego, M. Potthast, F. Sebastiani, editors, Advances in Information Retrieval (ECIR 2021), March 2021. Springer. [https://pan.webis.de/clef21/pan21-web/]
    1617{{{#!comment
     
    2223}}}
    2324
    24 === Other references ===
    25  3. Chapters 19 and 20 from C. D. Manning et al. "Introduction to Information Retrieval". Cambridge University Press, 2008.
     25=== Other useful references ===
     26 4. Chapters 19 and 20 from C. D. Manning et al. "Introduction to Information Retrieval". Cambridge University Press, 2008.
     27 1. Schäfer, Roland, and Felix Bildhauer. Web corpus construction. Morgan & Claypool Publishers, 2013.
    2628 1. Pomikálek, Jan. "Removing boilerplate and duplicate content from web corpora." Dissertation thesis. Masaryk University, 2011.
    2729