Changes between Version 25 and Version 26 of private/NlpInPracticeCourse/NamedEntityRecognition


Timestamp:
Nov 17, 2022, 1:16:03 PM (18 months ago)
Author:
Zuzana Nevěřilová
Comment:

--

  • private/NlpInPracticeCourse/NamedEntityRecognition

    v25 v26  
    2727=== References ===
    2828
    29  1. Charles Sutton and Andrew !McCallum: An Introduction to Conditional Random Fields. Foundations and Trends in Machine Learning 4 (4). 2012. [[http://homepages.inf.ed.ac.uk/csutton/publications/crftut-fnt.pdf]]
    3029 1. Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. BERT: pre-training of deep bidirectional transformers for language understanding, 2019. [[https://arxiv.org/abs/1810.04805]]
    31  1. Xinyu Wang, Yong Jiang, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei Huang, Kewei Tu: Automated Concatenation of Embeddings for Structured Prediction. Accepted to Proceedings of ACL-IJCNLP 2021. 17 pages.
    32 [[https://arxiv.org/abs/2010.05006]]
    33 
     30 1. Afshin Rahimi, Yuan Li, and Trevor Cohn. 2019. Massively Multilingual Transfer for NER. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 151–164, Florence, Italy. Association for Computational Linguistics. [[https://aclanthology.org/P19-1015/]]
    3431
    3532== Practical Session ==
    3633
    37 === Czech Named Entity Recognition ===
     34=== Multilingual Named Entity Recognition ===
    3835
    39 In this workshop, we train a new NER application for the Czech language. We work with free resources & software tools: the Czech NE Corpus (CNEC) and the !FastText pre-trained word embeddings. We build a neural network to solve the problem.
     36In this workshop, we train a NER model for any of the languages supported by WikiAnn. We work with the Hugging Face library, its BERT model for multilingual token classification, and the WikiAnn training data.
    4037
    41381. Create `<YOUR_FILE>`, a text file named `ia161-UCO-04.txt` where ''UCO'' is your university ID.
    42 1. Open Google Colab at [[https://colab.research.google.com/drive/1mnz-P30CLxrxQ0yyqpcLwVJgi7e59shi?usp=sharing]]
    43 1. Follow the instructions in the notebook. There are three obligatory tasks. Write down your answers to `<YOUR_FILE>`.
     391. Open Google Colab at [[https://colab.research.google.com/drive/15UHh44IGyXff_f1EiOHfrh4tBzkX39EI?usp=sharing]]
     401. Follow the instructions in the notebook. There are four obligatory tasks. Write down your answers to `<YOUR_FILE>`.
     411. Submit to the homework vault (Odevzdavarna).
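
A typical preprocessing step when fine-tuning a BERT-style model for token classification is aligning word-level NER tags with subword tokens. The following is a minimal sketch of that step, not the code from the course notebook. It assumes the label inventory is IOB2 over PER, ORG and LOC in the order shown, and that `word_ids` has the shape a Hugging Face fast tokenizer returns from `word_ids()` (one entry per subword, `None` for special tokens). The helper `align_labels` and the example sentence are hypothetical.

```python
from typing import List, Optional

# Assumed label inventory: IOB2 tags over three entity types.
LABELS = ["O", "B-PER", "I-PER", "B-ORG", "I-ORG", "B-LOC", "I-LOC"]
IGNORE_INDEX = -100  # default ignore_index of PyTorch's cross-entropy loss


def align_labels(word_ids: List[Optional[int]], word_labels: List[int]) -> List[int]:
    """Spread word-level label ids over subword tokens.

    word_ids maps each subword token to the index of its source word,
    with None for special tokens such as [CLS] and [SEP].
    Only the first subword of each word keeps the word's label;
    all other positions get IGNORE_INDEX so the loss skips them.
    """
    aligned = []
    previous = None
    for word_id in word_ids:
        if word_id is None:
            aligned.append(IGNORE_INDEX)          # special token
        elif word_id != previous:
            aligned.append(word_labels[word_id])  # first subword of a word
        else:
            aligned.append(IGNORE_INDEX)          # continuation subword
        previous = word_id
    return aligned


# Hypothetical example: "Masaryk University Brno" split into subwords as
# [CLS] Mas ##aryk University Br ##no [SEP]
word_ids = [None, 0, 0, 1, 2, 2, None]
word_labels = [LABELS.index("B-ORG"), LABELS.index("I-ORG"), LABELS.index("B-LOC")]
aligned = align_labels(word_ids, word_labels)
```

Labelling only the first subword of each word is one common convention; another is to copy the word's label to every subword. Which one the notebook uses is not stated here.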