Changes between Version 25 and Version 26 of private/NlpInPracticeCourse/NamedEntityRecognition
- Timestamp:
- Nov 17, 2022, 1:16:03 PM (18 months ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
private/NlpInPracticeCourse/NamedEntityRecognition
v25 v26 27 27 === References === 28 28 29 1. Charles Sutton and Andrew !McCallum: An Introduction to Conditional Random Fields. Foundations and Trends in Machine Learning 4 (4). 2012. [[http://homepages.inf.ed.ac.uk/csutton/publications/crftut-fnt.pdf]]30 29 1. Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. BERT: pre-training of deep bidirectional transformers for language understanding, 2019. [[https://arxiv.org/abs/1810.04805]] 31 1. Xinyu Wang, Yong Jiang, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei Huang, Kewei Tu: Automated Concatenation of Embeddings for Structured Prediction. Accepted to Proceedings of ACL-IJCNLP 2021. 17 pages. 32 [[https://arxiv.org/abs/2010.05006]] 33 30 1. Afshin Rahimi, Yuan Li, and Trevor Cohn. 2019. Massively Multilingual Transfer for NER. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 151–164, Florence, Italy. Association for Computational Linguistics. [[https://aclanthology.org/P19-1015/]] 34 31 35 32 == Practical Session == 36 33 37 === CzechNamed Entity Recognition ===34 === Multilingual Named Entity Recognition === 38 35 39 In this workshop, we train a new NER application for the Czech language. We work with free resources & software tools: the Czech NE Corpus (CNEC) and the !FastText pre-trained word embeddings. We build a neural network to solve the problem.36 In this workshop, we train a NER model for any of the languages supported by WikiAnn. We work with the huggingface library, its BERT model for multilingual token classification, and the WikiAnn training data. 40 37 41 38 1. Create `<YOUR_FILE>`, a text file named `ia161-UCO-04.txt` where ''UCO'' is your university ID. 42 1. Open Google Colab at [[https://colab.research.google.com/drive/1mnz-P30CLxrxQ0yyqpcLwVJgi7e59shi?usp=sharing]] 43 1. Follow the instructions in the notebook. There are three obligatory tasks. Write down your answers to `<YOUR_FILE>`. 39 1. Open Google Colab at [[https://colab.research.google.com/drive/15UHh44IGyXff_f1EiOHfrh4tBzkX39EI?usp=sharing]] 40 1. Follow the instructions in the notebook. There are four obligatory tasks. Write down your answers to `<YOUR_FILE>`. 41 1. Submit to the homework vault (Odevzdavarna).