= NLP lab seminar =

The laboratory seminar is primarily meant to present the work of the active laboratory members: what they are working on, what results they have, what problems they face, and which subtasks they cannot solve and would welcome collaboration on. Occasionally, presentations by members of related departments can also be expected.

The seminar is now held on '''Thursdays at 14:00 in B203''' (Autumn 2021) and is open to anyone interested in the subject (you do not have to be active in the lab). It can also be taken as the course [https://is.muni.cz/predmet/fi/pv173 PV173 NLP Lab Seminar], which earns three credits for active participation, including a presentation of your results (achieved in NLP Center projects or on a relevant issue).

The seminar is given in English. Presentations can be in English, Czech or Slovak.

Selected presentations are also presented online via [https://cesnet.zoom.us/j/92417314815 authenticated ZOOM]. Please upload any attachments to the online presentation by following the [wiki:/cs/LaboratorniSeminar/PresentationInstructions instructions]. Uploaded videos are available on the [wiki:/cs/LaboratorniSeminar/Videos video page].

=== Presentations wanted: ===
 *

=== Presentations offered: ===
 * [https://www.leeds.ac.uk/arts/profile/125106/1011/serge_sharoff Serge Sharoff]: Reliable classification of web genres

=== Seminar programme in the autumn semester 2021 === #seminar_podzim
|| '''date''' || '''program''' ||
{{{#!th rowspan=2
'''16.9.'''
}}}
|| seminar programme for this semester ||
|| Aleš Horák: [http://raslan2021.nlp-consulting.net RASLAN 2021] Call for Papers ||
{{{#!th rowspan=2
'''23.9.'''
}}}
|| Vít Novotný: [htdocs:seminar2021/VNovotny-Summer_NLP_Conferences.pdf SIGIR 2021 and RANLP 2021] ||
|| Adam Rambousek: [http://nlp.fi.muni.cz/projects/ahisto AHISTO project] ||
{{{#!th rowspan=1
'''30.9.'''
}}}
|| Michaela Denisová: Crosslingual embedding models ||
{{{#!th rowspan=1
'''7.10.'''
}}}
|| Mikuláš Bankovič: Superresolution techniques for OCR ||
{{{#!th rowspan=2
'''14.10.'''
}}}
|| Rastislav Papčo: Topic classification in web corpora ||
|| Edoardo Signoroni: Corpus alignment by machine translation techniques ||
{{{#!th rowspan=1
'''21.10.'''
}}}
|| Dalibor Bačovský: Improving the Subword Model of fastText ||
{{{#!th rowspan=2
'''4.11.'''
}}}
|| Ondřej Sotolář: Facebook conversations classification ||
|| Radoslav Sabol: TBA ||
{{{#!th rowspan=2
'''11.11.'''
}}}
|| Tereza Vrabcová: TBA ||
|| Adam Hájek: TBA ||
{{{#!th rowspan=2
'''18.11.'''
}}}
|| Petr Zelina: TBA ||
|| Samuel Špalek: TBA ||
{{{#!th rowspan=2
'''25.11.'''
}}}
|| Marek Medveď: QA ||
|| Kristína Němcová: TBA ||
{{{#!th rowspan=2
'''2.12.'''
}}}
|| Tomáš Houfek: Information extraction from medical records ||
|| Daniel Krátký: TBA ||
{{{#!th rowspan=2
'''9.12.'''
}}}
|| Krištof Anetta, Mahmut Arslan: Electronic health records processing ||
|| Ondřej Herman: TBA ||

=== Seminar programme in the spring semester 2021 === #seminar_jaro
|| '''date''' || '''program''' ||
{{{#!th rowspan=1
'''2.3.'''
}}}
|| seminar programme for this semester ||
{{{#!th rowspan=3
'''9.3.'''
}}}
|| Pavel Rychlý: LINDAT/CLARIAH-CZ project ||
|| Pavel Rychlý: machine translation project ||
|| Pavel Rychlý: dictionary generation project ||
{{{#!th rowspan=2
'''16.3.'''
}}}
|| Helena Medková: Zeugma Detection using Word Sketch ||
|| Vítek Novotný: [htdocs:seminar2020/VNovotny-EDS_EMBED.pdf EDS-MEMBED: Multi-Sense Embeddings Based on Enhanced Distributional Semantic Structures via a Graph Walk over Word Senses] ||
{{{#!th rowspan=2
'''23.3.'''
}}}
|| Michal Štefánik: Unsupervised Estimation of Out-of-Domain Performance of Language Models ||
|| Marek Medveď: SQAD database update ||
{{{#!th rowspan=2
'''30.3.'''
}}}
|| Hien Thi Ha: Block type classification from scanned invoices ||
|| Vítek Novotný: Combining log-bilinear language models with Transformers ||
{{{#!th rowspan=1
'''6.4.'''
}}}
|| Tomáš Houfek: Data extraction from medical reports ||
{{{#!th rowspan=2
'''13.4.'''
}}}
|| Mikuláš Bankovič: Application of super-resolution on OCR of historical documents ||
|| Adam Hájek: GPT-2 computation on MetaCentrum ||
{{{#!th rowspan=2
'''20.4.'''
}}}
|| Tereza Vrabcová: Parallel corpus from web pages ||
|| Vítek Novotný: [htdocs:seminar2020/VNovotny-FastText_Attention_slides.pdf When FastText Pays Attention] ([https://arxiv.org/abs/2104.09691 preprint]) ||
{{{#!th rowspan=2
'''27.4.'''
}}}
|| Tereza Kinská: Creation of Judikatura corpora of court decisions ||
|| Pavel Rychlý: [htdocs:seminar2020/PRychly-Using_Makefiles.pdf Using Makefiles for NLP projects] ||
{{{#!th rowspan=1
'''4.5.'''
}}}
|| Petr Zelina: [htdocs:seminar2020/PZelina-ALBERT_training.pdf ALBERT Training with TensorFlow and PyTorch] ||
{{{#!th rowspan=1
'''11.5.'''
}}}
|| Krištof Anetta: Electronic Health Records processing, Apache cTAKES ||
{{{#!th rowspan=1
'''18.5.'''
}}}
|| Ondřej Sotolář: [htdocs:seminar2021/OSotolar-Personal_Data_Detection.pdf Building a Corpus for Personal Data Detection] ||
{{{#!th rowspan=1
'''25.5.'''
}}}
|| Michal Starý: [htdocs:seminar2021/MStary-EventDetection.pdf Event Detection] ||

[[BR]]It is also possible to view the
[[cs/LaboratorniSeminarHistorie | seminar programme in preceding semesters]].