Changes between Initial Version and Version 1 of en/AdvancedNlpCourse2019/CorpusIndexing


Ignore:
Timestamp:
Oct 1, 2020, 3:33:43 PM (4 years ago)
Author:
Ales Horak
Comment:

copied from private/AdvancedNlpCourse/CorpusIndexing

Legend:

Unmodified
Added
Removed
Modified
  • en/AdvancedNlpCourse2019/CorpusIndexing

    v1 v1  
     1= Indexing and Searching Very Large Texts =
     2
     3[[https://is.muni.cz/auth/predmet/fi/ia161|IA161]] [[en/AdvancedNlpCourse|Advanced NLP Course]], Course Guarantee: Aleš Horák
     4
     5Prepared by: Miloš Jakubíček
     6
     7== State of the Art ==
     8
     9=== References ===
     10
     11 1. RYCHLÝ, Pavel, et al. Korpusové manažery a~ jejich efektivní implementace. 2000.
     12 1. JAKUBÍCEK, Miloš; KILGARRIFF, Adam; RYCHLÝ, Pavel. Effective Corpus Virtualization. In: Challenges in the Management of Large Corpora (CMLC-2) Workshop Programme. p. 7.
     13 1. JAKUBICEK, Milos, et al. Fast Syntactic Searching in Very Large Corpora for Many Languages. In: PACLIC. 2010. p. 741-747.
     14
     15== Practical Session ==
     16
     17 1. login to alba
     18 1. inspect command-line tools that are part of manatee (rpm -ql manatee)
     19 1. inspect index files of BNC using less, od, lsclex, dumpdrev, dumpdtext
     20 1. inspect the Python API of Manatee using the provided overview