+= Indexing and Searching Very Large Texts =
+[[https://is.muni.cz/auth/predmet/fi/ia161|IA161]] [[en/AdvancedNlpCourse|Advanced NLP Course]], Course Guarantor: Aleš Horák
+Prepared by: Miloš Jakubíček
+== State of the Art ==
+=== References ===
+. RYCHLÝ, Pavel, et al. Korpusové manažery a jejich efektivní implementace [Corpus managers and their efficient implementation]. 2000.
+. JAKUBÍČEK, Miloš; KILGARRIFF, Adam; RYCHLÝ, Pavel. Effective Corpus Virtualization. In: Challenges in the Management of Large Corpora (CMLC-2) Workshop Programme. p. 7.
+. JAKUBÍČEK, Miloš, et al. Fast Syntactic Searching in Very Large Corpora for Many Languages. In: PACLIC. 2010. p. 741-747.
+== Practical Session ==
+. log in to aurora
+. write a program or script that finds all occurrences of a given word form in the vertical file {{{/corpora-fast1/vert/bnc/bnc.vert}}} and prints each occurrence with a small context (at least 5 preceding and 5 following words)
+. the script takes two arguments: the path to the vertical file and the word to search for
+. submit the script to the IS vault
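
One possible starting point for the task above is the following minimal Python sketch. It is not a reference solution and rests on several assumptions that the task text does not state: that the vertical file holds one token per line, that the word form is the first tab-separated column, that lines of the form `<...>` are structural markup (such as sentence or document tags) and can be skipped, and that the file is UTF-8 encoded. The names `tokens`, `concordance`, `main` and the file name `concordance.py` are made up for illustration. The file is read as a stream, so memory use stays constant regardless of corpus size.

```python
import sys
from collections import deque

CONTEXT = 5  # words of context printed on each side of a hit


def tokens(lines):
    """Yield word forms from vertical-format lines, skipping markup lines.

    Assumption: one token per line, word form in the first tab-separated
    column; lines that look like <...> are treated as structure tags.
    """
    for line in lines:
        line = line.rstrip("\n")
        if not line or (line.startswith("<") and line.endswith(">")):
            continue
        yield line.split("\t", 1)[0]


def concordance(lines, word, context=CONTEXT):
    """Yield (left, keyword, right) for every occurrence of `word`."""
    left = deque(maxlen=context)  # sliding window of preceding words
    pending = []                  # hits still collecting their right context
    for tok in tokens(lines):
        for hit in pending:
            hit[2].append(tok)
        # The oldest hit fills up first; emit every hit that is complete.
        while pending and len(pending[0][2]) == context:
            yield tuple(pending.pop(0))
        if tok == word:
            pending.append([list(left), tok, []])
        left.append(tok)
    # Hits near the end of the file have a shorter right context.
    for hit in pending:
        yield tuple(hit)


def main(path, word):
    # Assumption: UTF-8 input; undecodable bytes are replaced, not fatal.
    with open(path, encoding="utf-8", errors="replace") as vert:
        for lctx, kw, rctx in concordance(vert, word):
            print(" ".join(lctx), f"<{kw}>", " ".join(rctx))


if __name__ == "__main__":
    if len(sys.argv) == 3:
        main(sys.argv[1], sys.argv[2])
    else:
        print("usage: concordance.py VERTICAL_FILE WORD", file=sys.stderr)
```

Under these assumptions it could be run as `python3 concordance.py /corpora-fast1/vert/bnc/bnc.vert house`. The comparison is an exact, case-sensitive match on the word form; case-insensitive or lemma-based matching would need a different column or a normalisation step.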