= Indexing and Searching Very Large Texts =

[[|IA161]] [[en/AdvancedNlpCourse|Advanced NLP Course]], Course Guarantor: Aleš Horák

Prepared by: Miloš Jakubíček

== State of the Art ==

=== References ===

== Practical Session ==

 1. (optionally) log in to aurora
 1. write a program or script that finds all occurrences of a given word form, together with a small context (at least 5 preceding and 5 following words), in the [[htdocs:bigdata/bnc.vert.xz|vertical file]]
 1. the script takes two arguments: the path to the vertical file and the word to search for [[br]]
 If you are logged in to aurora, you may use the fixed path to the vertical file
 {{{
 …
 }}}
 without the need to copy it.
 1. submit the script to the IS vault
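
As a starting point, the task above could be sketched roughly as follows. This is only a minimal sketch, not a reference solution. It assumes that the vertical file has one token per line, with the word form in the first tab-separated column, that lines starting with `<` are structural markup, and that the file is UTF-8 encoded. The names `read_tokens`, `concordance` and `CONTEXT` are hypothetical. The file is streamed once and only a small window is kept in memory, so the compressed file does not need to be unpacked first.

```python
import lzma
import sys
from collections import deque

CONTEXT = 5  # number of context words on each side (hypothetical constant)


def read_tokens(lines):
    """Yield word forms from vertical-format lines.

    Assumption: one token per line, word form in the first tab-separated
    column, structural tags (e.g. <s>, </s>) on lines starting with '<'.
    """
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("<"):
            continue
        yield line.split("\t", 1)[0]


def concordance(tokens, word, context=CONTEXT):
    """Yield (left, keyword, right) for every occurrence of `word`."""
    left = deque(maxlen=context)  # the last `context` tokens seen so far
    pending = []                  # hits still collecting their right context
    for tok in tokens:
        for hit in pending:
            hit[2].append(tok)
        # The oldest hit always has the longest right context, so flush from the front.
        while pending and len(pending[0][2]) == context:
            yield pending.pop(0)
        if tok == word:
            pending.append((list(left), tok, []))
        left.append(tok)
    # Hits near the end of the file have a shorter right context.
    for hit in pending:
        yield hit


def main(argv):
    if len(argv) != 3:
        print(f"usage: {argv[0]} VERTICAL_FILE WORD", file=sys.stderr)
        return
    path, word = argv[1], argv[2]
    # Read .xz files directly; fall back to plain text otherwise.
    opener = lzma.open if path.endswith(".xz") else open
    with opener(path, "rt", encoding="utf-8") as f:
        for left, keyword, right in concordance(read_tokens(f), word):
            print(" ".join(left), f"<{keyword}>", " ".join(right))


if __name__ == "__main__":
    main(sys.argv)
```

Note that this is a linear scan: every query reads the whole file again. That is acceptable for a single lookup, but it is exactly the cost that a precomputed index is meant to avoid.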