Context Navigation

Version 1 (modified by Ales Horak, 6 years ago) (diff)
copied from private/AdvancedNlpCourse/CorpusIndexing

Indexing and Searching Very Large Texts

IA161 Advanced NLP Course?, Course Guarantee: Aleš Horák

Prepared by: Miloš Jakubíček

RYCHLÝ, Pavel, et al. Korpusové manažery a~ jejich efektivní implementace. 2000.
JAKUBÍCEK, Miloš; KILGARRIFF, Adam; RYCHLÝ, Pavel. Effective Corpus Virtualization. In: Challenges in the Management of Large Corpora (CMLC-2) Workshop Programme. p. 7.
JAKUBICEK, Milos, et al. Fast Syntactic Searching in Very Large Corpora for Many Languages. In: PACLIC. 2010. p. 741-747.

(optionally) login to aurora
write a program or script that will find all occurrences of a given word form including a small context (at least 5 preceding and succeeding words) in the vertical file
the script will take two arguments: path to the vertical file and word to be searched
If you have logged to aurora, you may use fixed path to the vertical file as
```
/nlp/trac/research/htdocs/bigdata/bnc.vert
```
without the need to copy it.
submit the script into the IS vault