= Indexing and Searching Very Large Texts = [[https://is.muni.cz/auth/predmet/fi/ia161|IA161]] [[en/AdvancedNlpCourse|Advanced NLP Course]], Course Guarantee: Aleš Horák Prepared by: Miloš Jakubíček == State of the Art == === References === 1. RYCHLÝ, Pavel, et al. Korpusové manažery a~ jejich efektivní implementace. 2000. 1. JAKUBÍCEK, Miloš; KILGARRIFF, Adam; RYCHLÝ, Pavel. Effective Corpus Virtualization. In: Challenges in the Management of Large Corpora (CMLC-2) Workshop Programme. p. 7. 1. JAKUBICEK, Milos, et al. Fast Syntactic Searching in Very Large Corpora for Many Languages. In: PACLIC. 2010. p. 741-747. == Practical Session == 1. login to alba 1. inspect command-line tools that are part of manatee (rpm -ql manatee) 1. inspect index files of BNC using less, od, lsclex, dumpdrev, dumpdtext 1. inspect the Python API of Manatee using the provided overview