Changes between Version 4 and Version 5 of private/NlpInPracticeCourse/TopicModelling
- Timestamp:
- Nov 2, 2015, 9:15:47 PM (8 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
private/NlpInPracticeCourse/TopicModelling
v4 v5 17 17 In this session we will use [[http://radimrehurek.com/gensim/|Gensim]] to model latent topics of Wikipedia documents. We will focus on Latent Semantic Analysis and Latent Dirichlet Allocation models. 18 18 19 1. Download and extract the corpus of Czech Wikipedia documents: [[htdocs:bigdata/wiki.tar.bz2|wiki corpus]]. 20 19 21 Students will also be required to generate some results of their work and hand them in to prove completing the tasks.