Changes between Version 4 and Version 5 of private/NlpInPracticeCourse/TopicModelling


Timestamp: Nov 2, 2015, 9:15:47 PM
Author: ymaterna

  • private/NlpInPracticeCourse/TopicModelling

    v4  v5
    17  17  In this session we will use [[http://radimrehurek.com/gensim/|Gensim]] to model latent topics of Wikipedia documents. We will focus on Latent Semantic Analysis and Latent Dirichlet Allocation models.
    18  18
        19  1. Download and extract the corpus of Czech Wikipedia documents: [[htdocs:bigdata/wiki.tar.bz2|wiki corpus]].
        20
    19  21  Students will also be required to generate some results of their work and hand them in to prove that they have completed the tasks.
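
The extraction half of the step added in v5 could be sketched as follows. This is a minimal sketch using only the Python standard library, assuming the archive has already been downloaded to a local file. The helper name `extract_corpus` and the target directory are hypothetical, and the internal layout of `wiki.tar.bz2` is not described on this page, so the function simply returns whatever files it finds.

```python
import tarfile
from pathlib import Path


def extract_corpus(archive_path, dest_dir):
    """Extract a .tar.bz2 archive into dest_dir and return the extracted files.

    Hypothetical helper: the name and the return value are illustrative only.
    """
    dest = Path(dest_dir).resolve()
    dest.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive_path, mode="r:bz2") as archive:
        for member in archive.getmembers():
            target = (dest / member.name).resolve()
            # Refuse entries that would be written outside dest_dir.
            if target != dest and dest not in target.parents:
                raise ValueError(f"unsafe path in archive: {member.name}")
        archive.extractall(dest)
    # Sorted list of regular files found under dest_dir after extraction.
    return sorted(p for p in dest.rglob("*") if p.is_file())
```

A call such as `extract_corpus("wiki.tar.bz2", "wiki")` would unpack the corpus into a `wiki` directory next to the archive. The returned paths can then be read one document at a time.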