Changes between Version 9 and Version 10 of private/AdvancedNlpCourse/LanguageModelling


Timestamp:
Nov 2, 2015, 8:27:08 AM
Author:
Vít Baisa
Comment:

--

  • private/AdvancedNlpCourse/LanguageModelling

    v9  v10
    30  30  * {{{cd cblm}}}
    31  31
        32  Mksary
        33
        34  * {{{git clone https://github.com/lh3/libdivsufsort.git}}}
        35  * {{{cd libdivsufsort}}}
        36  * {{{mkdir build}}}
        37  * {{{cmake -DCMAKE_BUILD_TYPE="Release" -DCMAKE_INSTALL_PREFIX="/ABSOLUTE_PATH_TO_LIBDIVSUFSORT"}}}
        38  * {{{make}}}
        39  * {{{cd ..}}}
        40  * {{{ln -s libdivsufsort/examples/mksary mksary}}}
        41
    32  42  == Training data ==
    33  43
    34  44  To build a new model, we need
    35  45  * a plain text, see {{{data}}} directory, use {{{lower.py}}}
    36      * to create a suffix array {{{mksary INPUT.txt OUTPUT.sa}}}
        46  * to create a suffix array {{{./mksary INPUT.txt OUTPUT.sa}}}
    37  47  * and compute the prefix tree: {{{python build_trie.py FILE.sa [MINFREQ] [OUTPUTFILE]}}}
    38  48
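
As a rough illustration of what the suffix array step provides, the sketch below builds a suffix array naively in Python and uses it to count how often a substring occurs. This is not how {{{mksary}}} or {{{build_trie.py}}} are implemented; the function names {{{naive_suffix_array}}} and {{{count_occurrences}}} are hypothetical and exist only for this example. libdivsufsort uses a far faster construction algorithm, so the naive sort here is suitable for tiny inputs only.

```python
def naive_suffix_array(data):
    """Return the start offsets of all suffixes of `data` (bytes),
    sorted by the lexicographic order of the suffixes.

    Hypothetical helper for illustration; quadratic or worse in practice.
    """
    return sorted(range(len(data)), key=lambda i: data[i:])


def count_occurrences(data, sa, pattern):
    """Count occurrences of `pattern` in `data` using suffix array `sa`.

    All suffixes starting with `pattern` form one contiguous run in `sa`,
    so two binary searches over the pattern-length prefixes find the run.
    """
    m = len(pattern)

    # Lower bound: first suffix whose m-byte prefix is >= pattern.
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if data[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    first = lo

    # Upper bound: first suffix whose m-byte prefix is > pattern.
    hi = len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if data[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid

    return lo - first


if __name__ == "__main__":
    text = b"banana"
    sa = naive_suffix_array(text)
    print(sa)
    print(count_occurrences(text, sa, b"ana"))
```

Counts like these are the kind of information a frequency threshold such as {{{MINFREQ}}} could be applied to, although how {{{build_trie.py}}} actually uses the threshold is not shown on this page.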