Changes between Version 25 and Version 26 of private/NlpInPracticeCourse/LanguageResourcesFromWeb


Ignore:
Timestamp:
Nov 5, 2019, 11:22:55 PM (4 years ago)
Author:
xsuchom2
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • private/NlpInPracticeCourse/LanguageResourcesFromWeb

    v25 v26  
    3131  * A POS tagged vertical: 3 TAB separated columns: word, lemma (the base form of the word), POS/morphological tag.
    3232  * Text processing pipelines for converting a text file to a 3-column vertical:
    33     * Czech: {{{alba:/opt/majka/majka-desamb-czech.sh | cut -f1-3}}} or a [http://nlp.fi.muni.cz/projekty/rule_ind/index.cgi web interface] (short documents only)
     33    * Czech: {{{asteria04:/opt/majka_pipe/majka-czech_v2.sh | cut -f1-3}}} or a [http://nlp.fi.muni.cz/projekty/rule_ind/index.cgi web interface] (short documents only)
    3434      * See an example below.
    35     * English: {{{alba:/opt/TreeTagger/tools/tt-english\_v2.sh | awk '{print \$1"\textbackslash t"\$3"\textbackslash t"\$2}'}}}
     35    * English: {{{asteria04:/opt/treetagger_pipe/tt-english_v2.1.sh}}}
    3636  * For each plagiarism:
    3737    * describe plagiarsim technique(s) used