Changes between Version 25 and Version 26 of private/NlpInPracticeCourse/LanguageResourcesFromWeb
- Timestamp:
- Nov 5, 2019, 11:22:55 PM (4 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
private/NlpInPracticeCourse/LanguageResourcesFromWeb
v25 v26 31 31 * A POS tagged vertical: 3 TAB separated columns: word, lemma (the base form of the word), POS/morphological tag. 32 32 * Text processing pipelines for converting a text file to a 3-column vertical: 33 * Czech: {{{a lba:/opt/majka/majka-desamb-czech.sh | cut -f1-3}}} or a [http://nlp.fi.muni.cz/projekty/rule_ind/index.cgi web interface] (short documents only)33 * Czech: {{{asteria04:/opt/majka_pipe/majka-czech_v2.sh | cut -f1-3}}} or a [http://nlp.fi.muni.cz/projekty/rule_ind/index.cgi web interface] (short documents only) 34 34 * See an example below. 35 * English: {{{a lba:/opt/TreeTagger/tools/tt-english\_v2.sh | awk '{print \$1"\textbackslash t"\$3"\textbackslash t"\$2}'}}}35 * English: {{{asteria04:/opt/treetagger_pipe/tt-english_v2.1.sh}}} 36 36 * For each plagiarism: 37 37 * describe plagiarsim technique(s) used