Changes between Version 22 and Version 23 of private/NlpInPracticeCourse/LanguageResourcesFromWeb
- Timestamp:
- Oct 23, 2017, 5:22:35 PM (8 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
private/NlpInPracticeCourse/LanguageResourcesFromWeb
v22 v23 31 31 * Text processing pipelines for converting a text file to a 3-column vertical: 32 32 * Czech: {{{alba:/opt/majka/majka-desamb-czech.sh | cut -f1-3}}} or a [http://nlp.fi.muni.cz/projekty/rule_ind/index.cgi web interface] (short documents only) 33 * See an example below. 33 34 * English: {{{alba:/opt/TreeTagger/tools/tt-english\_v2.sh | awk '{print \$1"\textbackslash t"\$3"\textbackslash t"\$2}'}}} 34 35 * For each plagiarism: … … 76 77 }}} 77 78 79 How to produce the 3-column POS tagged vertical from a plaintext: 80 {{{ 81 scp plagiarism.txt aurora.fi.muni.cz:~/ 82 ssh aurora.fi.muni.cz 83 ssh alba 84 /opt/majka/majka-desamb-czech.sh < ~/plagiarism.txt | cut -f1-3 > ~/plagiarism.vert 85 logout 86 logout 87 scp aurora.fi.muni.cz:~/plagiarism.vert ./ 88 }}} 89 90