Changes between Version 7 and Version 8 of sk


Ignore:
Timestamp:
Jun 13, 2013, 11:09:55 PM (11 years ago)
Author:
xmedved1
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • sk

    v7 v8  
    22
    33
    4 == Morphological analysis for slovak ==
     4== Morphological analysis for Slovak ==
    55
    66For Slovak there is only one morphological tagger called '''MORČE'''. '''MORČE''' is Czech morphological tagger based on Averaged Perceptron developed in Prague, Czech Republic in 2007. It was trained on Slovak manually annotated corpus '''r-mak''' and now use for Slovak morphological analysis.
     
    5353[[BR]]
    5454
     55{{{
     56Feature     Accuracy
     57kind        98.10 %
     58genus       93.87 %
     59number      98.76 %
     60case        93.32 %
     61person      96.67 %
     62mod         99.93 %
     63whole tag   92.31 %
     64}}}
     65[[BR]]
     66
     67In this evaluation we don't use any parameters. As a~input for training we use 80% of '''r-mak 3.0''' and we annotate the rest 20% of corpus. Then we determine accuracy between original 20% part of '''r-mak 3.0''' and annotated by '''RFTagger'''.
    5568
    5669{{{
    5770Feature     Accuracy
    58 kind        98.16 %
    59 genus       94.01 %
    60 number      98.78 %
    61 case        93.49 %
    62 person      96.85 %
     71kind        98.02 %
     72genus       95.81 %
     73number      99.24 %
     74case        95.42 %
     75person      98.53 %
    6376mod         99.92 %
    64 whole tag   89.55 %
     77whole tag   94.10 %
    6578}}}
    6679[[BR]]
    6780
    6881   
    69 We do not use any parameters. As a~input for training we use 90% of '''r-mak 3.0''' and we annotate the rest 10% of corpus. Then we determine accuracy between original 10% part of '''r-mak 3.0''' and annotated by '''RFTagger'''.
     82We do use -o POSTag -c 8 -l lexicon parameters. As a~input for training we use 80% of '''r-mak 3.0''' and we annotate the rest 20% of corpus. Then we determine accuracy between original 20% part of '''r-mak 3.0''' and annotated by '''RFTagger'''.
    7083
    7184
     
    7891person      98.53 %
    7992mod         99.898 %
    80 whole tag   92.42 %
     93whole tag   94.016 %
    8194
    8295}}}