Changes between Version 14 and Version 15 of private/NlpInPracticeCourse/AutomaticCorrection
- Timestamp: Dec 17, 2015, 9:12:22 PM
== Practical Session ==

There will be a short overview of [[https://www.languagetool.org/|LanguageTool]], a style and grammar checker. Students can test its language correction algorithm and evaluate it on real data. Once they are acquainted with how a spelling corrector works, we will write a simple spelling corrector in Python. The spelling corrector will be trained on a large text file compiled from [[https://www.gutenberg.org/|Project Gutenberg]]. The example is based on Peter Norvig's [[http://norvig.com/spell-correct.html|Spelling Corrector]] in Python. Students who finish early can take on the additional task of enhancing the spelling corrector's functionality.

 1. Download the prepared script [[https://nlp.fi.muni.cz/trac/research/attachment/wiki/private/AdvancedNlpCourse/AutomaticCorrection/spell.py|spell.py]] and the training data collection [[https://nlp.fi.muni.cz/trac/research/attachment/wiki/private/AdvancedNlpCourse/AutomaticCorrection/big.txt|big.txt]].
 1. Run spell.py.
 1. Test the script {{{ python ./spell.py }}} in your working directory.
 1. Open it in your favourite editor and we will walk through its functionality.

=== Task ===

 1. Create `<YOUR_FILE>`, a text file named ia161-UCO-14.txt, where UCO is your university ID.

 2. Run `spell.py` with the development and final test sets (test1 and test2) and write the results to `<YOUR_FILE>`.

 3. Explain the results in a few words and write your explanation to `<YOUR_FILE>`.

 4. Modify the code of `spell.py` to increase accuracy by 10 %. Write your new accuracy results to `<YOUR_FILE>`.

 5. Run the script with `verbose=True` and examine the results. Suggest at least one adjustment that would improve the spellchecker's accuracy. Write your suggestions to `<YOUR_FILE>`.

-Bonus question- How could you make the implementation faster without changing the results? Write your suggestions to `<YOUR_FILE>`.
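
For orientation, the idea behind the session can be sketched as follows: count word frequencies in a training text, generate all strings within one or two simple edits of the input word, and pick the known candidate with the highest count. This is a minimal sketch in the spirit of Norvig's essay, not the contents of the course's `spell.py`. The function names, the toy training text, and the `(right, wrong)` test pair format are assumptions made for illustration. In the session the counts would come from `big.txt` instead.

```python
import re
from collections import Counter

ALPHABET = "abcdefghijklmnopqrstuvwxyz"

def words(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())

def train(text):
    """Count how often each word occurs in the training text."""
    return Counter(words(text))

def edits1(word):
    """All strings one delete, transpose, replace or insert away from word."""
    splits = [(word[:i], word[i:]) for i in range(len(word) + 1)]
    deletes = [a + b[1:] for a, b in splits if b]
    transposes = [a + b[1] + b[0] + b[2:] for a, b in splits if len(b) > 1]
    replaces = [a + c + b[1:] for a, b in splits if b for c in ALPHABET]
    inserts = [a + c + b for a, b in splits for c in ALPHABET]
    return set(deletes + transposes + replaces + inserts)

def known(candidates, counts):
    """Keep only the candidates that were seen in the training text."""
    return {w for w in candidates if w in counts}

def correct(word, counts):
    """Prefer the word itself, then known words at distance 1, then distance 2."""
    candidates = (
        known([word], counts)
        or known(edits1(word), counts)
        or known((e2 for e1 in edits1(word) for e2 in edits1(e1)), counts)
        or {word}  # nothing known nearby: give the word back unchanged
    )
    return max(candidates, key=lambda w: counts[w])

def accuracy(test_pairs, counts):
    """Fraction of (right, wrong) pairs for which wrong is corrected to right."""
    hits = sum(correct(wrong, counts) == right for right, wrong in test_pairs)
    return hits / len(test_pairs)

# Toy training text (hypothetical); the session uses big.txt instead.
TOY_TEXT = "the spelling corrector corrects spelling errors in the text the the"
COUNTS = train(TOY_TEXT)

# Hypothetical test pairs in (right, wrong) order.
TOY_TESTS = [
    ("spelling", "speling"),
    ("the", "teh"),
    ("errors", "erors"),
    ("text", "tetx"),
    ("language", "langage"),  # "language" is not in the toy vocabulary
]

if __name__ == "__main__":
    print(correct("speling", COUNTS))
    print(accuracy(TOY_TESTS, COUNTS))
```

The last toy pair fails because the target word never occurs in the training text. Errors of this kind are worth looking for when explaining the accuracy measured on the real test sets.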

=== Upload `<YOUR_FILE>` and edited `spell.py` ===

Do not forget to upload your resulting files to the [https://is.muni.cz/auth/el/1433/podzim2015/IA161/ode/59241116/ homework vault (odevzdávárna)].