Changes between Version 28 and Version 29 of private/NlpInPracticeCourse/LanguageModelling
- Timestamp:
- Oct 2, 2024, 9:24:03 AM (10 months ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
private/NlpInPracticeCourse/LanguageModelling
v28 v29 38 38 39 39 40 We generate random text using these models. The first model is based only on characters, later one uses subword tokenization with [[https://github.com/ rsennrich/subword-nmt|BPE]].40 We generate random text using these models. The first model is based only on characters, later one uses subword tokenization with [[https://github.com/google/sentencepiece|SentecePiece]]. 41 41 42 42 43 Access the [[https://colab.research.google.com/drive/1 zs62Wzc87Hcrc-PpLzgfYT7sN9tey3Pe?usp=sharing|Python notebook in the Google Colab environment]]. Do not forget to save your work if you want to see your changes later, leaving the browser will throw away all changes!43 Access the [[https://colab.research.google.com/drive/17YnlmLJ9xiSZCpr7Z2fhNeQdNoMtJYUm|Python notebook in the Google Colab environment]]. Do not forget to save your work if you want to see your changes later, leaving the browser will throw away all changes! 44 44 45 45 OR