Changes between Initial Version and Version 1 of Lindat/CzAccent

Apr 7, 2021, 10:30:23 AM (2 years ago)



  • Lindat/CzAccent

    v1 v1  
     1= Czaccent
     3[[Image(index.png, align=right)]]
     5== Description
     6The czaccent system adds diacritics into Czech text without diacritics; it uses statictical evaluation of all possible variants. The working data was trained on a very large Czech corpus. The system can be used as a command line tool, or a web-service. It is also available as API, see <a href="">.
     8More information about the system can be found in RYCHLÝ, Pavel. CzAccent - Simple Tool for Restoring Accents in Czech Texts. In Aleš Horák, Pavel Rychlý (eds.). 6th Workshop on Recent Advances in Slavonic Natural Language Processing. Brno: Tribun EU, 2012. s. 15-22. ISBN 978-80-263-0313-8.
     10== How to use the tool
     11You can insert text long up to tens of kB into the input field. If the conversion doesn't work for longer texts, the problem is in your browser or on the way to server (firewall, proxy).
     13The data from the form can be saved in two ways - by default, for the purposes of improving the service, we save the entered plain text (and no other data related to the query) as test data. If you choose the 'Neukládat text' option, we only save the IP address of the request so that we know how often this service is requested. You can request that your data be deleted if necessary by emailing the contact below. You must specify the text or IP address in the request as we are unable to
     14identify them otherwise.
     16[tady by měl být funkční formulář (viz [ původní stránku])]
     18== Acknowledgements
     19This software was developed within the projects LC536 and 2C06009 and is owned by Masaryk University, Faculty of Informatics, NLP Centre.
     21If you use the system, please, cite the related publication as well as the LINDAT/CLARIAH infrastructure: [link do repozitáře (handle daného submission)]
     23@inproceedings{czaccent,[[BR]]   author = {Rychlý, Pavel},[[BR]]   address = {Brno},[[BR]]   booktitle = {6th Workshop on Recent Advances in Slavonic Natural Language Processing},[[BR]]   editor = {Aleš Horák, Pavel Rychlý},[[BR]]   location = {Brno},[[BR]]   isbn = {978-80-263-0313-8},[[BR]]   pages = {15-22},[[BR]]   publisher = {Tribun EU},[[BR]]   title = {!CzAccent - Simple Tool for Restoring Accents in Czech Texts},[[BR]]   year = !{2012}[[BR]]}
     25== Licence
     26License terms can be found [ here].