Changes between Initial Version and Version 1 of SqadDatabase


Timestamp:
May 20, 2024, 6:12:44 PM (2 months ago)
Author:
xpopelk
Comment:

--

= Simple question answering database version 3.2 (SQAD 3.2)

== Description

The Simple Question Answering Database version 3.2 (SQAD v3.2) was created from Czech Wikipedia. The new version contains more than 16,000 records. Each SQAD record consists of multiple files: the question, answer extraction, answer selection, URL, question metadata, and, in some cases, the answer context.

=== Example

{{{
Example of SQAD record:
Original text: Létající jaguár je novela spisovatele Josefa Formánka z roku 2004.
Question: Kdo je autorem novely Létající jaguár?
Answer: Josef Formánek
URL: http://cs.wikipedia.org/wiki/L%C3%A9taj%C3%ADc%C3%AD_jagu%C3%A1r
Question type: Person
Answer type: Person
}}}
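
The listing above can be read into a small data structure for experiments. The following is a minimal sketch in Python that parses only the `Label: value` display format shown in the example; it does not read the multi-file layout of the actual SQAD distribution. The class name `SqadExample`, the function `parse_example`, and the attribute names are hypothetical and chosen for illustration only.

```python
from dataclasses import dataclass

# Labels as they appear in the example listing; the mapping to attribute
# names is an assumption made for this sketch, not part of SQAD itself.
FIELD_LABELS = {
    "Original text": "original_text",
    "Question": "question",
    "Answer": "answer",
    "URL": "url",
    "Question type": "question_type",
    "Answer type": "answer_type",
}


@dataclass
class SqadExample:
    """Hypothetical container for one record in the display format."""
    original_text: str
    question: str
    answer: str
    url: str
    question_type: str
    answer_type: str


def parse_example(text: str) -> SqadExample:
    """Parse 'Label: value' lines into a SqadExample."""
    values = {}
    for line in text.splitlines():
        # Split on the first ': ' only, so URLs such as 'http://...' stay intact.
        label, sep, value = line.partition(": ")
        if sep and label.strip() in FIELD_LABELS:
            values[FIELD_LABELS[label.strip()]] = value.strip()
    # Raises TypeError if any of the six expected fields is missing.
    return SqadExample(**values)


EXAMPLE = """\
Example of SQAD record:
Original text: Létající jaguár je novela spisovatele Josefa Formánka z roku 2004.
Question: Kdo je autorem novely Létající jaguár?
Answer: Josef Formánek
URL: http://cs.wikipedia.org/wiki/L%C3%A9taj%C3%ADc%C3%AD_jagu%C3%A1r
Question type: Person
Answer type: Person
"""

record = parse_example(EXAMPLE)
print(record.question)
print(record.answer_type)
```

The caption line `Example of SQAD record:` is skipped because it contains no `Label: value` pair with a known label.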

More information about the database can be found on [https://nlp.fi.muni.cz/projekty/sqad/ the SQAD project page].

== LINDAT handle

http://hdl.handle.net/11234/1-5019

== Acknowledgements

If you use the database, please cite the related publication as well as the LINDAT/CLARIAH infrastructure: http://hdl.handle.net/11234/1-5019.

Project code: LM2018101

Project name: LINDAT/CLARIAH-CZ: Digital Research Infrastructure for Language Technologies, Arts and Humanities (Digitální výzkumná infrastruktura pro jazykové technologie, umění a humanitní vědy)

=== Older versions

- SQAD 3.0 http://hdl.handle.net/11234/1-3069
- SQAD 2.1 http://hdl.handle.net/11234/1-3068
- SQADv2 http://hdl.handle.net/11234/1-2595
- SQAD http://hdl.handle.net/11234/1-1463

== Publication info

- HORÁK, Aleš and Marek MEDVEĎ. SQAD: Simple Question Answering Database. In Eighth Workshop on Recent Advances in Slavonic Natural Language Processing. Brno: Tribun EU, 2014. pp. 121-128. ISSN 2336-4289.
- MEDVEĎ, Marek and Aleš HORÁK. AQA: Automatic Question Answering System for Czech. In Petr Sojka, Aleš Horák, Ivan Kopeček, Karel Pala. Text, Speech, and Dialogue: 19th International Conference, TSD 2016, Brno, Czech Republic, September 12–16, 2016, Proceedings. Switzerland: Springer International Publishing, 2016. pp. 270-278. ISBN 978-3-319-45510-5. doi:10.1007/978-3-319-45510-5_31.
- MEDVEĎ, Marek, Radoslav SABOL and Aleš HORÁK. Czech Question Answering with Extended SQAD v3.0 Benchmark Dataset. In Aleš Horák, Pavel Rychlý, Adam Rambousek. Proceedings of the Thirteenth Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2019. Brno: Tribun EU, 2019. pp. 99-108. ISBN 978-80-263-1530-8.
- MEDVEĎ, Marek, Aleš HORÁK and Radoslav SABOL. Improving RNN-based Answer Selection for Morphologically Rich Languages. In Ana Rocha, Luc Steels, Jaap van den Herik. Proceedings of the 12th International Conference on Agents and Artificial Intelligence. Portugal: SCITEPRESS, 2020. pp. 644-651. ISBN 978-989-758-395-7. doi:10.5220/0008979206440651.
- MEDVEĎ, Marek, Radoslav SABOL and Aleš HORÁK. Evaluating Long Contexts in the Czech Answer Selection Task. In Aleš Horák, Pavel Rychlý, Adam Rambousek. Recent Advances in Slavonic Natural Language Processing (RASLAN 2021). Brno: Tribun EU, 2021. pp. 61-69. ISBN 978-80-263-1670-1.
- MEDVEĎ, Marek, Aleš HORÁK and Radoslav SABOL. Comparing RNN and Transformer Context Representations in the Czech Answer Selection Task. In Ana Paula Rocha, Luc Steels, Jaap van den Herik. Proceedings of the 14th International Conference on Agents and Artificial Intelligence (ICAART). Portugal: SCITEPRESS, 2022. pp. 388-394. ISBN 978-989-758-547-0. doi:10.5220/0010827000003116.

If you cite SQAD, please use this citation:

{{{
@conference{icaart22,
   author={Marek Medveď and Radoslav Sabol and Aleš Horák},
   title={Comparing RNN and Transformer Context Representations in the Czech Answer Selection Task},
   booktitle={Proceedings of the 14th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART},
   year={2022},
   pages={388--394},
   publisher={SciTePress},
   organization={INSTICC},
   doi={10.5220/0010827000003116},
   isbn={978-989-758-547-0},
   issn={2184-433X},
}
}}}

== License

Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0)