Question answering (QA) models have shown rapid progress enabled by the availability of large, high-quality benchmark datasets. Such annotated datasets are difficult and costly to …
D Dzendzik, C Vogel, J Foster - arXiv preprint arXiv:2101.10421, 2021 - arxiv.org
This paper surveys 60 English Machine Reading Comprehension datasets, with a view to providing a convenient resource for other researchers interested in this problem. We …
XQ Dao, NB Le, TD Vo, XD Phan, BB Ngo… - arXiv preprint arXiv …, 2023 - arxiv.org
The VNHSGE (VietNamese High School Graduation Examination) dataset, developed exclusively for evaluating large language models (LLMs), is introduced in this article. The …
The paper presents SberQuAD, a large Russian reading comprehension (RC) dataset created similarly to English SQuAD. SberQuAD contains about 50K question-paragraph …
Testing with quiz questions has proven to be an effective way to assess and improve the educational process. However, manually creating quizzes is tedious and time-consuming …
We propose EXAMS, a new benchmark dataset for cross-lingual and multilingual question answering for high school examinations. We collected more than 24,000 high-quality high …
Reading comprehension involves the process of reading and understanding textual information in order to answer questions related to it. It finds practical applications in various …
SK-QuAD is the first manually annotated dataset of questions and answers in Slovak. It consists of more than 91k factual questions and answers from various fields. Each question …
A Chandra, A Fahrizain, SW Laufried - arXiv preprint arXiv:2112.13634, 2021 - arxiv.org
Research in question answering datasets and models has gained a lot of attention in the research community. Many of them release their own question answering datasets as well …