JGLUE: Japanese general language understanding evaluation

K Kurihara, D Kawahara, T Shibata - Proceedings of the Thirteenth …, 2022 - aclanthology.org
To develop high-performance natural language understanding (NLU) models, it is
necessary to have a benchmark to evaluate and analyze NLU ability from various …

ZmBART: An unsupervised cross-lingual transfer framework for language generation

KK Maurya, MS Desarkar, Y Kano… - arXiv preprint arXiv …, 2021 - arxiv.org
Despite the recent advancement in NLP research, cross-lingual transfer for natural language
generation is relatively understudied. In this work, we transfer supervision from high …

A survey on non-english question answering dataset

A Chandra, A Fahrizain, SW Laufried - arXiv preprint arXiv:2112.13634, 2021 - arxiv.org
Research in question answering datasets and models has gained a lot of attention in the
research community. Many of them release their own question answering datasets as well …

Meta-X: A Meta-Learning Approach Based on Language Clustering for Zero-Shot Cross-Lingual Transfer and Generation

KK Maurya, MS Desarkar - arXiv preprint arXiv:2203.10250, 2022 - arxiv.org
Recently, the NLP community has witnessed a rapid advancement in multilingual and cross-
lingual transfer research where the supervision is transferred from high-resource languages …

JDocQA: Japanese Document Question Answering Dataset for Generative Language Models

E Onami, S Kurita, T Miyanishi, T Watanabe - arXiv preprint arXiv …, 2024 - arxiv.org
Document question answering is a task of question answering on given documents such as
reports, slides, pamphlets, and websites, and it is a truly demanding task as paper and …

Limits and challenges of embedding-based question answering in export control expert system

R Rzepka, D Shirafuji, A Obayashi - Procedia Computer Science, 2021 - Elsevier
In this paper, we report our initial findings from creating a question answering module for a
dialog-based expert system which aims at advising users on export control regulations. We …

Native Chinese reader: a dataset towards native-level Chinese machine reading comprehension

S Xu, Y Liu, X Yi, S Zhou, H Li, Y Wu - arXiv preprint arXiv:2112.06494, 2021 - arxiv.org
We present Native Chinese Reader (NCR), a new machine reading comprehension (MRC)
dataset with particularly long articles in both modern and classical Chinese. NCR is …

[PDF][PDF] Annotated question and answer dataset for security export control

A Obayashi, R Rzepka - Proceedings of the 7th Linguistic and …, 2021 - ceur-ws.org
This paper introduces a set of questions and answers in Japanese language for the topics
related to security export control. Unlike the most available datasets for question answering …

JEMHopQA: Dataset for Japanese Explainable Multi-Hop Question Answering

A Ishii, N Inoue, H Suzuki, S Sekine - Proceedings of the 2024 …, 2024 - aclanthology.org
We present JEMHopQA, a multi-hop QA dataset for the development of explainable QA
systems. The dataset consists not only of question-answer pairs, but also of supporting …

Construction of a Corpus of Rhetorical Devices in Slogans and Structural Analysis of Antitheses

A Niwa, N Okazaki, K Wakimoto, K Nishiguchi… - Transactions on Asian …, 2021 - dl.acm.org
An advertising slogan is a sentence that expresses a product or a work of art in a
straightforward manner and is used for advertising and publicity. Moving the consumer's …