Lawbench: Benchmarking legal knowledge of large language models

Z Fei, X Shen, D Zhu, F Zhou, Z Han, S Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models (LLMs) have demonstrated strong capabilities in various aspects.
However, when applying them to the highly specialized, safe-critical legal domain, it is …

Unsupervised layer-wise score aggregation for textual ood detection

M Darrin, G Staerman, EDC Gomes… - Proceedings of the …, 2024 - ojs.aaai.org
Abstract Out-of-distribution (OOD) detection is a rapidly growing field due to new robustness
and security requirements driven by an increased number of AI-based systems. Existing …

Comparing styles across languages

S Havaldar, M Pressimone, E Wong… - arXiv preprint arXiv …, 2023 - arxiv.org
Understanding how styles differ across languages is advantageous for training both humans
and computers to generate culturally appropriate text. We introduce an explanation …

Towards multilingual automatic open-domain dialogue evaluation

J Mendonça, A Lavie, I Trancoso - … of the 24th Annual Meeting of …, 2023 - aclanthology.org
The main limiting factor in the development of robust multilingual open-domain dialogue
evaluation metrics is the lack of multilingual data and the limited availability of open-sourced …

A Comprehensive Evaluation of Parameter-Efficient Fine-Tuning on Software Engineering Tasks

W Zou, Q Li, J Ge, C Li, X Shen, L Huang… - arXiv preprint arXiv …, 2023 - arxiv.org
Pre-trained models (PTMs) have achieved great success in various Software Engineering
(SE) downstream tasks following the``pre-train then fine-tune''paradigm. As fully fine-tuning …

Bridging Cultural Nuances in Dialogue Agents through Cultural Value Surveys

Y Cao, M Chen, D Hershcovich - arXiv preprint arXiv:2401.10352, 2024 - arxiv.org
The cultural landscape of interactions with dialogue agents is a compelling yet relatively
unexplored territory. It's clear that various sociocultural aspects--from communication styles …

xPQA: Cross-Lingual Product Question Answering in 12 Languages

X Shen, A Asai, B Byrne… - Proceedings of the 61st …, 2023 - aclanthology.org
Abstract Product Question Answering (PQA) systems are key in e-commerce applications as
they provide responses to customers' questions as they shop for products. While existing …

Towards Multilingual Automatic Dialogue Evaluation

J Mendonça, A Lavie, I Trancoso - arXiv preprint arXiv:2308.16795, 2023 - arxiv.org
The main limiting factor in the development of robust multilingual dialogue evaluation
metrics is the lack of multilingual data and the limited availability of open sourced …

Is Translation Helpful? An Exploration of Cross-Lingual Transfer in Low-Resource Dialog Generation

L Shen, S Yu, X Shen - 2024 International Joint Conference on …, 2024 - ieeexplore.ieee.org
Cross-lingual transfer is important for developing high-quality chatbots in multiple languages
to address the imbalanced distribution of language resources. A typical approach of cross …

xPQA: Cross-lingual product question answering across 12 languages

X Shen, A Asai, B Byrne, A de Gispert - arXiv preprint arXiv:2305.09249, 2023 - arxiv.org
Product Question Answering (PQA) systems are key in e-commerce applications to provide
responses to customers' questions as they shop for products. While existing work on PQA …