Legalbench: A collaboratively built benchmark for measuring legal reasoning in large language models

N Guha, J Nyarko, D Ho, C Ré… - Advances in …, 2024 - proceedings.neurips.cc
The advent of large language models (LLMs) and their adoption by the legal community has
given rise to the question: what types of legal reasoning can LLMs perform? To enable …

Task-aware retrieval with instructions

A Asai, T Schick, P Lewis, X Chen, G Izacard… - arXiv preprint arXiv …, 2022 - arxiv.org
We study the problem of retrieval with instructions, where users of a retrieval system
explicitly describe their intent along with their queries. We aim to develop a general-purpose …

Bringing order into the realm of Transformer-based language models for artificial intelligence and law

CM Greco, A Tagarelli - Artificial Intelligence and Law, 2023 - Springer
Transformer-based language models (TLMs) have widely been recognized to be a cutting-
edge technology for the successful development of deep-learning-based solutions to …

Applicability of large language models and generative models for legal case judgement summarization

A Deroy, K Ghosh, S Ghosh - Artificial Intelligence and Law, 2024 - Springer
Automatic summarization of legal case judgements, which are known to be long and
complex, has traditionally been tried via extractive summarization models. In recent years …

How to train long-context language models (effectively)

T Gao, A Wettig, H Yen, D Chen - arXiv preprint arXiv:2410.02660, 2024 - arxiv.org
We study continued training and supervised fine-tuning (SFT) of a language model (LM) to
make effective use of long-context information. We first establish a reliable evaluation …

SUMMEDITS: measuring LLM ability at factual reasoning through the lens of summarization

P Laban, W Kryściński, D Agarwal… - Proceedings of the …, 2023 - aclanthology.org
With the recent appearance of LLMs in practical settings, having methods that can effectively
detect factual inconsistencies is crucial to reduce the propagation of misinformation and …

Llms as factual reasoners: Insights from existing benchmarks and beyond

P Laban, W Kryściński, D Agarwal, AR Fabbri… - arXiv preprint arXiv …, 2023 - arxiv.org
With the recent appearance of LLMs in practical settings, having methods that can effectively
detect factual inconsistencies is crucial to reduce the propagation of misinformation and …

Lextreme: A multi-lingual and multi-task benchmark for the legal domain

J Niklaus, V Matoshi, P Rani, A Galassi… - arXiv preprint arXiv …, 2023 - arxiv.org
Lately, propelled by the phenomenal advances around the transformer architecture, the
legal NLP field has enjoyed spectacular growth. To measure progress, well curated and …

Natural Language Processing for the Legal Domain: A Survey of Tasks, Datasets, Models, and Challenges

F Ariai, G Demartini - arXiv preprint arXiv:2410.21306, 2024 - arxiv.org
Natural Language Processing is revolutionizing the way legal professionals and laypersons
operate in the legal field. The considerable potential for Natural Language Processing in the …

An exploration of hierarchical attention transformers for efficient long document classification

I Chalkidis, X Dai, M Fergadiotis, P Malakasiotis… - arXiv preprint arXiv …, 2022 - arxiv.org
Non-hierarchical sparse attention Transformer-based models, such as Longformer and Big
Bird, are popular approaches to working with long documents. There are clear benefits to …