Exploring the state of the art in legal QA systems

A Abdallah, B Piryani, A Jatowt - Journal of Big Data, 2023 - Springer
Answering questions related to the legal domain is a complex task, primarily due to the
intricate nature and diverse range of legal document systems. Providing an accurate answer …

Revisiting pre-trained models for Chinese natural language processing

Y Cui, W Che, T Liu, B Qin, S Wang, G Hu - arXiv preprint arXiv …, 2020 - arxiv.org
Bidirectional Encoder Representations from Transformers (BERT) has shown marvelous
improvements across various NLP tasks, and consecutive variants have been proposed to …

Pre-training with whole word masking for chinese bert

Y Cui, W Che, T Liu, B Qin… - IEEE/ACM Transactions on …, 2021 - ieeexplore.ieee.org
Bidirectional Encoder Representations from Transformers (BERT) has shown marvelous
improvements across various NLP tasks, and its consecutive variants have been proposed …

How does NLP benefit legal system: A summary of legal artificial intelligence

H Zhong, C Xiao, C Tu, T Zhang, Z Liu… - arXiv preprint arXiv …, 2020 - arxiv.org
Legal Artificial Intelligence (LegalAI) focuses on applying the technology of artificial
intelligence, especially natural language processing, to benefit tasks in the legal domain. In …

Chinesebert: Chinese pretraining enhanced by glyph and pinyin information

Z Sun, X Li, X Sun, Y Meng, X Ao, Q He, F Wu… - arXiv preprint arXiv …, 2021 - arxiv.org
Recent pretraining models in Chinese neglect two important aspects specific to the Chinese
language: glyph and pinyin, which carry significant syntax and semantic information for …

[HTML][HTML] Lawformer: A pre-trained language model for chinese legal long documents

C Xiao, X Hu, Z Liu, C Tu, M Sun - AI Open, 2021 - Elsevier
Legal artificial intelligence (LegalAI) aims to benefit legal systems with the technology of
artificial intelligence, especially natural language processing (NLP). Recently, inspired by …

Disc-lawllm: Fine-tuning large language models for intelligent legal services

S Yue, W Chen, S Wang, B Li, C Shen, S Liu… - arXiv preprint arXiv …, 2023 - arxiv.org
We propose DISC-LawLLM, an intelligent legal system utilizing large language models
(LLMs) to provide a wide range of legal services. We adopt legal syllogism prompting …

Cuad: An expert-annotated nlp dataset for legal contract review

D Hendrycks, C Burns, A Chen, S Ball - arXiv preprint arXiv:2103.06268, 2021 - arxiv.org
Many specialized domains remain untouched by deep learning, as large labeled datasets
require expensive expert annotators. We address this bottleneck within the legal domain by …

VNHSGE: VietNamese High School Graduation Examination Dataset for Large Language Models

XQ Dao, NB Le, TD Vo, XD Phan, BB Ngo… - arXiv preprint arXiv …, 2023 - arxiv.org
The VNHSGE (VietNamese High School Graduation Examination) dataset, developed
exclusively for evaluating large language models (LLMs), is introduced in this article. The …

Legal natural language processing from 2015-2022: A comprehensive systematic mapping study of advances and applications

E Quevedo, T Cerny, A Rodriguez, P Rivas… - IEEE …, 2023 - ieeexplore.ieee.org
The surge in legal text production has amplified the workload for legal professionals, making
many tasks repetitive and time-consuming. Furthermore, the complexity and specialized …