The 1st Workshop on Data Contamination (CONDA 2024) focuses on all relevant aspects of data contamination in natural language processing, where data contamination is understood …
The rapid development of Large Language Models (LLMs) like GPT-4, Claude-3, and Gemini has transformed the field of natural language processing. However, it has also …
Y Fu, O Uzuner, M Yetisgen, F Xia - arXiv preprint arXiv:2410.18966, 2024 - arxiv.org
Large language models (LLMs) have demonstrated great performance across various benchmarks, showing potential as general-purpose task solvers. However, as LLMs are …
Test contamination is a serious problem for the evaluation of large language models (LLMs) because it leads to the overestimation of their performance and a quick saturation of …
Relational databases play an important role in business, science, and beyond. However, the operability of relational databases is restricted to users familiar with specific languages such …
C Caramello, A Cigliano, F Fallucchi… - SYSYEM 2024: 10th …, 2024 - ceur-ws.org
The railway industry is a sector where asset maintenance is paramount in ensuring passenger safety and service continuity. In this context, the application of the blockchain …
A Cigliano, F Fallucchi, M Gerardi - … of Yearly Reports on Infor-matics …, 2024 - ceur-ws.org
The advent of digital analysis tools and Large Language Models (LLMs) has significantly altered the landscape of digital humanities, introducing new methodologies for processing …
Earlier works have been showing the efficacy of reasoning methods in eliciting step-wise reasoning of large language models (LLMs) by operating via in-context demonstrations …
An Efficient strategy for conducting pre-training of language models is the concatenation of contiguous sequences of text of fixed length through causal masking that estimates the …