TEL'M: Test and Evaluation of Language Models

G Cybenko, J Ackerman, P Lintilhac - arXiv preprint arXiv:2404.10200, 2024 - arxiv.org
Language Models have demonstrated remarkable capabilities on some tasks while failing
dramatically on others. The situation has generated considerable interest in understanding …

TEL'M: Test and Evaluation of Language Models

G Cybenko, J Ackerman, P Lintilhac - arXiv e-prints, 2024 - ui.adsabs.harvard.edu
Abstract Language Models have demonstrated remarkable capabilities on some tasks while
failing dramatically on others. The situation has generated considerable interest in …