Astronomia ex machina: a history, primer and outlook on neural networks in astronomy

MJ Smith, JE Geach - Royal Society Open Science, 2023 - royalsocietypublishing.org
In this review, we explore the historical development and future prospects of artificial
intelligence (AI) and deep learning in astronomy. We trace the evolution of connectionism in …

Embers of autoregression: Understanding large language models through the problem they are trained to solve

RT McCoy, S Yao, D Friedman, M Hardy… - arXiv preprint arXiv …, 2023 - arxiv.org
The widespread adoption of large language models (LLMs) makes it important to recognize
their strengths and limitations. We argue that in order to develop a holistic understanding of …

Driving and suppressing the human language network using large language models

G Tuckute, A Sathe, S Srikant, M Taliaferro… - Nature Human …, 2024 - nature.com
Transformer models such as GPT generate human-like language and are predictive of
human brain responses to language. Here, using functional-MRI-measured brain responses …

Generative representational instruction tuning

N Muennighoff, H Su, L Wang, N Yang, F Wei… - arXiv preprint arXiv …, 2024 - arxiv.org
All text-based language problems can be reduced to either generation or embedding.
Current models only perform well at one or the other. We introduce generative …

Can neural networks do arithmetic? a survey on the elementary numerical skills of state-of-the-art deep learning models

A Testolin - Applied Sciences, 2024 - mdpi.com
Creating learning models that can exhibit sophisticated reasoning abilities is one of the
greatest challenges in deep learning research, and mathematics is rapidly becoming one of …

Scaling law for recommendation models: Towards general-purpose user representations

K Shin, H Kwak, SY Kim, MN Ramström… - Proceedings of the …, 2023 - ojs.aaai.org
Recent advancement of large-scale pretrained models such as BERT, GPT-3, CLIP, and
Gopher, has shown astonishing achievements across various task domains. Unlike vision …

Instructprotein: Aligning human and protein language via knowledge instruction

Z Wang, Q Zhang, K Ding, M Qin, X Zhuang… - arXiv preprint arXiv …, 2023 - arxiv.org
Large Language Models (LLMs) have revolutionized the field of natural language
processing, but they fall short in comprehending biological sequences such as proteins. To …

Infusing behavior science into large language models for activity coaching

N Hegde, M Vardhan, D Nathani… - PLOS Digital …, 2024 - journals.plos.org
Large language models (LLMs) have shown promise for task-oriented dialogue across a
range of domains. The use of LLMs in health and fitness coaching is under-explored …

A survey on text classification: Practical perspectives on the Italian language

A Gasparetto, A Zangari, M Marcuzzo, A Albarelli - Plos one, 2022 - journals.plos.org
Text Classification methods have been improving at an unparalleled speed in the last
decade thanks to the success brought about by deep learning. Historically, state-of-the-art …

Clueweb22: 10 billion web documents with visual and semantic information

A Overwijk, C Xiong, X Liu, C VandenBerg… - arXiv preprint arXiv …, 2022 - arxiv.org
ClueWeb22, the newest iteration of the ClueWeb line of datasets, provides 10 billion web
pages affiliated with rich information. Its design was influenced by the need for a high …