Bloom: A 176b-parameter open-access multilingual language model

T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow… - 2023 - inria.hal.science
Large language models (LLMs) have been shown to be able to perform new tasks based on
a few demonstrations or natural language instructions. While these capabilities have led to …

Understanding the benefits and challenges of deploying conversational AI leveraging large language models for public health intervention

E Jo, DA Epstein, H Jung, YH Kim - … of the 2023 CHI Conference on …, 2023 - dl.acm.org
Recent large language models (LLMs) have advanced the quality of open-ended
conversations with chatbots. Although LLM-driven chatbots have the potential to support …

Predictability and surprise in large generative models

D Ganguli, D Hernandez, L Lovitt, A Askell… - Proceedings of the …, 2022 - dl.acm.org
Large-scale pre-training has recently emerged as a technique for creating capable, general-
purpose, generative models such as GPT-3, Megatron-Turing NLG, Gopher, and many …

[PDF][PDF] KLUE: Korean Language Understanding Evaluation

S Park - arXiv preprint arXiv:2105.09680, 2021 - academia.edu
We introduce Korean Language Understanding Evaluation (KLUE) benchmark. KLUE is a
collection of 8 Korean natural language understanding (NLU) tasks, including Topic …

Leveraging large language models to power chatbots for collecting user self-reported data

J Wei, S Kim, H Jung, YH Kim - Proceedings of the ACM on Human …, 2024 - dl.acm.org
Large language models (LLMs) provide a new way to build chatbots by accepting natural
language prompts. Yet, it is unclear how to design prompts to power chatbots to carry on …

The mystery of in-context learning: A comprehensive survey on interpretation and analysis

Y Zhou, J Li, Y Xiang, H Yan, L Gui… - Proceedings of the 2024 …, 2024 - aclanthology.org
Understanding in-context learning (ICL) capability that enables large language models
(LLMs) to excel in proficiency through demonstration examples is of utmost importance. This …

What language model to train if you have one million gpu hours?

TL Scao, T Wang, D Hesslow, L Saulnier… - arXiv preprint arXiv …, 2022 - arxiv.org
The crystallization of modeling methods around the Transformer architecture has been a
boon for practitioners. Simple, well-motivated architectural variations can transfer across …

ChaCha: Leveraging Large Language Models to Prompt Children to Share Their Emotions about Personal Events

W Seo, C Yang, YH Kim - Proceedings of the CHI Conference on Human …, 2024 - dl.acm.org
Children typically learn to identify and express their emotions by sharing stories and feelings
with others, particularly family members. However, it is challenging for parents or siblings to …

Distributed inference and fine-tuning of large language models over the internet

A Borzunov, M Ryabinin… - Advances in …, 2024 - proceedings.neurips.cc
Large language models (LLMs) are useful in many NLP tasks and become more capable
with size, with the best open-source models having over 50 billion parameters. However …

Building a role specified open-domain dialogue system leveraging large-scale language models

S Bae, D Kwak, S Kim, D Ham, S Kang, SW Lee… - arXiv preprint arXiv …, 2022 - arxiv.org
Recent open-domain dialogue models have brought numerous breakthroughs. However,
building a chat system is not scalable since it often requires a considerable volume of …