Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities

Z Kong, A Goel, R Badlani, W Ping, R Valle… - arXiv preprint arXiv …, 2024 - arxiv.org
Augmenting large language models (LLMs) to understand audio--including non-speech
sounds and non-verbal speech--is critically important for diverse real-world applications of …

Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities

Z Kong, A Goel, R Badlani, W Ping, R Valle… - arXiv e …, 2024 - ui.adsabs.harvard.edu
Augmenting large language models (LLMs) to understand audio--including non-speech
sounds and non-verbal speech--is critically important for diverse real-world applications of …

Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities

Z Kong, A Goel, R Badlani, W Ping, R Valle… - Forty-first International … - openreview.net
Augmenting large language models (LLMs) to understand audio--including non-speech
sounds and non-verbal speech--is critically important for diverse real-world applications of …