Text-free prosody-aware generative spoken language modeling

E Kharitonov, A Lee, A Polyak, Y Adi, J Copet… - arXiv preprint arXiv …, 2021 - arxiv.org
Speech pre-training has primarily demonstrated efficacy on classification tasks, while its
capability of generating novel speech, similar to how GPT-2 can generate coherent …

Automatic summarization

A Nenkova, K McKeown - Foundations and Trends® in …, 2011 - nowpublishers.com
It has now been 50 years since the publication of Luhn's seminal paper on automatic
summarization. During these years the practical need for automatic summarization has …

Dialogue act modeling for automatic tagging and recognition of conversational speech

A Stolcke, K Ries, N Coccaro, E Shriberg… - Computational …, 2000 - direct.mit.edu
We describe a statistical approach for modeling dialogue acts in conversational speech, i.e.,
speech-act-like units such as Statement, Question, Backchannel, Agreement, Disagreement …

Tracing emotion: an overview

R Cowie, G McKeown… - International Journal of …, 2012 - igi-global.com
Computational research with continuous representations depends on obtaining continuous
representations from human labellers. The main method used for that purpose is tracing …

Behavioral signal processing: Deriving human behavioral informatics from speech and language

S Narayanan, PG Georgiou - Proceedings of the IEEE, 2013 - ieeexplore.ieee.org
The expression and experience of human behavior are complex and multimodal and
characterized by individual and contextual heterogeneity and variability. Speech and …

Acoustic correlates of information structure

M Breen, E Fedorenko, M Wagner… - Language and cognitive …, 2010 - Taylor & Francis
This paper reports three studies aimed at addressing three questions about the acoustic
correlates of information structure in English: (1) do speakers mark information structure …

Emotion recognition of affective speech based on multiple classifiers using acoustic-prosodic information and semantic labels

CH Wu, WB Liang - IEEE Transactions on Affective Computing, 2010 - ieeexplore.ieee.org
This work presents an approach to emotion recognition of affective speech based on
multiple classifiers using acoustic-prosodic information (AP) and semantic labels (SLs). For …

[BOOK][B] Prosodic patterns in English conversation

NG Ward - 2019 - books.google.com
Language is more than words: it includes the prosodic features and patterns that we use,
subconsciously, to frame meanings and achieve our goals in our interaction with others …

Enriching speech recognition with automatic detection of sentence boundaries and disfluencies

Y Liu, E Shriberg, A Stolcke, D Hillard… - … on audio, speech …, 2006 - ieeexplore.ieee.org
Effective human and automatic processing of speech requires recovery of more than just the
words. It also involves recovering phenomena such as sentence boundaries, filler words …

Prosodic correlates of sentences in signed languages: A literature review and suggestions for new types of studies

E Ormel, O Crasborn - Sign Language Studies, 2012 - JSTOR
This article contains a literature review of evidence of large prosodic domains that
correspond to syntactic units such as a clause or a sentence. In particular, different phonetic …