100,000 podcasts: A spoken English document corpus

A Clifton, S Reddy, Y Yu, A Pappu… - Proceedings of the …, 2020 - aclanthology.org
Podcasts are a large and growing repository of spoken audio. As an audio format, podcasts
are more varied in style and production type than broadcast news, contain more genres than …

SpeechFind: Advances in spoken document retrieval for a national gallery of the spoken word

JHL Hansen, R Huang, B Zhou… - … on Speech and …, 2005 - ieeexplore.ieee.org
Advances in formulating spoken document retrieval for a new National Gallery of the Spoken
Word (NGSW) are addressed. NGSW is the first large-scale repository of its kind, consisting …

Summarization of spontaneous conversations

X Zhu, G Penn - Ninth International Conference on Spoken …, 2006 - academia.edu
Most speech summarization research is conducted on broadcast news. In our viewpoint,
spontaneous conversations are a more “typical” speech source that distinguishes speech …

Using document summarization techniques for speech data subset selection

K Wei, Y Liu, K Kirchhoff, J Bilmes - … of the 2013 Conference of the …, 2013 - aclanthology.org
In this paper we leverage methods from submodular function optimization developed for
document summarization and apply them to the problem of subselecting acoustic data. We …

Tracking and summarizing news on a daily basis with Columbia's Newsblaster

KR McKeown, R Barzilay, D Evans… - Proceedings of the …, 2002 - davidkirkevans.com
Recently, there have been significant advances in several areas of language technology,
including clustering, text categorization, and summarization. However, efforts to combine …

SCANMail: a voicemail interface that makes speech browsable, readable and searchable

S Whittaker, J Hirschberg, B Amento, L Stark… - Proceedings of the …, 2002 - dl.acm.org
Increasing amounts of public, corporate, and private speech data are now available on-line.
These are limited in their usefulness, however, by the lack of tools to permit their browsing …

Automatic Twitter topic summarization with speech acts

R Zhang, W Li, D Gao, Y Ouyang - IEEE transactions on audio …, 2012 - ieeexplore.ieee.org
With the growth of the social media service of Twitter, automatic summarization of Twitter
messages (tweets) is in urgent need for efficient processing of the massive tweeted …

Extractive summarization of meeting recordings

G Murray, S Renals, J Carletta - 2005 - era.ed.ac.uk
Several approaches to automatic speech summarization are discussed below, using the
ICSI Meetings corpus. We contrast feature-based approaches using prosodic and lexical …

Introduction to the special issue on summarization

D Radev, E Hovy, K McKeown - Computational linguistics, 2002 - aclanthology.org
As the amount of on-line information increases, systems that can automatically summarize
one or more documents become increasingly desirable. Recent research has investigated …

Spoken content retrieval: A survey of techniques and technologies

M Larson, GJF Jones - Foundations and Trends® in …, 2012 - nowpublishers.com
Speech media, that is, digital audio and video containing spoken content, has blossomed in
recent years. Large collections are accruing on the Internet as well as in private and …