Automatic speech recognition and speech variability: A review

M Benzeghiba, R De Mori, O Deroo, S Dupont… - Speech …, 2007 - Elsevier
Major progress is being recorded regularly on both the technology and exploitation of
automatic speech recognition (ASR) and spoken language systems. However, there are still …

Human language technology: Opportunities and challenges

M Ostendorf, E Shriberg… - Proceedings.(ICASSP'05) …, 2005 - ieeexplore.ieee.org
In recent years, there has been dramatic progress in both speech and language processing,
in many cases leveraging some of the same underlying methods. This progress and the …

Automated generation of 'good enough' transcripts as a first step to transcription of audio-recorded data

C Bokhove, C Downey - Methodological innovations, 2018 - journals.sagepub.com
In the last decade, automated captioning services have appeared in mainstream technology
use. Until now, the focus of these services has been on the technical aspects, supporting …

How might we create better benchmarks for speech recognition?

A Aksënova, D van Esch, J Flynn… - Proceedings of the 1st …, 2021 - aclanthology.org
The applications of automatic speech recognition (ASR) systems are proliferating, in part
due to recent significant quality improvements. However, as recent work indicates, even …

[BOOK][B] Multilingual speech processing

T Schultz, K Kirchhoff - 2006 - books.google.com
Tanja Schultz and Katrin Kirchhoff have compiled a comprehensive overview of speech
processing from a multilingual perspective. By taking this all-inclusive approach to speech …

[BOOK][B] Multilingual information retrieval: From research to practice

C Peters, M Braschler, P Clough - 2012 - Springer
We are living in a multilingual world and the diversity in languages which are used to
interact with information access systems has generated a wide variety of challenges to be …

Spoken content retrieval: A survey of techniques and technologies

M Larson, GJF Jones - Foundations and Trends® in …, 2012 - nowpublishers.com
Speech media, that is, digital audio and video containing spoken content, has blossomed in
recent years. Large collections are accruing on the Internet as well as in private and …

Exploring capabilities of monolingual audio transformers using large datasets in automatic speech recognition of Czech

J Lehečka, J Švec, A Pražák, JV Psutka - arXiv preprint arXiv:2206.07627, 2022 - arxiv.org
In this paper, we present our progress in pretraining Czech monolingual audio transformers
from a large dataset containing more than 80 thousand hours of unlabeled speech, and …

SpeechFind: Advances in spoken document retrieval for a national gallery of the spoken word

JHL Hansen, R Huang, B Zhou… - … on Speech and …, 2005 - ieeexplore.ieee.org
Advances in formulating spoken document retrieval for a new National Gallery of the Spoken
Word (NGSW) are addressed. NGSW is the first large-scale repository of its kind, consisting …

Adaptation of machine translation for multilingual information retrieval in the medical domain

P Pecina, O Dušek, L Goeuriot, J Hajič… - Artificial intelligence in …, 2014 - Elsevier
Objective: We investigate machine translation (MT) of user search queries in the context of
cross-lingual information retrieval (IR) in the medical domain. The main focus is on …