Report on the 11th IWSLT evaluation campaign

M Cettolo, J Niehues, S Stüker… - Proceedings of the …, 2014 - aclanthology.org
The paper overviews the 11th evaluation campaign organized by the IWSLT workshop. The
2014 evaluation offered multiple tracks on lecture transcription and translation based on the …

The MGB challenge: Evaluating multi-genre broadcast media recognition

P Bell, MJF Gales, T Hain, J Kilgour… - … IEEE Workshop on …, 2015 - ieeexplore.ieee.org
This paper describes the Multi-Genre Broadcast (MGB) Challenge at ASRU 2015, an
evaluation focused on speech recognition, speaker diarization, and "lightly supervised" …

Multitask learning of context-dependent targets in deep neural network acoustic models

P Bell, P Swietojanski, S Renals - IEEE/ACM Transactions on …, 2016 - ieeexplore.ieee.org
This paper investigates the use of multitask learning to improve context-dependent deep
neural network (DNN) acoustic models. The use of hybrid DNN systems with clustered …

Edinburgh SLT and MT system description for the IWSLT 2014 evaluation

A Birch, M Huck, N Durrani, N Bogoychev… - Proceedings of the …, 2014 - aclanthology.org
This paper describes the University of Edinburgh's spoken language translation (SLT) and
machine translation (MT) systems for the IWSLT 2014 evaluation campaign. In the SLT track …

A semi-Markov model for speech segmentation with an utterance-break prior

M Sinclair, P Bell, A Birch… - INTERSPEECH 2014 15th …, 2014 - research.ed.ac.uk
Speech segmentation is the problem of finding the end points of a speech utterance for
passing to an automatic speech recognition (ASR) system. The quality of this segmentation …

The UEDIN ASR systems for the IWSLT 2014 evaluation

P Bell, P Swietojanski, J Driesen… - Proceedings of the …, 2014 - aclanthology.org
This paper describes the University of Edinburgh (UEDIN) ASR systems for the 2014 IWSLT
Evaluation. Notable features of the English system include deep neural network acoustic …

Feed forward pre-training for recurrent neural network language models

SR Gangireddy, F McInnes, S Renals - … Annual Conference of the …, 2014 - isca-archive.org
The recurrent neural network language model (RNNLM) has been demonstrated to
consistently reduce perplexities and automatic speech recognition (ASR) word error rates …

Prosodically-enhanced recurrent neural network language models

SR Gangireddy, S Renals, Y Nankaku, A Lee - INTERSPEECH, 2015 - isca-archive.org
Recurrent neural network language models have been shown to consistently reduce the
word error rates (WERs) of large vocabulary speech recognition tasks. In this work we …

CRIM and LIUM approaches for multi-genre broadcast media transcription

V Gupta, P Deléglise, G Boulianne… - … IEEE Workshop on …, 2015 - ieeexplore.ieee.org
The Multi-Genre Broadcast Challenge at ASRU 2015 is a controlled evaluation of speech
recognition, speaker diarization, and lightly supervised alignment using BBC TV recordings …

Speech segmentation and speaker diarisation for transcription and translation

M Sinclair - 2016 - era.ed.ac.uk
This dissertation outlines work related to Speech Segmentation–segmenting an audio
recording into regions of speech and non-speech, and Speaker Diarization–further …