Maximum likelihood linear regression for speaker adaptation of continuous density hidden...

G Csurka - arXiv preprint arXiv:1702.05374, 2017 - arxiv.org

The aim of this paper is to give an overview of domain adaptation and transfer learning with
a specific view on visual applications. After a general motivation, we first position domain …

被引用次数：620 相关文章所有 3 个版本

[PDF] arxiv.org

Deep learning for environmentally robust speech recognition: An overview of recent developments

Z Zhang, J Geiger, J Pohjalainen, AED Mousa… - ACM Transactions on …, 2018 - dl.acm.org

Eliminating the negative effect of non-stationary environmental noise is a long-standing
research topic for automatic speech recognition but still remains an important challenge …

被引用次数：389 相关文章所有 10 个版本

[PDF] mlr.press

Agnostic federated learning

M Mohri, G Sivek, AT Suresh - International Conference on …, 2019 - proceedings.mlr.press

A key learning scenario in large-scale applications is that of federated learning, where a
centralized model is trained based on data originating from a large number of clients. We …

被引用次数：931 相关文章所有 6 个版本

[PDF] arxiv.org

An introduction to domain adaptation and transfer learning

WM Kouw, M Loog - arXiv preprint arXiv:1812.11806, 2018 - arxiv.org

In machine learning, if the training data is an unbiased sample of an underlying distribution,
then the learned classification function will make accurate predictions for new samples …

被引用次数：389 相关文章所有 7 个版本

[PDF] aaai.org

[PDF][PDF] Encoding time series as images for visual inspection and classification using tiled convolutional neural networks

Z Wang, T Oates - Workshops at the twenty-ninth AAAI conference on …, 2015 - cdn.aaai.org

Inspired by recent successes of deep learning in computer vision and speech recognition,
we propose a novel framework to encode time series data as different types of images …

被引用次数：627 相关文章所有 5 个版本

[PDF] drwuz.com

Spoofing and countermeasures for speaker verification: A survey

Z Wu, N Evans, T Kinnunen, J Yamagishi, F Alegre… - speech …, 2015 - Elsevier

While biometric authentication has advanced significantly in recent years, evidence shows
the technology can be susceptible to malicious spoofing attacks. The research community …

被引用次数：711 相关文章所有 13 个版本

[PDF] apa.org

Robust speech perception: recognize the familiar, generalize to the similar, and adapt to the novel.

DF Kleinschmidt, TF Jaeger - Psychological review, 2015 - psycnet.apa.org

Successful speech perception requires that listeners map the acoustic signal to linguistic
categories. These mappings are not only probabilistic, but change depending on the …

被引用次数：673 相关文章所有 13 个版本

[PDF] idiap.ch

Geodesic flow kernel for unsupervised domain adaptation

B Gong, Y Shi, F Sha, K Grauman - 2012 IEEE conference on …, 2012 - ieeexplore.ieee.org

In real-world applications of visual recognition, many factors-such as pose, illumination, or
image quality-can cause a significant mismatch between the source domain on which …

被引用次数：2771 相关文章所有 18 个版本

[PDF] epfl.ch

The Kaldi speech recognition toolkit

D Povey, A Ghoshal, G Boulianne… - IEEE 2011 workshop …, 2011 - infoscience.epfl.ch

We describe the design of Kaldi, a free, open-source toolkit for speech recognition research.
Kaldi provides a speech recognition system based on finite-state transducers (using the …

被引用次数：7341 相关文章所有 27 个版本

Data augmentation for deep neural network acoustic modeling

X Cui, V Goel, B Kingsbury - IEEE/ACM Transactions on Audio …, 2015 - ieeexplore.ieee.org

This paper investigates data augmentation for deep neural network acoustic modeling
based on label-preserving transformations to deal with data sparsity. Two data …

被引用次数：528 相关文章所有 13 个版本

高级搜索

QQ 群