Domain adaptation for visual applications: A comprehensive survey

G Csurka - arXiv preprint arXiv:1702.05374, 2017 - arxiv.org
The aim of this paper is to give an overview of domain adaptation and transfer learning with
a specific view on visual applications. After a general motivation, we first position domain …

Deep learning for environmentally robust speech recognition: An overview of recent developments

Z Zhang, J Geiger, J Pohjalainen, AED Mousa… - ACM Transactions on …, 2018 - dl.acm.org
Eliminating the negative effect of non-stationary environmental noise is a long-standing
research topic for automatic speech recognition but still remains an important challenge …

Agnostic federated learning

M Mohri, G Sivek, AT Suresh - International Conference on …, 2019 - proceedings.mlr.press
A key learning scenario in large-scale applications is that of federated learning, where a
centralized model is trained based on data originating from a large number of clients. We …

An introduction to domain adaptation and transfer learning

WM Kouw, M Loog - arXiv preprint arXiv:1812.11806, 2018 - arxiv.org
In machine learning, if the training data is an unbiased sample of an underlying distribution,
then the learned classification function will make accurate predictions for new samples …

[PDF][PDF] Encoding time series as images for visual inspection and classification using tiled convolutional neural networks

Z Wang, T Oates - Workshops at the twenty-ninth AAAI conference on …, 2015 - cdn.aaai.org
Inspired by recent successes of deep learning in computer vision and speech recognition,
we propose a novel framework to encode time series data as different types of images …

Spoofing and countermeasures for speaker verification: A survey

Z Wu, N Evans, T Kinnunen, J Yamagishi, F Alegre… - speech …, 2015 - Elsevier
While biometric authentication has advanced significantly in recent years, evidence shows
the technology can be susceptible to malicious spoofing attacks. The research community …

Robust speech perception: recognize the familiar, generalize to the similar, and adapt to the novel.

DF Kleinschmidt, TF Jaeger - Psychological review, 2015 - psycnet.apa.org
Successful speech perception requires that listeners map the acoustic signal to linguistic
categories. These mappings are not only probabilistic, but change depending on the …

Geodesic flow kernel for unsupervised domain adaptation

B Gong, Y Shi, F Sha, K Grauman - 2012 IEEE conference on …, 2012 - ieeexplore.ieee.org
In real-world applications of visual recognition, many factors-such as pose, illumination, or
image quality-can cause a significant mismatch between the source domain on which …

The Kaldi speech recognition toolkit

D Povey, A Ghoshal, G Boulianne… - IEEE 2011 workshop …, 2011 - infoscience.epfl.ch
We describe the design of Kaldi, a free, open-source toolkit for speech recognition research.
Kaldi provides a speech recognition system based on finite-state transducers (using the …

Data augmentation for deep neural network acoustic modeling

X Cui, V Goel, B Kingsbury - IEEE/ACM Transactions on Audio …, 2015 - ieeexplore.ieee.org
This paper investigates data augmentation for deep neural network acoustic modeling
based on label-preserving transformations to deal with data sparsity. Two data …