Recent advances in convolutional neural networks

J Gu, Z Wang, J Kuen, L Ma, A Shahroudy, B Shuai… - Pattern recognition, 2018 - Elsevier
In the last few years, deep learning has led to very good performance on a variety of
problems, such as visual recognition, speech recognition and natural language processing …

Deep learning for audio signal processing

H Purwins, B Li, T Virtanen, J Schlüter… - IEEE Journal of …, 2019 - ieeexplore.ieee.org
Given the recent surge in developments of deep learning, this paper provides a review of the
state-of-the-art deep learning techniques for audio signal processing. Speech, music, and …

A simple convolutional generative network for next item recommendation

F Yuan, A Karatzoglou, I Arapakis, JM Jose… - Proceedings of the twelfth …, 2019 - dl.acm.org
Convolutional Neural Networks (CNNs) have been recently introduced in the domain of
session-based next item recommendation. An ordered collection of past items the user has …

Temporal convolutional networks for the advance prediction of ENSO

J Yan, L Mu, L Wang, R Ranjan, AY Zomaya - Scientific reports, 2020 - nature.com
Abstract El Niño-Southern Oscillation (ENSO), which is one of the main drivers of Earth's
inter-annual climate variability, often causes a wide range of climate anomalies, and the …

English conversational telephone speech recognition by humans and machines

G Saon, G Kurata, T Sercu, K Audhkhasi… - arXiv preprint arXiv …, 2017 - arxiv.org
One of the most difficult speech recognition tasks is accurate recognition of human to human
communication. Advances in deep learning over the last few years have produced major …

Deep insight: Convolutional neural network and its applications for COVID-19 prognosis

NY Khanday, SA Sofi - Biomedical Signal Processing and Control, 2021 - Elsevier
Background and objective SARS-CoV-2, a novel strain of coronavirus' also called
coronavirus disease 19 (COVID-19), a highly contagious pathogenic respiratory viral …

Exploring neural transducers for end-to-end speech recognition

E Battenberg, J Chen, R Child, A Coates… - 2017 IEEE automatic …, 2017 - ieeexplore.ieee.org
In this work, we perform an empirical comparison among the CTC, RNN-Transducer, and
attention-based Seq2Seq models for end-to-end speech recognition. We show that, without …

Gated residual networks with dilated convolutions for monaural speech enhancement

K Tan, J Chen, DL Wang - IEEE/ACM transactions on audio …, 2018 - ieeexplore.ieee.org
For supervised speech enhancement, contextual information is important for accurate mask
estimation or spectral mapping. However, commonly used deep neural networks (DNNs) are …

Recent progresses in deep learning based acoustic models

D Yu, J Li - IEEE/CAA Journal of automatica sinica, 2017 - ieeexplore.ieee.org
In this paper, we summarize recent progresses made in deep learning based acoustic
models and the motivation and insights behind the surveyed techniques. We first discuss …

Convolutional neural networks for crowd behaviour analysis: a survey

G Tripathi, K Singh, DK Vishwakarma - The Visual Computer, 2019 - Springer
Interest in automatic crowd behaviour analysis has grown considerably in the last few years.
Crowd behaviour analysis has become an integral part all over the world for ensuring …