The accented english speech recognition challenge 2020: open datasets, tracks, baselines, results and methods

X Shi, F Yu, Y Lu, Y Liang, Q Feng… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
The variety of accents has posed a big challenge to speech recognition. The Accented
English Speech Recognition Challenge (AESRC2020) is designed for providing a common …

E2E-based multi-task learning approach to joint speech and accent recognition

J Zhang, Y Peng, P Van Tung, H Xu, H Huang… - arXiv preprint arXiv …, 2021 - arxiv.org
In this paper, we propose a single multi-task learning framework to perform End-to-End
(E2E) speech recognition (ASR) and accent recognition (AR) simultaneously. The proposed …

Improvement of accent classification models through Grad-Transfer from Spectrograms and Gradient-weighted Class Activation Mapping

A Carofilis, E Alegre, E Fidalgo… - … on Audio, Speech …, 2023 - ieeexplore.ieee.org
Automatic accent classification is an active research field concerning speech processing. It
can be useful to identify a speaker's region of origin, which can be applied in police …

Aispeech-sjtu accent identification system for the accented english speech recognition challenge

H Huang, X Xiang, Y Yang, R Ma… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
This paper describes the AISpeech-SJTU system for the accent identification track of the
Interspeech-2020 Accented English Speech Recognition Challenge. In this challenge track …

[PDF][PDF] An End-to-End Dialect Identification System with Transfer Learning from a Multilingual Automatic Speech Recognition Model.

D Wang, S Ye, X Hu, S Li, X Xu - Interspeech, 2021 - researchgate.net
In this paper, we propose an end-to-end (E2E) dialect identification system trained using
transfer learning from a multilingual automatic speech recognition (ASR) model. This is also …

Text-conditioned transformer for automatic pronunciation error detection

Z Zhang, Y Wang, J Yang - Speech Communication, 2021 - Elsevier
Automatic pronunciation error detection (APED) plays an important role in the domain of
language learning. As for the previous ASR-based APED methods, the decoded results …

[HTML][HTML] A review of social background profiling of speakers from speech accents

MA Humayun, J Shuja, PE Abas - PeerJ Computer Science, 2024 - peerj.com
Social background profiling of speakers is heavily used in areas, such as, speech forensics,
and tuning speech recognition for accuracy improvement. This article provides a survey of …

A comprehensive Turkish accent/dialect recognition system using acoustic perceptual formants

Y Korkmaz, A Boyacı - Applied Acoustics, 2022 - Elsevier
Accent or dialect is one of the hot topics of emerging technology in speech processing. In an
audio recording, extracting accent clues from a speech signal can help investigators to have …

Vfnet: A convolutional architecture for accent classification

A Ahmed, P Tangri, A Panda, D Ramani… - 2019 IEEE 16th India …, 2019 - ieeexplore.ieee.org
Understanding accent is an issue which can derail any human-machine interaction. Accent
classification makes this task easier by identifying the accent being spoken by a person so …

Investigating the role of L1 in automatic pronunciation evaluation of L2 speech

M Tu, A Grabek, J Liss, V Berisha - arXiv preprint arXiv:1807.01738, 2018 - arxiv.org
Automatic pronunciation evaluation plays an important role in pronunciation training and
second language education. This field draws heavily on concepts from automatic speech …