Accent modification for speech recognition of non-native speakers using neural style transfer

AS Dhanjal, W Singh - Multimedia Tools and Applications, 2024 - Springer

The continuous development in Automatic Speech Recognition has grown and
demonstrated its enormous potential in Human Interaction Communication systems. It is …

Cited by 40

Voice-based interaction for an aging population: a systematic review

S Pednekar, P Dhirawani, R Shah… - 2023 3rd …, 2023 - ieeexplore.ieee.org

In the past twenty years, voice-based systems have emerged as a technology with high
usability and accessibility. Research in the field of Human-Computer Interaction (HCI) has …

Cited by 6

Speech technology for everyone: Automatic speech recognition for non-native english with transfer learning

T Shibano, X Zhang, MT Li, H Cho, P Sullivan… - arXiv preprint arXiv …, 2021 - arxiv.org

To address the performance gap of English ASR models on L2 English speakers, we
evaluate fine-tuning of pretrained wav2vec 2.0 models (Baevski et al., 2020; Xu et al., 2021) …

Cited by 18

CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice

J Zuluaga-Gomez, S Ahmed, D Visockas… - arXiv preprint arXiv …, 2023 - arxiv.org

Despite the recent advancements in Automatic Speech Recognition (ASR), the recognition
of accented speech still remains a dominant problem. In order to create more inclusive ASR …

Cited by 11

Improving automatic speech recognition for non-native English with transfer learning and language model decoding

P Sullivan, T Shibano, M Abdul-Mageed - Analysis and Application of …, 2022 - Springer

ASR systems designed for native English (L1) usually underperform on non-native English
(L2). To address this performance gap, (1) we extend our previous work to investigate fine …

Cited by 16

Using Character-Level Sequence-to-Sequence Model for Word Level Text Generation to Enhance Arabic Speech Recognition

MA Azim, W Hussein, NL Badr - IEEE Access, 2023 - ieeexplore.ieee.org

Owing to the linguistic richness of the Arabic language, which contains more than 6000
roots, building a reliable Arabic language model for Arabic speech recognition systems …

Cited by 4

CAPTuring accents: An approach to personalize pronunciation training for learners with different L1 backgrounds

V Khaustova, E Pyshkin, V Khaustov, J Blake… - … Conference on Speech …, 2023 - Springer

This paper presents a novel approach to addressing the often-overlooked issue of
pronunciation instruction in language learning through a Computer-Assisted Pronunciation …

Cited by 4

Data-driven personalisation of television content: a survey

L Nixon, J Foss, K Apostolidis, V Mezaris - Multimedia Systems, 2022 - Springer

This survey considers the vision of TV broadcasting where content is personalised and
personalisation is data-driven, looks at the AI and data technologies making this possible …

Cited by 6

Evaluation of the effectiveness of preschool English learning applications based on touch and voice multimodal interaction technique

TSM Tengku Wook, SF Mat Noor… - Universal Access in the …, 2023 - Springer

The rising development in information and communication technology affected the rapid
presence of new interface designs to meet the needs of user interaction. This includes the …

Cited by 2

Enhancing Communication Equity: Evaluation of an Automated Speech Recognition Application in Ghana

G Ayoka, G Barbareschi, R Cave… - Proceedings of the CHI …, 2024 - dl.acm.org

In Ghana people who struggle to articulate speech as a result of different conditions
experience barriers in interacting with others due to difficulties in being understood …