[PDF][PDF] Speech recognition utilizing deep learning: A systematic review of the latest developments

D Al-Fraihat, Y Sharrab, F Alzyoud… - Human-centric …, 2024 - researchgate.net
Speech recognition is a natural language processing task that involves the computerized
transcription of spoken language in real time. Numerous studies have been conducted on …

An overview of bengali speech recognition: Methods, challenges, and future direction

N Tasnia, M Islam, MS Rony, N Tanzim… - 2023 IEEE 13th …, 2023 - ieeexplore.ieee.org
In the subject of human-computer interactions, speech recognition is an appealing
technique that gives users the opportunity to interact with and control the machine. Currently …

Cepstral and acoustic ternary pattern based hybrid feature extraction approach for end-to-end bangla speech recognition

M Dua, Akanksha, S Dua - Journal of Ambient Intelligence and Humanized …, 2023 - Springer
In the last three decades, a lot of work has been done for building Automatic Speech
Recognition (ASR) systems for well-established languages such as English, Chinese, etc …

BanSpeech: A Multi-domain Bangla Speech Recognition Benchmark Towards Robust Performance in Challenging Conditions

AM Samin, MH Kobir, MMS Rafee, MF Ahmed… - IEEE …, 2024 - ieeexplore.ieee.org
Despite huge improvements in automatic speech recognition (ASR) employing neural
networks, ASR systems still suffer from a lack of robustness and generalizability issues due …

Byte Pair Encoding Is All You Need For Automatic Bengali Speech Recognition

AM Samin - arXiv preprint arXiv:2401.15532, 2024 - arxiv.org
Byte pair encoding (BPE) emerges as an effective tokenization method for tackling the out-of-
vocabulary (OOV) challenge in various natural language and speech processing tasks …

BanglaDialecto: An End-to-End AI-Powered Regional Speech Standardization

MNS Samin, JI Ahad, TA Medha… - … Conference on Big …, 2024 - ieeexplore.ieee.org
This study focuses on recognizing Bangladeshi dialects and converting diverse Bengali
accents into standardized formal Bengali speech. Dialects, often referred to as regional …

基于带阈值的BPE-dropout 多任务学习的端到端语音识别.

马建, 朵琳, 韦贵香, 唐剑 - Journal of Jilin University …, 2024 - search.ebscohost.com
针对语音识别任务中出现的未登录词问题, 提出一种带阈值的BPE-dropout 多任务学习语音识别
方法. 该方法采用带随机性的字节对编码算法, 在形成子词时引入带字数阈值的策略 …

[HTML][HTML] Restoration of Ghost Imaging in Atmospheric Turbulence Based on Deep Learning

C Jiang, B Xu, L Zhang, D Zhang - Current Optics and Photonics, 2023 - coppjournal.org
Ghost imaging (GI) technology is developing rapidly, but there are inevitably some
limitations such as the influence of atmospheric turbulence. In this paper, we study a ghost …

Investigating self-supervised, weakly supervised and fully supervised training approaches for multi-domain automatic speech recognition: a study on Bangladeshi …

AM Samin, MH Kobir, MMS Rafee, MF Ahmed… - arXiv preprint arXiv …, 2022 - arxiv.org
Despite huge improvements in automatic speech recognition (ASR) employing neural
networks, ASR systems still suffer from a lack of robustness and generalizability issues due …

Silent voice: harnessing deep learning for lip-reading in Bangla

M Shaheen, AZ Ifti, A Hassan, J Hossain - 2024 - dspace.bracu.ac.bd
Understanding speech just through lip movement is known as lipreading. It is a crucial
component of interpersonal interactions. The majority of the previous initiatives attempted to …