An attention enhanced multi-task model for objective speech assessment in real-world environments

A Mehrish, N Majumder, R Bharadwaj, R Mihalcea… - Information …, 2023 - Elsevier

The field of speech processing has undergone a transformative shift with the advent of deep
learning. The use of multiple processing layers has enabled the creation of models capable …

被引用次数：215 相关文章所有 6 个版本

[PDF] ieee.org Full View

Icassp 2023 deep noise suppression challenge

H Dubey, A Aazami, V Gopal, B Naderi… - IEEE Open Journal …, 2024 - ieeexplore.ieee.org

The ICASSP 2023 Deep Noise Suppression (DNS) Challenge marks the fifth edition of the
DNS challenge series. DNS challenges were organized from 2019 to 2023 to foster …

被引用次数：229 相关文章所有 14 个版本

[PDF] arxiv.org

DNSMOS: A non-intrusive perceptual objective speech quality metric to evaluate noise suppressors

CKA Reddy, V Gopal, R Cutler - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org

Human subjective evaluation is the" gold standard" to evaluate speech quality optimized for
human perception. Perceptual objective metrics serve as a proxy for subjective scores. The …

被引用次数：299 相关文章所有 4 个版本

[PDF] arxiv.org

NISQA: A deep CNN-self-attention model for multidimensional speech quality prediction with crowdsourced datasets

G Mittag, B Naderi, A Chehadi, S Möller - arXiv preprint arXiv:2104.09494, 2021 - arxiv.org

In this paper, we present an update to the NISQA speech quality prediction model that is
focused on distortions that occur in communication networks. In contrast to the previous …

被引用次数：239 相关文章所有 6 个版本

[PDF] arxiv.org

DNSMOS P. 835: A non-intrusive perceptual objective speech quality metric to evaluate noise suppressors

CKA Reddy, V Gopal, R Cutler - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org

Human subjective evaluation is the" gold standard" to evaluate speech quality optimized for
human perception. Perceptual objective metrics serve as a proxy for subjective scores. We …

被引用次数：206 相关文章所有 3 个版本

[PDF] ieee.org

Deep learning-based non-intrusive multi-objective speech assessment model with cross-domain features

RE Zezario, SW Fu, F Chen, CS Fuh… - … on Audio, Speech …, 2022 - ieeexplore.ieee.org

This study proposes a cross-domain multi-objective speech assessment model, called
MOSA-Net, which can simultaneously estimate the speech quality, intelligibility, and …

被引用次数：85 相关文章所有 7 个版本

[PDF] neurips.cc

NORESQA: A framework for speech quality assessment using non-matching references

P Manocha, B Xu, A Kumar - Advances in neural …, 2021 - proceedings.neurips.cc

The perceptual task of speech quality assessment (SQA) is a challenging task for machines
to do. Objective SQA methods that rely on the availability of the corresponding clean …

被引用次数：48 相关文章所有 8 个版本

[PDF] arxiv.org

Torchaudio-squim: Reference-less speech quality and intelligibility measures in torchaudio

A Kumar, K Tan, Z Ni, P Manocha… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org

Measuring quality and intelligibility of a speech signal is usually a critical step in
development of speech processing systems. To enable this, a variety of metrics to measure …

被引用次数：40 相关文章所有 3 个版本

[PDF] arxiv.org

A study on incorporating Whisper for robust speech assessment

RE Zezario, YW Chen, SW Fu, Y Tsao… - … on Multimedia and …, 2024 - ieeexplore.ieee.org

This research introduces an enhanced version of the multi-objective speech assessment
model–MOSA-Net+, by leveraging the acoustic features from Whisper, a large-scaled …

被引用次数：13 相关文章所有 4 个版本

[PDF] arxiv.org

Metricnet: Towards improved modeling for non-intrusive speech quality assessment

M Yu, C Zhang, Y Xu, S Zhang, D Yu - arXiv preprint arXiv:2104.01227, 2021 - arxiv.org

The objective speech quality assessment is usually conducted by comparing received
speech signal with its clean reference, while human beings are capable of evaluating the …

被引用次数：34 相关文章所有 6 个版本

高级搜索

QQ 群