Robustness of speech quality metrics to background noise and network degradations: Comparing...

M Chinen, FSC Lim, J Skoglund… - … on quality of …, 2020 - ieeexplore.ieee.org

Estimation of perceptual quality in audio and speech is possible using a variety of methods.
The combined v3 release of ViSQOL and ViSQOLAudio (for speech and audio …

Cited by 110 | Related articles | All 9 versions

[PDF] arxiv.org

CDPAM: Contrastive learning for perceptual audio similarity

P Manocha, Z Jin, R Zhang… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org

Many speech processing methods based on deep learning require an automatic and
differentiable audio metric for the loss function. The DPAM approach of Manocha et al. [1] …

Cited by 64 | Related articles | All 6 versions

[PDF] ieee.org

Objective measures of perceptual audio quality reviewed: An evaluation of their application domain dependence

M Torcoli, T Kastner, J Herre - IEEE/ACM Transactions on …, 2021 - ieeexplore.ieee.org

Over the past few decades, computational methods have been developed to estimate
perceptual audio quality. These methods, also referred to as objective quality measures, are …

Cited by 56 | Related articles | All 7 versions

[PDF] springer.com

ViSQOL: an objective speech quality model

A Hines, J Skoglund, AC Kokaram, N Harte - EURASIP Journal on Audio …, 2015 - Springer

This paper presents an objective speech quality model, ViSQOL, the Virtual Speech Quality
Objective Listener. It is a signal-based, full-reference, intrusive metric that models human …

Cited by 146 | Related articles | All 18 versions

[PDF] neurips.cc

NORESQA: A framework for speech quality assessment using non-matching references

P Manocha, B Xu, A Kumar - Advances in Neural …, 2021 - proceedings.neurips.cc

The perceptual task of speech quality assessment (SQA) is a challenging task for machines
to do. Objective SQA methods that rely on the availability of the corresponding clean …

Cited by 34 | Related articles | All 8 versions

[PDF] arxiv.org

A differentiable perceptual audio metric learned from just noticeable differences

P Manocha, A Finkelstein, R Zhang, NJ Bryan… - arXiv preprint arXiv …, 2020 - arxiv.org

Many audio processing tasks require perceptual assessment. The "gold standard" of
obtaining human judgments is time-consuming, expensive, and cannot be used as an …

Cited by 74 | Related articles | All 9 versions

[PDF] arxiv.org

Speech quality assessment through MOS using non-matching references

P Manocha, A Kumar - arXiv preprint arXiv:2206.12285, 2022 - arxiv.org

Human judgments obtained through Mean Opinion Scores (MOS) are the most reliable way
to assess the quality of speech signals. However, several recent attempts to automatically …

Cited by 18 | Related articles | All 5 versions

[HTML] aip.org

[HTML] ViSQOLAudio: An objective audio quality metric for low bitrate codecs

A Hines, E Gillen, D Kelly, J Skoglund… - The Journal of the …, 2015 - pubs.aip.org

Streaming services seek to optimise their use of bandwidth across audio and visual
channels to maximise the quality of experience for users. This letter evaluates whether …

Cited by 60 | Related articles | All 13 versions

[PDF] srce.hr

A speech quality classifier based on tree-cnn algorithm that considers network degradations

ST Vieira, RL Rosa, DZ Rodríguez - Journal of Communications …, 2020 - hrcak.srce.hr

Abstract: Many factors can affect the users' quality of experience (QoE) in speech
communication services. The impairment factors appear due to physical phenomena that …

Cited by 22 | Related articles | All 9 versions

[PDF] arxiv.org

Audio similarity is unreliable as a proxy for audio quality

P Manocha, Z Jin, A Finkelstein - arXiv preprint arXiv:2206.13411, 2022 - arxiv.org

Many audio processing tasks require perceptual assessment. However, the time and
expense of obtaining "gold standard" human judgments limit the availability of such data …

Cited by 9 | Related articles | All 7 versions
