ViSQOL v3: An open source production ready objective speech and audio metric

M Chinen, FSC Lim, J Skoglund… - … on quality of …, 2020 - ieeexplore.ieee.org
Estimation of perceptual quality in audio and speech is possible using a variety of methods.
The combined v3 release of ViSQOL and ViSQOLAudio (for speech and audio …

CDPAM: Contrastive learning for perceptual audio similarity

P Manocha, Z Jin, R Zhang… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
Many speech processing methods based on deep learning require an automatic and
differentiable audio metric for the loss function. The DPAM approach of Manocha et al. [1] …

Objective measures of perceptual audio quality reviewed: An evaluation of their application domain dependence

M Torcoli, T Kastner, J Herre - IEEE/ACM Transactions on …, 2021 - ieeexplore.ieee.org
Over the past few decades, computational methods have been developed to estimate
perceptual audio quality. These methods, also referred to as objective quality measures, are …

ViSQOL: an objective speech quality model

A Hines, J Skoglund, AC Kokaram, N Harte - EURASIP Journal on Audio …, 2015 - Springer
This paper presents an objective speech quality model, ViSQOL, the Virtual Speech Quality
Objective Listener. It is a signal-based, full-reference, intrusive metric that models human …

NORESQA: A framework for speech quality assessment using non-matching references

P Manocha, B Xu, A Kumar - Advances in Neural …, 2021 - proceedings.neurips.cc
The perceptual task of speech quality assessment (SQA) is a challenging task for machines
to do. Objective SQA methods that rely on the availability of the corresponding clean …

A differentiable perceptual audio metric learned from just noticeable differences

P Manocha, A Finkelstein, R Zhang, NJ Bryan… - arXiv preprint arXiv …, 2020 - arxiv.org
Many audio processing tasks require perceptual assessment. The "gold standard" of
obtaining human judgments is time-consuming, expensive, and cannot be used as an …

Speech quality assessment through MOS using non-matching references

P Manocha, A Kumar - arXiv preprint arXiv:2206.12285, 2022 - arxiv.org
Human judgments obtained through Mean Opinion Scores (MOS) are the most reliable way
to assess the quality of speech signals. However, several recent attempts to automatically …

ViSQOLAudio: An objective audio quality metric for low bitrate codecs

A Hines, E Gillen, D Kelly, J Skoglund… - The Journal of the …, 2015 - pubs.aip.org
Streaming services seek to optimise their use of bandwidth across audio and visual
channels to maximise the quality of experience for users. This letter evaluates whether …

A speech quality classifier based on Tree-CNN algorithm that considers network degradations

ST Vieira, RL Rosa, DZ Rodríguez - Journal of Communications …, 2020 - hrcak.srce.hr
Many factors can affect the users' quality of experience (QoE) in speech
communication services. The impairment factors appear due to physical phenomena that …

Audio similarity is unreliable as a proxy for audio quality

P Manocha, Z Jin, A Finkelstein - arXiv preprint arXiv:2206.13411, 2022 - arxiv.org
Many audio processing tasks require perceptual assessment. However, the time and
expense of obtaining "gold standard" human judgments limit the availability of such data …