You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
Intrusive audio quality models typically compare “internal representations” of a reference and a test signal. These models are often optimized for the prediction of small signal degradations, where the test and reference signals are still highly correlated (waveform preserving distortions). However, differences between uncorrelated signals like two Gaussian-noise tokens or, for example, more complex, realistic signals in spatial audio reproduction schemes, that show only partial correlation (non–waveform preserving distortions) are not necessarily easy to distinguish by listeners. Despite this, current audio quality models typically predict large perceptual differences between such signals. Here, the decision back-end of a reference-based audio quality model was modified to account for this overestimation of signal quality differences. The suggested modifications were intended to effectively mimic short-term memory limitations by analyzing similarities in the differences between the internal representations of reference and test signals across time frames, auditory channels, and modulation channels. The modified model was evaluated with data based on different audio reproduction and room simulation methods and was compared to other state-of-the-art audio quality models. Results support the need for modifications of state-of-the-art audio quality models to accurately predict the perceptual effects of non–waveform preserving distortions.
Author (s): Biberger, Thomas;
Fleßner, Jan-Hendrik;
Ewert, Stephan D.;
Affiliation:
Medizinische Physik and Cluster of Excellence Hearing4all, Universitat Oldenburg, Oldenburg, Germany; Medizinische Physik and Cluster of Excellence Hearing4all, Universität Oldenburg, Oldenburg, Germany; Hörtech gGmbH and Cluster of Excellence Hearing4All, Oldenburg, Germany; Medizinische Physik and Cluster of Excellence Hearing4All, Universitat Oldenburg, Oldenburg, Germany
(See document for exact affiliation information.)
Publication Date:
2025-11-10
DOI:

Biberger, Thomas; Fleßner, Jan-Hendrik; Ewert, Stephan D.; 2025; Audio Quality Prediction of Non–Waveform Preserving Distortions in Realistic Complex Scenes [PDF]; Medizinische Physik and Cluster of Excellence Hearing4all, Universitat Oldenburg, Oldenburg, Germany; Medizinische Physik and Cluster of Excellence Hearing4all, Universität Oldenburg, Oldenburg, Germany; Hörtech gGmbH and Cluster of Excellence Hearing4All, Oldenburg, Germany; Medizinische Physik and Cluster of Excellence Hearing4All, Universitat Oldenburg, Oldenburg, Germany; Paper ; Available from: https://aes.org/publications/elibrary-page/?id=23103
Biberger, Thomas; Fleßner, Jan-Hendrik; Ewert, Stephan D.; Audio Quality Prediction of Non–Waveform Preserving Distortions in Realistic Complex Scenes [PDF]; Medizinische Physik and Cluster of Excellence Hearing4all, Universitat Oldenburg, Oldenburg, Germany; Medizinische Physik and Cluster of Excellence Hearing4all, Universität Oldenburg, Oldenburg, Germany; Hörtech gGmbH and Cluster of Excellence Hearing4All, Oldenburg, Germany; Medizinische Physik and Cluster of Excellence Hearing4All, Universitat Oldenburg, Oldenburg, Germany; Paper ; 2025 Available: https://aes.org/publications/elibrary-page/?id=23103
@article{Biberger2025audio,
title={{Audio Quality Prediction of Non–Waveform Preserving Distortions in Realistic Complex Scenes}},
author={Biberger, Thomas and Fleßner, Jan-Hendrik and Ewert, Stephan D.},
year={2025},
month={jun},
journal={Journal of the Audio Engineering Society},
volume={73},
number={11},
pages={707-721},
}
Notifications