Audio Quality Prediction of Non–Waveform Preserving Distortions in Realistic Complex Scenes

Biberger, Thomas and Fleßner, Jan-Hendrik and Ewert, Stephan D.

AES E-Library

Audio Quality Prediction of Non–Waveform Preserving Distortions in Realistic Complex Scenes

Intrusive audio quality models typically compare “internal representations” of a reference and a test signal. These models are often optimized for the prediction of small signal degradations, where the test and reference signals are still highly correlated (waveform preserving distortions). However, differences between uncorrelated signals like two Gaussian-noise tokens or, for example, more complex, realistic signals in spatial audio reproduction schemes, that show only partial correlation (non–waveform preserving distortions) are not necessarily easy to distinguish by listeners. Despite this, current audio quality models typically predict large perceptual differences between such signals. Here, the decision back-end of a reference-based audio quality model was modified to account for this overestimation of signal quality differences. The suggested modifications were intended to effectively mimic short-term memory limitations by analyzing similarities in the differences between the internal representations of reference and test signals across time frames, auditory channels, and modulation channels. The modified model was evaluated with data based on different audio reproduction and room simulation methods and was compared to other state-of-the-art audio quality models. Results support the need for modifications of state-of-the-art audio quality models to accurately predict the perceptual effects of non–waveform preserving distortions.

Author (s): Biberger, Thomas; Fleßner, Jan-Hendrik; Ewert, Stephan D.;
Affiliation: Medizinische Physik and Cluster of Excellence Hearing4all, Universitat Oldenburg, Oldenburg, Germany; Medizinische Physik and Cluster of Excellence Hearing4all, Universität Oldenburg, Oldenburg, Germany; Hörtech gGmbH and Cluster of Excellence Hearing4All, Oldenburg, Germany; Medizinische Physik and Cluster of Excellence Hearing4All, Universitat Oldenburg, Oldenburg, Germany (See document for exact affiliation information.)
Publication Date: 2025-11-10

DOI:

Type: Journal Article

AES Conventions

AES Conferences

AES Training & Development

AES Inside Track

Journal of the AES

AES E-library

Special Publications

AES Sections are active around the world and provide a means for members to meet locally.

AES Student Website

AES Educational Foundation

Student Sections

See the committee’s accomplishments in diversity & inclusion

AES Statement of solidarity

Richard C. Heyser Memorial Lecture Series

AES E-Library

Audio Quality Prediction of Non–Waveform Preserving Distortions in Realistic Complex Scenes

Choose your country of residence from this list:

AES E-Library

Login Institutions

Audio Quality Prediction of Non–Waveform Preserving Distortions in Realistic Complex Scenes

Choose your country of residence from this list: