In this paper we study perceptual evaluation data collection for the purposes of machine learning. Well-established listening test methods have been developed and standardised in the audio community over many years. This paper looks at the specific needs of machine learning and seeks to establish efficient data collection methods that address those needs whilst also providing robust and repeatable perceptual evaluation results. Following a short review of efficient data collection techniques, including the concept of data augmentation, we introduce the new concept of pre-augmentation as an alternative efficient data collection approach. Multiple stimulus presentation style listening tests are then presented for the evaluation of headphones spanning a wide range of audio quality, rated by a panel of trained expert assessors. Two tests are presented, one using a traditional full factorial design and one using a pre-augmented design, to enable a performance comparison of the two approaches. The two approaches are statistically analysed and discussed. Finally, the performance of the two approaches for building machine learning models is reviewed by comparing a range of baseline models.
Author(s): Volk, Christer P.; Nordby, Jon; Stegenborg-Andersen, Tore; Zacharov, Nick
Affiliation:
FORCE Technology, SenseLab, Hørsholm, Denmark; Soundsensing, Oslo, Norway
(See document for exact affiliation information.)
Publication Date:
2020-08-06
Session subject:
Audio Quality/Standards

Volk, Christer P.; Nordby, Jon; Stegenborg-Andersen, Tore; Zacharov, Nick; 2020; Efficient data collection pipeline for audio machine learning of audio quality [PDF]; FORCE Technology, SenseLab, Hørsholm, Denmark; Soundsensing, Oslo, Norway; Paper 10488; Available from: https://aes.org/publications/elibrary-page/?id=21165
Volk, Christer P.; Nordby, Jon; Stegenborg-Andersen, Tore; Zacharov, Nick; Efficient data collection pipeline for audio machine learning of audio quality [PDF]; FORCE Technology, SenseLab, Hørsholm, Denmark; Soundsensing, Oslo, Norway; Paper 10488; 2020. Available: https://aes.org/publications/elibrary-page/?id=21165
@inproceedings{Volk2020efficient,
title={{Efficient data collection pipeline for audio machine learning of audio quality}},
author={Volk, Christer P. and Nordby, Jon and Stegenborg-Andersen, Tore and Zacharov, Nick},
year={2020},
month={aug},
booktitle={2020 AES International Conference on Audio for Virtual and Augmented Reality (August 2020)},
publisher={Audio Engineering Society},
number={10488},
organization={AES},
}