You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
The growing usage of machine learning and artificial intelligence within audio signal processing underscores the significance of high-quality audio datasets for advancing audio algorithms such as speech enhancement, echo cancellation, de-reverberation, and blind room estimation. However, the conventional approaches of collecting such data present various limitations. Measurement based approaches are costly and time-consuming, and synthetic data generation using standard acoustics simulation methodology has been shown to generalize poorly to real world scenarios, due to limitations in capturing the intricacies of real world room acoustics. In this paper, we present a framework that offers a solution to the challenges associated with dataset creation, enabling the efficient production of extensive datasets that closely mimic real-world audio scenes, thereby enhancing the efficacy of machine learning models. Through the lens of a specific use-case illustration, we highlight the integration of a hybrid wave-based / geometrical acoustics simulation for dataset generation. Notably, our focus extends to accurate device modeling—a critical aspect for the development of multiple-microphone devices and subsequent refinement of machine learning algorithms. We illustrate how the dataset accuracy surpasses the standard limitations of geometrical acoustics simulations. We present analysis of the computational performance of the system and we demonstrate examples of improved machine learning performance using the data.
Author (s): Driscoll, Erin;
Cosnefroy, Matthias;
Hafsteinsson, Haukur;
Stefánsson, Jón;
Pind, Finnur;
Affiliation:
Treble Technologies; Treble Technologies; Treble Technologies; Treble Technologies; Treble Technologies
(See document for exact affiliation information.)
AES Convention: 155
Paper Number:177
Publication Date:
2023-10-06
Session subject:
Room Acoustics
DOI:
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Driscoll, Erin; Cosnefroy, Matthias; Hafsteinsson, Haukur; Stefánsson, Jón; Pind, Finnur; 2023; Data generation with device-modeling using Treble’s hybrid cloud-based system [PDF]; Treble Technologies; Treble Technologies; Treble Technologies; Treble Technologies; Treble Technologies; Paper 177; Available from: https://aes.org/publications/elibrary-page/?id=22331
Driscoll, Erin; Cosnefroy, Matthias; Hafsteinsson, Haukur; Stefánsson, Jón; Pind, Finnur; Data generation with device-modeling using Treble’s hybrid cloud-based system [PDF]; Treble Technologies; Treble Technologies; Treble Technologies; Treble Technologies; Treble Technologies; Paper 177; 2023 Available: https://aes.org/publications/elibrary-page/?id=22331
@inproceedings{Driscoll2023data,
title={{Data generation with device-modeling using Treble’s hybrid
cloud-based system}},
author={Driscoll, Erin and Cosnefroy, Matthias and Hafsteinsson, Haukur and Stefánsson, Jón and Pind, Finnur},
year={2023},
month={oct},
booktitle={Journal of the Audio Engineering Society},
publisher={Express Paper 177; AES Convention 155; October 2023},
number={177},
organization={AES},
}
Notifications