AES E-Library

Room Geometry Estimation from Higher-Order Ambisonics Signals using Convolutional Recurrent Neural Networks

Knowledge of room geometry is a fundamental component for modeling acoustic environments. Since most common methods for room geometry estimation are based on prior knowledge, the generalization to unknown environments is somewhat limited. Deep learning based approaches have delivered promising results for the blind estimation of acoustic parameters considering mainly monaural signals. The purpose of this contribution is to investigate the effect of multichannel higher-order Ambisonics (HOA) signals on the performance of a convolutional recurrent neural network for blind room geometry estimation. Therefore a HOA-dataset of noisy speech signals in simulated rooms with realistic frequency-dependent reflection coefficients is introduced. Results show that for each additional Ambisonics order the estimation performance increases with the fourth-order model achieving a mean absolute error of 1.24 m averaged over all three room dimensions.

 

Author (s):
Affiliation: (See document for exact affiliation information.)
AES Convention: Paper Number:
Publication Date:
Session subject:

DOI:


Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Type:
16938
Choose your country of residence from this list: