You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
This paper proposes a novel direct source and first-order reflections localization method by integrating the high order Ambisonics (HOA) algorithm and deep neural network. We use the covariance matrix of HOA signals in the time domain as the input feature of the network, which contains precise spatial information of the sound sources under reverberant scenarios. Besides, we use the deconvolution-based neural network (DCNN) for the spatial pseudo-spectrum (SPS) reconstruction, based on which the spatial relationship between elevation and azimuth can be depicted. Considering that the first-order reflections of the sound source also contain spatial directivity like the direct source, we treat both of them as the sources in the learning process. We have carried out a series of experiments based on simulated and measured data under different reverberant scenarios, which prove the effectiveness and accuracy of the proposed DCNN model.
Author (s): Gao, Shan;
Wu, Xihong;
Qu, Tianshu;
Affiliation:
Key Laboratory on Machine Perception (Ministry of Education), School of Artificial Intelligence, Peking University, Beijing, China
(See document for exact affiliation information.)
AES Convention: 152
Paper Number:10560
Publication Date:
2022-05-06
Session subject:
Spatial Audio
DOI:
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Gao, Shan; Wu, Xihong; Qu, Tianshu; 2022; Localization of Direct Source and Early Reflections Using HOA Processing and DNN Model [PDF]; Key Laboratory on Machine Perception (Ministry of Education), School of Artificial Intelligence, Peking University, Beijing, China; Paper 10560; Available from: https://aes.org/publications/elibrary-page/?id=21673
Gao, Shan; Wu, Xihong; Qu, Tianshu; Localization of Direct Source and Early Reflections Using HOA Processing and DNN Model [PDF]; Key Laboratory on Machine Perception (Ministry of Education), School of Artificial Intelligence, Peking University, Beijing, China; Paper 10560; 2022 Available: https://aes.org/publications/elibrary-page/?id=21673
@inproceedings{Gao2022localization,
title={{Localization of Direct Source and Early Reflections Using HOA Processing and DNN Model}},
author={Gao, Shan and Wu, Xihong and Qu, Tianshu},
year={2022},
month={may},
booktitle={Journal of the Audio Engineering Society},
publisher={Paper 10560; AES Convention 152; May 2022},
number={10560},
organization={AES},
}
Notifications