You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
Sound source localization and identity tracking are fundamental tasks in acoustic scene analysis, enabling machines to determine what, where, and when sound events occur. While deep attractor-based networks have demonstrated improved performance under an unknown number of sources, maintaining continuous source tracking over longform audio remains challenging due to memory limitations and permutation ambiguities across adjacent segments. In this paper, we propose a Recursive Attractor Network (RANet) for long-form sound source localization
and identity tracking with a variable number of sources. RANet explicitly represents attractors as transferable embeddings and recursively propagates them across adjacent audio segments using a LSTM-based model, thereby preserving source identity continuity over time. Experimental results on simulated datasets demonstrate that RANet achieves robust long-form localization and consistent source identity tracking, outperforming baseline approaches.
Author (s): Du, Jiaqi;
Wu, Xihong;
Qu, Tianshu;
Affiliation:
State Key Laboratory of General Artificial Intelligence, School of Intelligence Science and Technology, Peking University
(See document for exact affiliation information.)
AES Convention: 160
Paper Number:10271
Publication Date:
2026-05-28
Session subject:
AI and Machine Learning in Audio, Audio Processing
DOI:
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Du, Jiaqi; Wu, Xihong; Qu, Tianshu; 2026; A Recursive Attractor Network for Long-Form Sound Source Localization and Identity Tracking with a Variable Number of Sources [PDF]; State Key Laboratory of General Artificial Intelligence, School of Intelligence Science and Technology, Peking University; Paper 10271; Available from: https://aes.org/publications/elibrary-page/?id=23183
Du, Jiaqi; Wu, Xihong; Qu, Tianshu; A Recursive Attractor Network for Long-Form Sound Source Localization and Identity Tracking with a Variable Number of Sources [PDF]; State Key Laboratory of General Artificial Intelligence, School of Intelligence Science and Technology, Peking University; Paper 10271; 2026 Available: https://aes.org/publications/elibrary-page/?id=23183
@inproceedings{Du2026a,
title={{A Recursive Attractor Network for Long-Form Sound Source Localization and Identity Tracking with a Variable Number of Sources}},
author={Du, Jiaqi and Wu, Xihong and Qu, Tianshu},
year={2026},
month={jun},
booktitle={Journal of the Audio Engineering Society},
publisher={},
number={10271},
organization={AES},
}
Notifications