You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
This paper proposes a sound source separation method using image signal processing and a microphone array. First, a spatio-temporal sound pressure distribution (STSPD) image is formed based on microphone outputs. Two-dimensional fast Fourier transform (2D FFT) transforms this image into a spectrum, in which sounds from different directions are separated into the components on different lines naturally. To separate sound sources, every line in the spectrum is extracted and 2D inverse FFT is applied. A method to restore a ?ne STSPD image from the sparse-microphone array is also proposed. Although the basic performance of the proposed method is comparable to a conventional delay and sum array, methods that are more sophisticated can be applied for improved performance.
Author (s): Ozawa, Kenji;
Ito, Masaaki;
Shimizu, Genya;
Morise, Masanori;
Sakamoto, Shuichi;
Affiliation:
University of Yamanashi, Kofu, Yamanashi, Japan; Tohoku University, Sendai, Japan
(See document for exact affiliation information.)
Publication Date:
2018-07-06
Session subject:
Spatio-temporal sound pressure distribution image; Image signal processing; Two-dimensional fast Fourier transform; Sparse modeling using L1 regularization (Lasso)
DOI:
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Ozawa, Kenji; Ito, Masaaki; Shimizu, Genya; Morise, Masanori; Sakamoto, Shuichi; 2018; Proposal of a Sound Source Separation Method Using Image Signal Processing of a Spatio-Temporal Sound Pressure Distribution Image [PDF]; University of Yamanashi, Kofu, Yamanashi, Japan; Tohoku University, Sendai, Japan; Paper PP-6; Available from: https://aes.org/publications/elibrary-page/?id=19614
Ozawa, Kenji; Ito, Masaaki; Shimizu, Genya; Morise, Masanori; Sakamoto, Shuichi; Proposal of a Sound Source Separation Method Using Image Signal Processing of a Spatio-Temporal Sound Pressure Distribution Image [PDF]; University of Yamanashi, Kofu, Yamanashi, Japan; Tohoku University, Sendai, Japan; Paper PP-6; 2018 Available: https://aes.org/publications/elibrary-page/?id=19614
@inproceedings{Ozawa2018proposal,
title={{Proposal of a Sound Source Separation Method Using Image Signal Processing of a Spatio-Temporal Sound Pressure Distribution Image}},
author={Ozawa, Kenji and Ito, Masaaki and Shimizu, Genya and Morise, Masanori and Sakamoto, Shuichi},
year={2018},
month={jul},
booktitle={Journal of the Audio Engineering Society},
publisher={Paper PP-6; AES Conference: 2018 AES International Conference on Spatial Reproduction - Aesthetics and Science; July 2018},
number={PP-6},
organization={AES},
}
TY – paper
TI – Proposal of a Sound Source Separation Method Using Image Signal Processing of a Spatio-Temporal Sound Pressure Distribution Image
AU – Ozawa, Kenji
AU – Ito, Masaaki
AU – Shimizu, Genya
AU – Morise, Masanori
AU – Sakamoto, Shuichi
PY – 2018
JO – Journal of the Audio Engineering Society
VL – PP-6
Y1 – July 2018
Notifications