You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
This paper presents an adaptive prediction method about source-specific ranges of binaural cues, such as inter-channel level difference (ILD) and inter-channel phase difference (IPD), for centrally positioned singing voice separation. To this end, we employ Gaussian mixture model (GMM) to cluster underlying distributions in the feature domain of mixture signal. By regarding responsibilities to those distinct Gaussians as unmixing coefficients of each mixture spectrogram sample, the proposed method can reduce artificial deformations that previous center channel extraction methods usually suffer, caused by their imprecise or rough decision about ranges of central subspaces. Experiments on commercial music show superiority of the proposed method.
Author (s): Kim, Minje;
Beack, Seungkwon;
Choi, Keunwoo;
Kang, Kyeongok;
Affiliation:
Electronics and Telecommunications Research Institute (ETRI), Daejeon, Korea
(See document for exact affiliation information.)
Publication Date:
2011-09-06
Session subject:
Interactive Audio
DOI:
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Kim, Minje; Beack, Seungkwon; Choi, Keunwoo; Kang, Kyeongok; 2011; Gaussian Mixture Model for Singing Voice Separation from Stereophonic Music [PDF]; Electronics and Telecommunications Research Institute (ETRI), Daejeon, Korea; Paper 6-2; Available from: https://aes.org/publications/elibrary-page/?id=16121
Kim, Minje; Beack, Seungkwon; Choi, Keunwoo; Kang, Kyeongok; Gaussian Mixture Model for Singing Voice Separation from Stereophonic Music [PDF]; Electronics and Telecommunications Research Institute (ETRI), Daejeon, Korea; Paper 6-2; 2011 Available: https://aes.org/publications/elibrary-page/?id=16121
@inproceedings{Kim2011gaussian,
title={{Gaussian Mixture Model for Singing Voice Separation from Stereophonic Music}},
author={Kim, Minje and Beack, Seungkwon and Choi, Keunwoo and Kang, Kyeongok},
year={2011},
month={sep},
booktitle={Journal of the Audio Engineering Society},
publisher={Paper 6-2; AES Conference: 43rd International Conference: Audio for Wirelessly Networked Personal Devices; September 2011},
number={6-2},
organization={AES},
}
Notifications