You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
In this paper we present a novel framework for real-time speech/music discrimination (SMD). The proposed method improves the overall accuracy of automatically classifying the signals into speech, singing, or instrumental categories. In our work, first, we design several groups of classifiers such that each group’s classification decision is biased towards a certain class of sounds; the bias is induced by training different groups of classifiers on perceptual features extracted at different temporal resolutions. Then, we build our system using an ensemble of these biased classifiers organized in a parallel classification fashion. Last, these ensembles are combined with a weighting scheme, which can be tuned in either forward-weighting or inverse-weighting modes, to provide accurate results in real-time. We show, through extensive experimental evaluations, that the proposed ensemble of biased classifiers framework yields superior performance compared to the baseline approach.
Author (s): Kim, Kibeom;
Baijal, Anant;
Ko, Byeong-Seob;
Lee, Sangmoon;
Hwang, Inwoo;
Kim, Youngtae;
Affiliation:
Samsung Electronics Co. Ltd., Suwon, Gyeonggi-do, Korea
(See document for exact affiliation information.)
AES Convention: 139
Paper Number:9457
Publication Date:
2015-10-06
Session subject:
Applications in Audio
DOI:
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Kim, Kibeom; Baijal, Anant; Ko, Byeong-Seob; Lee, Sangmoon; Hwang, Inwoo; Kim, Youngtae; 2015; Speech Music Discrimination Using an Ensemble of Biased Classifiers [PDF]; Samsung Electronics Co. Ltd., Suwon, Gyeonggi-do, Korea; Paper 9457; Available from: https://aes.org/publications/elibrary-page/?id=18013
Kim, Kibeom; Baijal, Anant; Ko, Byeong-Seob; Lee, Sangmoon; Hwang, Inwoo; Kim, Youngtae; Speech Music Discrimination Using an Ensemble of Biased Classifiers [PDF]; Samsung Electronics Co. Ltd., Suwon, Gyeonggi-do, Korea; Paper 9457; 2015 Available: https://aes.org/publications/elibrary-page/?id=18013
@inproceedings{Kim2015speech,
title={{Speech Music Discrimination Using an Ensemble of Biased Classifiers}},
author={Kim, Kibeom and Baijal, Anant and Ko, Byeong-Seob and Lee, Sangmoon and Hwang, Inwoo and Kim, Youngtae},
year={2015},
month={oct},
booktitle={Journal of the Audio Engineering Society},
publisher={Paper 9457; AES Convention 139; October 2015},
number={9457},
organization={AES},
}
Notifications