You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
The current paper focuses on audio content management by means of joint audio segmentation and classification. We concentrate on the separation of typical audio classes, such as silence / background noise, speech, music and their combinations. A compact feature-vector subset is selected by a Correlation feature selection subset evaluation algorithm after the use of EM clustering algorithm on an initial audio data set. Time and spectral parameters are extracted using filter-banks and wavelets in combination with sliding windows and exponential moving averaging techniques. Features are extracted on a point-to-point basis, using the finest possible time resolution, so that each sample can be individually classified to one of the available groups. Clustering algorithms like EM or Simple K-means are tested to evaluate the final point-to-point classification result, therefore the joint audio detection-classification indexes. The extracted audio detection, segmentation and classification results can be incorporated into appropriate description schemes that would annotate audio events / segments for content description and management purposes.
Author (s): Vegiris, Christos;
Dimoulas, Charalampos;
Papanikolaou, George;
Affiliation:
Aristotle University of Thessaloniki, Thessaloniki, Greece
(See document for exact affiliation information.)
AES Convention: 126
Paper Number:7661
Publication Date:
2009-05-06
Session subject:
Recording, Reproduction, and Delivery
DOI:
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Vegiris, Christos; Dimoulas, Charalampos; Papanikolaou, George; 2009; Audio Content Annotation, Description, and Management Using Joint Audio Detection, Segmentation, and Classification Techniques [PDF]; Aristotle University of Thessaloniki, Thessaloniki, Greece; Paper 7661; Available from: https://aes.org/publications/elibrary-page/?id=14857
Vegiris, Christos; Dimoulas, Charalampos; Papanikolaou, George; Audio Content Annotation, Description, and Management Using Joint Audio Detection, Segmentation, and Classification Techniques [PDF]; Aristotle University of Thessaloniki, Thessaloniki, Greece; Paper 7661; 2009 Available: https://aes.org/publications/elibrary-page/?id=14857
@inproceedings{Vegiris2009audio,
title={{Audio Content Annotation, Description, and Management Using Joint Audio Detection, Segmentation, and Classification Techniques}},
author={Vegiris, Christos and Dimoulas, Charalampos and Papanikolaou, George},
year={2009},
month={may},
booktitle={Journal of the Audio Engineering Society},
publisher={Paper 7661; AES Convention 126; May 2009},
number={7661},
organization={AES},
}
TY – paper
TI – Audio Content Annotation, Description, and Management Using Joint Audio Detection, Segmentation, and Classification Techniques
AU – Vegiris, Christos
AU – Dimoulas, Charalampos
AU – Papanikolaou, George
PY – 2009
JO – Journal of the Audio Engineering Society
VL – 7661
Y1 – May 2009
Notifications