You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
Conventional tempo estimation algorithms generally work by detecting significant audio events and finding periodicities of repetitive patterns in an audio signal. However, human perception of tempo is subjective, and relies on a far richer set of information, causing many tempo estimation algorithms to suffer from octave errors, or “double/half-time” confusion. In this paper, we propose a system that uses higher-level musical descriptors such as mood to train a statistical model of perceived tempo classes, which can then used to correct the estimate from a conventional tempo estimation algorithm. Our experimental results show reliable classification of perceived tempo class, as well as a significant reduction of octave errors when applied to an array of available tempo estimation algorithms.
Author (s): Chen, Ching-Wei;
Cremer, Markus;
Lee, Kyogu;
DiMaria, Peter;
Wu, Ho-Hsiang;
Affiliation:
Gracenote, Inc., Emeryville, CA, USA
(See document for exact affiliation information.)
AES Convention: 126
Paper Number:7777
Publication Date:
2009-05-06
Session subject:
Psychoacoustics and Perception
DOI:
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Chen, Ching-Wei; Cremer, Markus; Lee, Kyogu; DiMaria, Peter; Wu, Ho-Hsiang; 2009; Improving Perceived Tempo Estimation by Statistical Modeling of Higher-Level Musical Descriptors [PDF]; Gracenote, Inc., Emeryville, CA, USA; Paper 7777; Available from: https://aes.org/publications/elibrary-page/?id=14973
Chen, Ching-Wei; Cremer, Markus; Lee, Kyogu; DiMaria, Peter; Wu, Ho-Hsiang; Improving Perceived Tempo Estimation by Statistical Modeling of Higher-Level Musical Descriptors [PDF]; Gracenote, Inc., Emeryville, CA, USA; Paper 7777; 2009 Available: https://aes.org/publications/elibrary-page/?id=14973
@inproceedings{Chen2009improving,
title={{Improving Perceived Tempo Estimation by Statistical Modeling of Higher-Level Musical Descriptors}},
author={Chen, Ching-Wei and Cremer, Markus and Lee, Kyogu and DiMaria, Peter and Wu, Ho-Hsiang},
year={2009},
month={may},
booktitle={Journal of the Audio Engineering Society},
publisher={Paper 7777; AES Convention 126; May 2009},
number={7777},
organization={AES},
}
Notifications