AES E-Library

Automatic Soundscape Classification via Comparative Psychometrics and Machine Learning

Computational acoustical ecology is a relatively new field in which long-term environmental recordings are mined for meaningful data. Humans naturally and automatically associate environmental sounds with emotions and can easily identify the components of a soundscape. However, equipping a computer to rate unknown environmental recordings accurately and automatically along subjective psychoacoustic dimensions is quite difficult, and having it report with a high degree of accuracy the environment in which the recordings were made (e.g., beach, barnyard, home kitchen, or research lab) is harder still. We present a robust algorithm for automatic soundscape classification in which both psychometric data and computed audio features are compared and used to train a naive Bayesian classifier that assigns each soundscape to one of several categories. In a pilot test, automatic classification accuracy of 88% was achieved on 20 soundscapes, and the classifier outperformed human ratings in some tests. In a second test, classification accuracy of 95% was achieved on 30 soundscapes.
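The abstract does not specify the feature set or the exact classifier formulation, so the following is only a minimal sketch of how a naive Bayesian classifier could be trained on combined psychometric and audio features. It assumes a Gaussian likelihood per feature; the function names, the feature columns (two listener ratings and a spectral centroid), and the toy values are all hypothetical and are not taken from the paper.

```python
import math
from collections import defaultdict


def fit_gaussian_nb(features, labels, var_floor=1e-6):
    """Estimate a log prior and per-feature mean/variance for each class."""
    by_class = defaultdict(list)
    for row, label in zip(features, labels):
        by_class[label].append(row)

    total = len(features)
    model = {}
    for label, rows in by_class.items():
        n = len(rows)
        columns = list(zip(*rows))
        means = [sum(col) / n for col in columns]
        # var_floor keeps the variance strictly positive for constant features.
        variances = [
            sum((x - m) ** 2 for x in col) / n + var_floor
            for col, m in zip(columns, means)
        ]
        model[label] = (math.log(n / total), means, variances)
    return model


def predict(model, row):
    """Return the class with the highest log posterior for one feature vector."""
    def log_posterior(label):
        log_prior, means, variances = model[label]
        log_likelihood = sum(
            -0.5 * math.log(2 * math.pi * v) - (x - m) ** 2 / (2 * v)
            for x, m, v in zip(row, means, variances)
        )
        return log_prior + log_likelihood

    return max(model, key=log_posterior)


# Hypothetical toy data, not from the paper. Each row holds:
# [listener rating A (1-5), listener rating B (1-5), spectral centroid (kHz)]
train_x = [
    [4.5, 2.0, 1.2],
    [4.2, 2.3, 1.0],
    [4.8, 1.8, 1.4],
    [2.0, 4.0, 3.1],
    [2.4, 4.3, 2.8],
    [1.8, 3.9, 3.3],
]
train_y = ["beach", "beach", "beach", "kitchen", "kitchen", "kitchen"]

model = fit_gaussian_nb(train_x, train_y)
prediction = predict(model, [4.4, 2.1, 1.1])
```

In this sketch the psychometric ratings and the computed audio feature are simply concatenated into one vector per recording; how the paper actually compares and combines the two kinds of data is not stated in the abstract.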

 


