You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
During the recent years, convolutional neural networks have been the standard on audio semantics, surpassing traditional classification approaches which employed hand-crafted feature engineering as front-end and various classifiers as back-end. Early studies were based on prominent 2D convolutional topologies for image recognition, adapting them to audio classification tasks. After the surge of deep learning in the past decade, real end-to-end audio learning, employing algorithms that directly process waveforms are to become the standard. This paper attempts a comparison between deep neural setups on typical audio classification tasks, focusing on optimizing 1D convolutional neural networks that can be deployed on various audio in-formation retrieval tasks, such as general audio detection and classification, environmental sound or speech emotion recognition.
Author (s): Vrysis, Lazaros;
Thoidis, Iordanis;
Dimoulas, Charalampos;
Papanikolaou, George;
Affiliation:
Aristotle University of Thessaloniki
(See document for exact affiliation information.)
AES Convention: 148
Paper Number:10329
Publication Date:
2020-05-06
Session subject:
Posters: Signal Processing
DOI:
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Vrysis, Lazaros; Thoidis, Iordanis; Dimoulas, Charalampos; Papanikolaou, George; 2020; Experimenting with 1D CNN Architectures for Generic Audio Classification [PDF]; Aristotle University of Thessaloniki; Paper 10329; Available from: https://aes.org/publications/elibrary-page/?id=20746
Vrysis, Lazaros; Thoidis, Iordanis; Dimoulas, Charalampos; Papanikolaou, George; Experimenting with 1D CNN Architectures for Generic Audio Classification [PDF]; Aristotle University of Thessaloniki; Paper 10329; 2020 Available: https://aes.org/publications/elibrary-page/?id=20746
@inproceedings{Vrysis2020experimenting,
title={{Experimenting with 1D CNN Architectures for Generic Audio Classification}},
author={Vrysis, Lazaros and Thoidis, Iordanis and Dimoulas, Charalampos and Papanikolaou, George},
year={2020},
month={may},
booktitle={Journal of the Audio Engineering Society},
publisher={Paper 10329; AES Convention 148; May 2020},
number={10329},
organization={AES},
}
Notifications