AES Dublin 2019
Tutorial T19
Thursday, March 21, 16:15 — 17:30 (Liffey Hall 1)
T19 - Practical Deep Learning Introduction for Audio Processing Engineers
Presenter:Gabriele Bunkheila, MathWorks - Madrid, Spain
Are you an audio engineer working on product development or DSP algorithms and willing to integrate AI capabilities within your projects? In this session we will walk through a simple Deep Learning example for speech classification. We will use MATLAB code and a speech command dataset made available by Google. We will cover creating and accessing labeled data, using time-frequency transformations, extracting features, designing and training deep neural network architectures, and testing prototypes on real-time audio. We will also discuss working with other popular Deep Learning tools, including exploiting available pre-trained networks.
This session is presented in association with the AES Technical Committee on Semantic Audio Analysis |