AES New York 2018
Product Development Track Event PD13

Friday, October 19, 3:00 pm — 4:15 pm (1E09)

Product Development: PD13 - Practical Deep Learning Introduction for Audio Processing Engineers

Presenter:
Gabriele Bunkheila, MathWorks - Madrid, Spain

Are you an audio engineer working on product development or DSP algorithms and willing to integrate AI capabilities within your projects? In this session we will walk through a simple Deep Learning example for speech classification. We will use MATLAB code and a speech command dataset made available by Google. We will cover creating and accessing labeled data, using time-frequency transformations, extracting features, designing and training deep neural network architectures, and testing prototypes on real-time audio. We will also discuss working with other popular Deep Learning tools, including exploiting available pre-trained networks.

This session is presented in association with the AES Technical Committee on Semantic Audio Analysis

Return to Product Development Track Events

145th AES CONVENTION Product Development Track Event PD13: Practical Deep Learning Introduction for Audio Processing Engineers

AES New York 2018
Product Development Track Event PD13

Product Development: PD13 - Practical Deep Learning Introduction for Audio Processing Engineers

Shortcuts

145th AES CONVENTION Product Development Track Event PD13: Practical Deep Learning Introduction for Audio Processing Engineers

AES New York 2018Product Development Track Event PD13

Product Development: PD13 - Practical Deep Learning Introduction for Audio Processing Engineers

Shortcuts

AES New York 2018
Product Development Track Event PD13