You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
In this work we propose a deep learning based method—namely, variational, convolutional recurrent autoencoders (VCRAE)—for musical instrument synthesis. This method utilizes the higher level time-frequency representations extracted by the convolutional and recurrent layers to learn a Gaussian distribution in the training stage, which will be later used to infer unique samples through interpolation of multiple instruments in the usage stage. The reconstruction performance of VCRAE is evaluated by proxy through an instrument classifier and provides significantly better accuracy than two other baseline autoencoder methods. The synthesized samples for the combinations of 15 different instruments are available on the companion website.
Author (s): Çakir, Emre;
Virtanen, Tuomas;
Affiliation:
Tampere University of Technology, Tampere, Finland
(See document for exact affiliation information.)
AES Convention: 145
Paper Number:10035
Publication Date:
2018-10-06
Session subject:
Signal Processing—Part 1
DOI:
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Çakir, Emre; Virtanen, Tuomas; 2018; Musical Instrument Synthesis and Morphing in Multidimensional Latent Space Using Variational, Convolutional Recurrent Autoencoders [PDF]; Tampere University of Technology, Tampere, Finland; Paper 10035; Available from: https://aes.org/publications/elibrary-page/?id=19761
Çakir, Emre; Virtanen, Tuomas; Musical Instrument Synthesis and Morphing in Multidimensional Latent Space Using Variational, Convolutional Recurrent Autoencoders [PDF]; Tampere University of Technology, Tampere, Finland; Paper 10035; 2018 Available: https://aes.org/publications/elibrary-page/?id=19761
@inproceedings{Çakir2018musical,
title={{Musical Instrument Synthesis and Morphing in Multidimensional Latent Space Using Variational, Convolutional Recurrent Autoencoders}},
author={Çakir, Emre and Virtanen, Tuomas},
year={2018},
month={oct},
booktitle={Journal of the Audio Engineering Society},
publisher={Paper 10035; AES Convention 145; October 2018},
number={10035},
organization={AES},
}
Notifications