You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
The most effective way of providing immersive sound effects is to use head-related transfer functions (HRTFs). HRTFs are defined by the path from a given sound source to the listener`s ears. However, sound propagation by HRTFs differs slightly between people because the head, body, and ears differ for each person. Recently, a method for estimating HRTFs using a neural network has been developed, where anthropometric pinna measurements and head-related impulse responses (HRIRs) are used as the input and output layer of the neural network. However, it is inefficient to accurately measure such anthropometric data. This paper proposes a feature extraction method for the ear image instead of measuring anthropometric pinna measurements directly. The proposed method utilizes the bottleneck features of a convolutional neural network (CNN) auto-encoder from the edge detected ear image. The proposed feature extraction method using the CNN-based auto-encoder will be incorporated into the HRTF estimation approach.
Author (s): Lee, Geon Woo;
Moon, Jung Min;
Chun, Chan Jun;
Kim, Hong Kook;
Affiliation:
Korea Institute of Civil Engineering and Building Technology (KICT), Goyang, Korea; Gwangju Institute of Science and Tech (GIST), Gwangju, Korea
(See document for exact affiliation information.)
AES Convention: 144
Paper Number:10023
Publication Date:
2018-05-06
Session subject:
Posters: Spatial Audio
DOI:
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Lee, Geon Woo; Moon, Jung Min; Chun, Chan Jun; Kim, Hong Kook; 2018; On the Use of Bottleneck Features of CNN Auto-Encoder for Personalized HRTFs [PDF]; Korea Institute of Civil Engineering and Building Technology (KICT), Goyang, Korea; Gwangju Institute of Science and Tech (GIST), Gwangju, Korea; Paper 10023; Available from: https://aes.org/publications/elibrary-page/?id=19419
Lee, Geon Woo; Moon, Jung Min; Chun, Chan Jun; Kim, Hong Kook; On the Use of Bottleneck Features of CNN Auto-Encoder for Personalized HRTFs [PDF]; Korea Institute of Civil Engineering and Building Technology (KICT), Goyang, Korea; Gwangju Institute of Science and Tech (GIST), Gwangju, Korea; Paper 10023; 2018 Available: https://aes.org/publications/elibrary-page/?id=19419
@inproceedings{Lee2018on,
title={{On the Use of Bottleneck Features of CNN Auto-Encoder for Personalized HRTFs}},
author={Lee, Geon Woo and Moon, Jung Min and Chun, Chan Jun and Kim, Hong Kook},
year={2018},
month={may},
booktitle={Journal of the Audio Engineering Society},
publisher={Paper 10023; AES Convention 144; May 2018},
number={10023},
organization={AES},
}
Notifications