Head-Related Transfer Function Upsampling Using an Autoencoder-Based Generative Adversarial Network With Evaluation Framework

Hu, Xuyi and Li, Jian and Picinali, Lorenzo and Hogg, Aidan O. T.

AES E-Library

Accurate head-related transfer functions (HRTFs) are essential for delivering realistic 3D audio experiences. However, obtaining personalized, high-resolution HRTFs for individual users is a time-consuming and costly process, typically requiring extensive acoustic measurements. To address this, spatial upsampling techniques have been developed to estimate high-resolution HRTFs from sparse, low-resolution acoustic measurements. This paper presents a novel approach that leverages the spherical harmonic domain and an autoencoder generative adversarial network to tackle the HRTF upsampling problem. Comprehensive evaluations are conducted using both perceptual models and objective spectral metrics to validate the accuracy and realism of the upsampled HRTFs. The results show that the proposed approach outperforms traditional barycentric interpolation in terms of log-spectral distortion, particularly in extreme sparsity scenarios involving fewer than 12 measurements. These results provide some evidence that the proposed autoencoder generative adversarial network approach can create high-quality, high-resolution HRTFs from only a few acoustic measurements, helping pave the way for more accessible personalized spatial audio across a range of applications.

Authors: Hu, Xuyi; Li, Jian; Picinali, Lorenzo; Hogg, Aidan O. T.
Affiliations: Audio Experience Design (www.axdesign.co.uk), Dyson School of Design Engineering, Imperial College London, London, UK; Centre for Digital Music, School of Electronic Engineering and Computer Science, Queen Mary University of London, London, UK (See document for exact affiliation information.)
Publication Date: 2025-09-05

Type: Journal Article