AES E-Library

Head-Tracked Spatial Audio: What is the Motion-to-Sound Latency of your Device?

Spatial audio with head-tracked binaural rendering enhances immersion by adjusting source positions to the listener`s orientation. The resulting auditory effect is that the audio scene appears fixed in the surrounding world, increasing the ability of source localization, the sensation of source externalization and ultimately improving the quality of experience. A seamless listening experience, however, depends on a low latency between head movement and the resulting sound at the ears, denoted as Motion-to-Sound (M2S) latency. Accordingly, measuring M2S latency is a crucial aspect for system performance assessment.

The present paper is dedicated to measuring M2S latency of arbitrary head-tracked spatial audio systems in a black-box manner, i.e., without requiring any a priori information or modification of the rendering algorithm and with high degree of compatibility with a variety of spatial audio formats. A motorized head and torso simulator (HATS) is used to generate reproducible motion while simultaneously recording the rendered signal from the device under test (DUT) that is to be analyzed. We propose novel approaches to signal analysis and juxtapose constant-speed and sudden movement measurements. An extensive evaluation examines the ability of the methods to perform measurements and reveals significant differences in the rendering behavior of various commercial off-the-shelf spatial audio devices.

 

Author (s):
Affiliation: (See document for exact affiliation information.)
Publication Date:
Session subject:

DOI:


Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Type:
16938
Choose your country of residence from this list: