144th AES CONVENTION Paper Session P18: Perception — Part 2

AES Milan 2018
Paper Session P18

P18 - Perception – Part 2


Friday, May 25, 11:00 — 13:00 (Scala 4)

Chair:
Franz Zotter, IEM, University of Music and Performing Arts - Graz, Austria

P18-1 The Effect of the Rise Time and Frequency Character of the Sound Source Signal on the Sense of the Early LEVToru Kamekawa, Tokyo University of the Arts - Adachi-ku, Tokyo, Japan; Atsushi Marui, Tokyo University of the Arts - Tokyo, Japan
The effect of the rise-time of the sound source signal on the sense of the early LEV was investigated. Authors conducted the Scheffe’s pairwise comparison method using seven kinds of bandpass noise that has eight kinds of rise-time convolved with impulse responses and played back from seven loudspeakers. From the result, the octave-band noises up to 1 kHz, the early LEV is felt strongly when the rise-time is 40 to 60 ms and the evaluation due to the difference in the rise-time varies in the case of the high frequency band. Additionally the early LEV in the mid-low range is related to the early reflections in 80 ms based on the first wavefront law, and there was almost no relationship between IACC.
Convention Paper 9986 (Purchase now)

P18-2 Plausibility of an Interactive Approaching Motion towards a Virtual Sound Source Based on Simplified BRIR SetsAnnika Neidhardt, Technische Universität Ilmenau - Ilmenau, Germany; Alby Ignatious-Tommy, Technical University Ilmenau - Ilmenau, Germany; Anson Davis Pereppadan, Technical University Ilmenau - Ilmenau, Germany
In this paper the interactive approaching motion towards a virtual loudspeaker created with dynamic binaural synthesis is subject to research. A realization based on a given set of measured binaural room impulse responses (BRIRs) was rated as plausible by all participants in a previous experiment. In this study the same BRIR-data is systematically simplified to investigate the consequences for the perception. This is of interest in the context of position-dynamic reproduction, related interpolation and extrapolation approaches as well as attempts of parameterization. The potential of inaudible data simplification is highly related to the human sensitivity to position-dependent changes in room acoustics. The results suggest a high potential for simplification, while some kinds of BRIR-impairment clearly affect the plausibility.
Convention Paper 9987 (Purchase now)

P18-3 The Impact of Trajectories of Head and Source Movements on Perceived Externalization of a Frontal Sound SourceSong Li, Leibniz Universität Hannover - Hannover, Germany; Jiaxiang E, Leibniz Universität Hannover - Hannover, Germany; Roman Schlieper, Leibniz Universität Hannover - Hannover, Germany; Jürgen Peissig, Leibniz Universität Hannover - Hannover, Germany
Two listening experiments were performed to investigate the influence of different trajectories of head and source movements on perceived externalization of a frontal sound source. In the first listening test, virtual moving sound sources with seven various trajectories were presented over headphones, while subjects’ heads remained stationary. In the second test, subjects were asked to rotate their heads on three predefined trajectories coupled with real-time binaural rendering, while the simulated virtual sound source was kept stationary. After each presentation, subjects should rate the degree of perceived externalization. Results suggested that large head and source movements can improve perceived externalization, except source movements in the front/back direction. In addition, small source or head movements do not have the influence on externalization.
Convention Paper 9988 (Purchase now)

P18-4 Evaluation of Binaural Renderers: Externalization, Front/Back and Up/Down ConfusionsGregory Reardon, New York University - New York, NY, USA; Gabriel Zalles, New York University - New York, NY, USA; Andrea Genovese, New York University - New York, NY, USA; Patrick Flanagan, THX Ltd. - San Francisco, CA, USA; Agnieszka Roginska, New York University - New York, NY, USA
Binaural renderers can be used to reproduce dynamic spatial audio over headphones and deliver immersive audio content. Six commercially available binaural renderers with different rendering methodologies were evaluated in a multi-phase subjective study. This paper presents and discusses the testing methodology, evaluation criteria, and main findings of the externalization, front/back discrimination and up/down discrimination tasks that are part of the first phase. Statistical analysis over a large number of subjects revealed that the choice of renderer has a significant effect on all three dependent measures. Further, ratings of perceived externalization for the renderers were found to be content-specific, while renderer reversal rates were much more robust to different stimuli.
Convention Paper 9989 (Purchase now)


Return to Paper Sessions