TY - JOUR
T1 - Spatial Analysis and Synthesis Methods
T2 - Subjective and Objective Evaluations Using Various Microphone Arrays in the Auralization of a Critical Listening Room
AU - Pawlak, Alan
AU - Lee, Hyunkook
AU - Makivirta, Aki
AU - Lund, Thomas
N1 - Funding Information:
This work was supported in part by Genelec Oy and in part by the University of Huddersfield.
Publisher Copyright:
© 2014 IEEE.
PY - 2024/9/12
Y1 - 2024/9/12
N2 - Parametric sound field reproduction methods, such as the Spatial Decomposition Method (SDM) and Higher-Order Spatial Impulse Response Rendering (HO-SIRR), are widely used for the analysis and auralization of sound fields. This paper studies the performance of various sound field reproduction methods in the context of the auralization of a critical listening room, focusing on fixed head orientations. The influence on the perceived spatial and timbral fidelity of the following factors is considered: the rendering framework, direction of arrival (DOA) estimation method, microphone array structure, and use of a dedicated center reference microphone with SDM. Listening tests compare the synthesized sound fields to a reference binaural rendering condition, all for static head positions. Several acoustic parameters are measured to gain insights into objective differences between methods. All systems were distinguishable from the reference in perceptual tests. A high-quality pressure microphone improves the SDM framework's timbral fidelity, and spatial fidelity in certain scenarios. Additionally, SDM and HO-SIRR show similarities in spatial fidelity. Performance variation between SDM configurations is influenced by the DOA estimation method and microphone array construction. The binaural SDM (BSDM) presentations display temporal artifacts impacting sound quality.
AB - Parametric sound field reproduction methods, such as the Spatial Decomposition Method (SDM) and Higher-Order Spatial Impulse Response Rendering (HO-SIRR), are widely used for the analysis and auralization of sound fields. This paper studies the performance of various sound field reproduction methods in the context of the auralization of a critical listening room, focusing on fixed head orientations. The influence on the perceived spatial and timbral fidelity of the following factors is considered: the rendering framework, direction of arrival (DOA) estimation method, microphone array structure, and use of a dedicated center reference microphone with SDM. Listening tests compare the synthesized sound fields to a reference binaural rendering condition, all for static head positions. Several acoustic parameters are measured to gain insights into objective differences between methods. All systems were distinguishable from the reference in perceptual tests. A high-quality pressure microphone improves the SDM framework's timbral fidelity, and spatial fidelity in certain scenarios. Additionally, SDM and HO-SIRR show similarities in spatial fidelity. Performance variation between SDM configurations is influenced by the DOA estimation method and microphone array construction. The binaural SDM (BSDM) presentations display temporal artifacts impacting sound quality.
KW - Ambisonics
KW - Auralization
KW - Binaural Rendering
KW - Binaural Room Impulse Responses (BRIR)
KW - Direction of Arrival (DOA)
KW - Direction-of-arrival estimation
KW - Estimation
KW - Higher-Order Spatial Impulse Response Rendering (HO-SIRR)
KW - Loudspeakers
KW - Microphone Arrays
KW - Microphone arrays
KW - MUSHRA
KW - Pseudo Intensity Vectors (PIV)
KW - Rendering (computer graphics)
KW - Spatial Audio
KW - Spatial Decomposition Method (SDM)
KW - Subjective Audio Evaluation
KW - Time Difference of Arrival (TDOA)
KW - Vectors
KW - binaural rendering
KW - mushra
KW - binaural room impulse responses (BRIR)
KW - Spatial audio
KW - subjective audio evaluation
KW - higher-order spatial impulse response rendering (HO-SIRR)
KW - microphone arrays
KW - spatial decomposition method (SDM)
KW - time difference of arrival (TDOA)
KW - auralization
KW - direction of arrival (DOA)
KW - pseudo intensity vectors (PIV)
UR - http://www.scopus.com/inward/record.url?scp=85201745851&partnerID=8YFLogxK
U2 - 10.1109/TASLP.2024.3449037
DO - 10.1109/TASLP.2024.3449037
M3 - Article
AN - SCOPUS:85201745851
VL - 32
SP - 3986
EP - 4001
JO - IEEE/ACM Transactions on Audio Speech and Language Processing
JF - IEEE/ACM Transactions on Audio Speech and Language Processing
SN - 2329-9290
M1 - 10645201
ER -