经过幅值纠正和时间对齐的头相关转移函数插值 (Magnitude-Corrected and Time-Aligned Interpolation of Head-Related Transfer Functions)

Head-related transfer functions (HRTFs) are essential for virtual acoustic realities, as they contain all cues for localizing sound sources in three-dimensional space. Acoustic measurements are one way to obtain high-quality HRTFs. To reduce measurement time, cost, and complexity of measurement systems, a promising approach is to capture only a few HRTFs on a sparse sampling grid and then upsample them to a dense HRTF set by interpolation. However, HRTF interpolation is challenging because small changes in source position can result in significant changes in the HRTF phase and magnitude response. Previous studies greatly improved the interpolation by time-aligning the HRTFs in preprocessing, but magnitude interpolation errors, especially in contralateral regions, remain a problem. Building upon the time-alignment approaches, we propose an additional post-interpolation magnitude correction derived from a frequency-smoothed HRTF representation. Employing all 96 individual simulated HRTF sets of the HUTUBS database, we show that the magnitude correction significantly reduces interpolation errors compared to state-of-the-art interpolation methods applying only time alignment. Our analysis shows that when upsampling very sparse HRTF sets, the subject-averaged magnitude error in the critical higher frequency range is up to 1.5 dB lower when averaged over all directions and even up to 4 dB lower in the contralateral region. As a result, the interaural level differences in the upsampled HRTFs are considerably improved. The proposed algorithm thus has the potential to further reduce the minimum number of HRTFs required for perceptually transparent interpolation.

翻译：头相关转移函数（HRTF）对于虚拟声学实现至关重要，因为它们包含了在三维空间中定位声源的所有线索。声学测量是获取高质量HRTF的一种方式。为了缩短测量时间、降低成本和测量系统的复杂性，一种有前途的方法是仅在稀疏抽样网格上捕获少量HRTF，然后通过插值将其上采样到密集HRTF集。然而，HRTF插值具有挑战性，因为源位置的微小变化可能会导致HRTF相位和幅度响应的显著变化。以前的研究通过预处理中的时间对齐显着改进了插值，但幅度插值误差，特别是在对侧区域，仍然是一个问题。在时间对齐方法之上，我们提出了一种额外的基于频率平滑的HRTF表示的插值后幅度校正方法。利用 HUTUBS 数据库的96个单独模拟HRTF集，我们表明幅度校正显著降低了插值误差，与仅应用时间对齐的最新插值方法相比。我们的分析表明，在上采样非常稀疏的HRTF集时，所有方向的主观平均幅度误差在关键较高频率范围内平均时低至1.5 dB，甚至在对侧区域可降低至4 dB。因此，上采样后的HRTFs的双耳声级差得到了显著的改善。因此，该算法有潜力进一步减少需要的最小HRTF数量，以实现感觉透明插值。