Ensuring performance robustness for a variety of situations that can occur in real-world environments is one of the challenging tasks in sound event classification. One of the unpredictable and detrimental factors in performance, especially in indoor environments, is reverberation. To alleviate this problem, we propose a conditioning method that provides room impulse response (RIR) information to help the network become less sensitive to environmental information and focus on classifying the desired sound. Experimental results show that the proposed method successfully reduced performance degradation caused by the reverberation of the room. In particular, our proposed method works even with similar RIR that can be inferred from the room type rather than the exact one, which has the advantage of potentially being used in real-world applications.
翻译:为缓解这一问题,我们提议了一种调节方法,即提供室速反应(RIR)信息,帮助网络降低对环境信息的敏感度,并侧重于对所希望的音响进行分类。实验结果显示,拟议的方法成功地降低了由于房间反响造成的性能退化。特别是,我们提议的方法,即使从室型而不是从确切的RIR中可以推断出类似的RIR,也可能在实际应用中使用。