Accurately learning high-frequency signals is a challenge in computer vision and graphics, as neural networks often struggle with these signals due to spectral bias or optimization difficulties. While current techniques like Fourier encodings have made great strides in improving performance, there remains scope for improvement when presented with high-frequency information. This paper introduces Queried-Convolutions (Qonvolutions), a simple yet powerful modification using the neighborhood properties of convolution. Qonvolution convolves a low-frequency signal with queries (such as coordinates) to enhance the learning of intricate high-frequency signals. We empirically demonstrate that Qonvolutions enhance performance across a variety of high-frequency learning tasks crucial to both the computer vision and graphics communities, including 1D regression, 2D super-resolution, 2D image regression, and novel view synthesis (NVS). In particular, by combining Gaussian splatting with Qonvolutions for NVS, we showcase state-of-the-art performance on real-world complex scenes, even outperforming powerful radiance field models on image quality.
翻译:精确学习高频信号是计算机视觉与图形学领域的一项挑战,由于频谱偏差或优化困难,神经网络通常难以有效处理此类信号。尽管现有技术如傅里叶编码在提升性能方面已取得显著进展,但在处理高频信息时仍有改进空间。本文提出查询卷积(Qonvolutions),这是一种利用卷积邻域特性的简洁而有效的改进方法。Qonvolution通过将低频信号与查询项(如坐标)进行卷积运算,以增强对复杂高频信号的学习能力。我们通过实验证明,Qonvolutions在计算机视觉与图形学领域的关键高频学习任务中均能提升性能,包括一维回归、二维超分辨率、二维图像回归以及新视角合成(NVS)。特别地,通过将高斯泼溅与Qonvolutions结合应用于NVS任务,我们在真实世界复杂场景中实现了最先进的性能表现,甚至在图像质量方面超越了强大的辐射场模型。