GALA: Greedy ComputAtion for Linear Algebra in Privacy-Preserved Neural Networks

Machine Learning as a Service (MLaaS) is enabling a wide range of smart applications on end devices. However, privacy-preserved computation is still expensive. Our investigation has found that the most time-consuming component of linear computation based on homomorphic encryption (HE) is the series of Permutation (Perm) operations required for dot products and convolutions in privacy-preserved MLaaS. To this end, we propose GALA: Greedy computAtion for Linear Algebra in privacy-preserved neural networks, which views the HE-based linear computation as a series of Homomorphic Add, Mult and Perm operations and chooses the least expensive operation in each linear computation step to reduce the overall cost. GALA makes the following contributions: (1) It introduces a row-wise weight matrix encoding and combines it with the share generation needed for the nonlinear computation based on garbled circuits (GC), to reduce the Perm operations for the dot product; (2) It designs a first-Add-second-Perm approach (named kernel grouping) to reduce the Perm operations for convolution. As such, GALA efficiently reduces the cost of the HE-based linear computation, which is a critical building block in almost all recent frameworks for privacy-preserved neural networks, including GAZELLE (USENIX Security'18), DELPHI (USENIX Security'20), and CrypTFlow2 (CCS'20). With its deep optimization of the HE-based linear computation, GALA can be integrated into these systems as a plug-and-play module to further boost their efficiency. Our experiments show that it achieves a speedup of up to 700x for the dot product and up to 14x for the convolution computation under different data dimensions. Meanwhile, GALA demonstrates a runtime speedup of 2.5x, 2.7x, 3.2x, 8.3x, 7.7x, and 7.5x over GAZELLE and 6.5x, 6x, 5.7x, 4.5x, 4.2x, and 4.1x over CrypTFlow2, on AlexNet, VGG, ResNet-18, ResNet-50, ResNet-101, and ResNet-152, respectively.
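To make the cost model above concrete, the following is a minimal plaintext sketch that treats a packed ciphertext as a plain Python list and counts Add, Mult and Perm operations for a baseline matrix-vector product. It is not GALA's row-wise encoding or kernel grouping, only the generic rotate-and-sum pattern that such optimizations aim to avoid. All names (`OpCounter`, `h_add`, `h_mult`, `h_perm`, `rotate_and_sum`, `naive_matvec`) are hypothetical, no encryption is performed, and the slot count is assumed to be a power of two.

```python
from dataclasses import dataclass


@dataclass
class OpCounter:
    """Tallies the three homomorphic operation types named in the abstract."""
    add: int = 0
    mult: int = 0
    perm: int = 0


def h_add(a, b, ops):
    """Slot-wise addition of two packed vectors (stand-in for Homomorphic Add)."""
    ops.add += 1
    return [x + y for x, y in zip(a, b)]


def h_mult(a, w, ops):
    """Slot-wise product with plaintext weights (stand-in for Homomorphic Mult)."""
    ops.mult += 1
    return [x * y for x, y in zip(a, w)]


def h_perm(a, k, ops):
    """Cyclic left rotation by k slots (stand-in for Homomorphic Perm)."""
    ops.perm += 1
    k %= len(a)
    return a[k:] + a[:k]


def rotate_and_sum(ct, ops):
    """Sum all slots using log2(n) Perm + Add pairs; every slot ends up holding the total."""
    n = len(ct)
    assert n > 0 and n & (n - 1) == 0, "slot count assumed to be a power of two"
    step = n // 2
    while step >= 1:
        ct = h_add(ct, h_perm(ct, step, ops), ops)
        step //= 2
    return ct


def naive_matvec(weights, x, ops):
    """Baseline: one Mult plus a full rotate-and-sum per weight row."""
    out = []
    for row in weights:
        prod = h_mult(x, row, ops)
        out.append(rotate_and_sum(prod, ops)[0])
    return out


ops = OpCounter()
W = [[1, 2, 3, 4], [5, 6, 7, 8]]
x = [1, 0, 2, 1]
y = naive_matvec(W, x, ops)
print(y, ops)
```

In this baseline, the Perm count grows as (number of rows) x log2(number of slots). Since the abstract identifies Perm as the dominant cost, this is the term that a greedy choice of operations would try to shrink.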
