Artificial intelligence (AI) increasingly powers sensitive applications in domains such as healthcare and finance, relying on both linear operations (e.g., matrix multiplications in large language models) and non-linear operations (e.g., sorting in retrieval-augmented generation). Fully homomorphic encryption (FHE) has emerged as a promising tool for privacy-preserving computation, but it remains unclear whether existing methods can support the full spectrum of AI workloads that combine these operations. In this SoK, we ask: Can FHE support general AI computation? We provide both a functional analysis and a cost analysis. First, we categorize ten distinct FHE approaches and evaluate their ability to support general computation. We then identify three promising candidates and benchmark workloads that mix linear and non-linear operations across different bit lengths and SIMD parallelization settings. Finally, we evaluate five real-world, privacy-sensitive AI applications that instantiate these workloads. Our results quantify the costs of achieving general computation in FHE and offer practical guidance on selecting FHE methods that best fit specific AI application requirements. Our codes are available at https://github.com/UCF-ML-Research/FHE-AI-Generality.
翻译:人工智能(AI)在医疗和金融等敏感领域的应用日益广泛,其计算既依赖于线性运算(如大语言模型中的矩阵乘法),也依赖于非线性运算(如检索增强生成中的排序操作)。全同态加密(FHE)已成为隐私保护计算中一种前景广阔的工具,但现有方法能否支持结合了上述运算的完整AI工作负载尚不明确。本文通过系统化研究(SoK)提出核心问题:FHE能否支持通用AI计算?我们同时开展了功能分析与成本分析。首先,我们对十种不同的FHE方法进行分类,并评估其支持通用计算的能力。随后,我们筛选出三种具有潜力的方案,并构建了混合线性与非线性运算的基准工作负载,涵盖不同位宽和SIMD并行化配置。最后,我们评估了五个实际隐私敏感型AI应用案例,这些案例实例化了前述工作负载。实验结果量化了在FHE中实现通用计算所需的成本,并为选择最契合特定AI应用需求的FHE方法提供了实践指导。相关代码已开源:https://github.com/UCF-ML-Research/FHE-AI-Generality。