Disaggregated, or non-local, file storage has become a common design pattern in cloud systems, offering benefits of resource pooling and server specialization, where the inherent overhead of separating compute and storage is mostly hidden by storage device latency. We take an alternate approach, motivated by the commercial availability of very low latency non-volatile memory (NVM). By colocating computation and NVM storage, we can provide applications much higher I/O performance, sub-second application failover, and strong consistency. To demonstrate this, we built the Assise distributed file system, based on a persistent, replicated cache coherence protocol for managing a set of colocated NVM storage devices as a layer. Unlike disaggregated file stores, Assise avoids the read and write amplification of page granularity operations. Instead, remote NVM serves as an intermediate, byte-addressable cache between colocated NVM and slower storage, such as SSDs. We compare Assise to Ceph/Bluestore, NFS, and Octopus on a cluster with Intel Optane DC persistent memory modules and SSDs for common cloud applications and benchmarks, such as LevelDB, Postfix, MinuteSort, and FileBench. We find that Assise improves write latency up to 22x, throughput up to 56x, fail-over time up to 103x, and scales up to 6x better than Ceph, while providing stronger consistency semantics.
翻译:分散的或非本地的文件存储已成为云层系统中常见的设计模式, 提供了资源集合和服务器专业化的好处, 在云层系统中, 分离计算和存储的固有间接费用大多隐藏在存储设备内。 我们采取了另一种办法, 其动机是商业上可以获得非常低的悬浮非挥发性内存( NVM ) 。 通过对调计算和 NVM 存储, 我们可以提供更高I/ O性能、 第二应用失败和高度一致性的应用程序。 为了证明这一点, 我们建立了一个 Asssise 分布式文件系统, 其基础是持续复制的缓存一致性协议, 用于管理一组共享的 NVM 存储设备。 不同于分类的文件存储, Assise 避免读写页面颗粒性操作的缩写。 相反, 远程 NVM 是一个中间、 共享的 NVM 和慢存储器之间的可处理缓存, 例如 SSDSDS。 我们将 Asisismissionality to laph/ breaude, NFS, lax lax lavely lax 和SDFSDLSDL lax lax lax lax lax 等等常可提供更好的时间缩缩缩缩缩缩缩缩缩缩缩缩缩缩缩缩、 缩缩缩缩略图。