DLHub:科学模型和数据服务 (DLHub: Model and Data Serving for Science)

While the Machine Learning (ML) landscape is evolving rapidly, there has been a relative lag in the development of the "learning systems" needed to enable broad adoption. Furthermore, few such systems are designed to support the specialized requirements of scientific ML. Here we present the Data and Learning Hub for science (DLHub), a multi-tenant system that provides both model repository and serving capabilities with a focus on science applications. DLHub addresses two significant shortcomings in current systems. First, its selfservice model repository allows users to share, publish, verify, reproduce, and reuse models, and addresses concerns related to model reproducibility by packaging and distributing models and all constituent components. Second, it implements scalable and low-latency serving capabilities that can leverage parallel and distributed computing resources to democratize access to published models through a simple web interface. Unlike other model serving frameworks, DLHub can store and serve any Python 3-compatible model or processing function, plus multiple-function pipelines. We show that relative to other model serving systems including TensorFlow Serving, SageMaker, and Clipper, DLHub provides greater capabilities, comparable performance without memoization and batching, and significantly better performance when the latter two techniques can be employed. We also describe early uses of DLHub for scientific applications.

翻译：虽然机器学习(ML)格局正在迅速演变,但在开发必要的“学习系统”以便广泛采用方面却相对滞后,此外,这类系统也很少用来支持科学ML的专门要求。这里我们展示了科学数据和学习枢纽(DLHub),这是一个提供模式储存和服务能力的多维系统,以科学应用为重点。DLHub处理当前系统中两个重大缺陷。首先,其自助服务模式库允许用户通过包装和分发模型及所有组成部分,分享、公布、核查、复制和再利用模型,并解决与模型再现有关的问题。第二,它实施可扩展和低延迟的服务能力,通过简单的网络界面,利用平行和分散的计算机资源,使已出版模型的进入民主化。与其他模式服务框架不同的是,DLub可以储存和提供任何比对立的3模型或处理功能,加上多重功能管道。我们显示,与其他模型服务系统相比,通过包装和分发模型以及所有组成部分,我们可以通过包装和分发模型和克里普(Clipper),DLHub等系统实施可扩缩的可操作能力,在不大量使用科学应用的早期和后,我们也能提供较强的成绩。

相关内容

MoDELS

关注 30

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

Python分布式计算，171页pdf，Distributed Computing with Python

专知会员服务

105+阅读 · 2020年5月3日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

161+阅读 · 2020年3月18日

深度强化学习策略梯度教程，53页ppt

专知会员服务

176+阅读 · 2020年2月1日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

52+阅读 · 2020年1月30日