Paper Title

ForkBase: Immutable, Tamper-evident Storage Substrate for Branchable Applications

Paper Authors

Lin, Qian; Yang, Kaiyuan; Dinh, Tien Tuan Anh; Cai, Qingchao; Chen, Gang; Ooi, Beng Chin; Ruan, Pingcheng; Wang, Sheng; Xie, Zhongle; Zhang, Meihui; Vandans, Olafs

Abstract

Data collaboration activities typically require systematic or protocol-based coordination to be scalable. Git, an effective enabler for collaborative coding, has been attested for its success in countless projects around the world. Hence, applying the Git philosophy to general data collaboration beyond coding is motivating. We call it Git for data. However, the original Git design handles data at the file granule, which is considered too coarse-grained for many database applications. We argue that Git for data should be co-designed with database systems. To this end, we developed ForkBase to make Git for data practical. ForkBase is a distributed, immutable storage system designed for data version management and data collaborative operation. In this demonstration, we show how ForkBase can greatly facilitate collaborative data management and how its novel data deduplication technique can improve storage efficiency for archiving massive data versions.
