DeltaCFS: Boosting Delta Sync for Cloud Storage Services by Learning from NFS Quanlu Zhang∗, Zhenhua Liy, Zhi Yang∗, Shenglong Li∗, Shouyang Li∗, Yangze Guo∗, and Yafei Dai∗z ∗Peking University yTsinghua University zInstitute of Big Data Technologies Shenzhen Key Lab for Cloud Computing Technology & Applications fzql, yangzhi, lishenglong, lishouyang, guoyangze,
[email protected],
[email protected] Abstract—Cloud storage services, such as Dropbox, iCloud find a common root cause that undermines the effect of delta Drive, Google Drive, and Microsoft OneDrive, have greatly sync: these applications heavily rely on structured data which facilitated users’ synchronizing files across heterogeneous devices. are stored in tabular files (e.g., SQLite [12] files), rather than Among them, Dropbox-like services are particularly beneficial owing to the delta sync functionality that strives towards greater simple textual files (e.g., .txt, .tex, and .log files). For exam- network-level efficiency. However, when delta sync trades com- ple, today’s smartphones (like iPhone) regularly synchronize putation overhead for network-traffic saving, the tradeoff could local data to the cloud storage (like iCloud [13]), where a be highly unfavorable under some typical workloads. We refer considerable part of the synchronized is tabular data [14]. to this problem as the abuse of delta sync. Confronted with the abovementioned situation, delta sync To address this problem, we propose DeltaCFS, a novel file sync framework for cloud storage services by learning from the exhibits poor performance, because it often generates intoler- design of conventional NFS (Network File System). Specifically, ably high computation overhead when dealing with structured we combine delta sync with NFS-like file RPC in an adaptive data.