ADOC: Automatically Harmonizing Dataflow Between Components in Log-Structured Key-Value Stores for Improved Performance

Jinghuan Yu, Sam H. Noh, Young-Ri Choi, Chun Jason Xue

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

16 Citations (Scopus)

Abstract

Log-Structure Merge-tree (LSM) based Key-Value (KV) systems are widely deployed. A widely acknowledged problem with LSM-KVs is write stalls, which refers to sudden performance drops under heavy write pressure. Prior studies have attributed write stalls to a particular cause such as a resource shortage or a scheduling issue. In this paper, we conduct a systematic study on the causes of write stalls by evaluating RocksDB with a variety of storage devices and show that the conclusions that focus on the individual aspects, though valid, are not generally applicable. Through a thorough review and further experiments with RocksDB, we show that data overflow, which refers to the rapid expansion of one or more components in an LSM-KV system due to a surge in data flow into one of the components, is able to explain the formation of write stalls. We contend that by balancing and harmonizing data flow among components, we will be able to reduce data overflow and thus, write stalls. As evidence, we propose a tuning framework called ADOC (Automatic Data Overflow Control) that automatically adjusts the system configurations, specifically, the number of threads and the batch size, to minimize data overflow in RocksDB. Our extensive experimental evaluations with RocksDB show that ADOC reduces the duration of write stalls by as much as 87.9% and improves performance by as much as 322.8% compared with the auto-tuned RocksDB. Compared to the manually optimized state-of-the-art SILK, ADOC achieves up to 66% higher throughput for the synthetic write-intensive workload that we used, while achieving comparable performance for the real-world YCSB workloads. However, SILK has to use over 20% more DRAM on average. © 2023 by The USENIX Association. All rights reserved.
Original languageEnglish
Title of host publicationProceedings of the 21st USENIX Conference on File and Storage Technologies
PublisherUSENIX Association
Pages65-80
ISBN (Print)9781939133328
Publication statusPublished - Feb 2023
Event21st USENIX Conference on File and Storage Technologies (FAST 2023) - Hyatt Regency Santa Clara, Santa Clara, United States
Duration: 21 Feb 202323 Feb 2023
https://www.usenix.org/conference/fast23/venue-hotel-and-travel

Publication series

NameProceedings of the USENIX Conference on File and Storage Technologies, FAST

Conference

Conference21st USENIX Conference on File and Storage Technologies (FAST 2023)
Country/TerritoryUnited States
CitySanta Clara
Period21/02/2323/02/23
Internet address

Bibliographical note

Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).

Fingerprint

Dive into the research topics of 'ADOC: Automatically Harmonizing Dataflow Between Components in Log-Structured Key-Value Stores for Improved Performance'. Together they form a unique fingerprint.

Cite this