Abstract
Log-Structure Merge-tree (LSM) based Key-Value (KV) systems are widely deployed. A widely acknowledged problem with LSM-KVs is write stalls, which refers to sudden performance drops under heavy write pressure. Prior studies have attributed write stalls to a particular cause such as a resource shortage or a scheduling issue. In this paper, we conduct a systematic study on the causes of write stalls by evaluating RocksDB with a variety of storage devices and show that the conclusions that focus on the individual aspects, though valid, are not generally applicable. Through a thorough review and further experiments with RocksDB, we show that data overflow, which refers to the rapid expansion of one or more components in an LSM-KV system due to a surge in data flow into one of the components, is able to explain the formation of write stalls. We contend that by balancing and harmonizing data flow among components, we will be able to reduce data overflow and thus, write stalls. As evidence, we propose a tuning framework called ADOC (Automatic Data Overflow Control) that automatically adjusts the system configurations, specifically, the number of threads and the batch size, to minimize data overflow in RocksDB. Our extensive experimental evaluations with RocksDB show that ADOC reduces the duration of write stalls by as much as 87.9% and improves performance by as much as 322.8% compared with the auto-tuned RocksDB. Compared to the manually optimized state-of-the-art SILK, ADOC achieves up to 66% higher throughput for the synthetic write-intensive workload that we used, while achieving comparable performance for the real-world YCSB workloads. However, SILK has to use over 20% more DRAM on average. © 2023 by The USENIX Association. All rights reserved.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 21st USENIX Conference on File and Storage Technologies |
| Publisher | USENIX Association |
| Pages | 65-80 |
| ISBN (Print) | 9781939133328 |
| Publication status | Published - Feb 2023 |
| Event | 21st USENIX Conference on File and Storage Technologies (FAST 2023) - Hyatt Regency Santa Clara, Santa Clara, United States Duration: 21 Feb 2023 → 23 Feb 2023 https://www.usenix.org/conference/fast23/venue-hotel-and-travel |
Publication series
| Name | Proceedings of the USENIX Conference on File and Storage Technologies, FAST |
|---|
Conference
| Conference | 21st USENIX Conference on File and Storage Technologies (FAST 2023) |
|---|---|
| Place | United States |
| City | Santa Clara |
| Period | 21/02/23 → 23/02/23 |
| Internet address |
Bibliographical note
Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).Funding
This research was supported by the MSIT (Ministry of Science and ICT), Korea, under the ITRC (Information Technology Research Center) support program (IITP-2022-2021-0-01817) supervised by the IITP (Institute for Information Communications Technology Planning Evaluation), and also partially supported by a grant from the Research Grants Council of the Hong Kong Special Administrative Region, China (Project No. CityU 11217020).
RGC Funding Information
- RGC-funded
Fingerprint
Dive into the research topics of 'ADOC: Automatically Harmonizing Dataflow Between Components in Log-Structured Key-Value Stores for Improved Performance'. Together they form a unique fingerprint.Projects
- 1 Finished
-
GRF: How to Utilize a Huge Number of Flash Chips to Meet the Performance and Reliability Requirements via Self-Healing and Partitioning
XUE, C. J. (Principal Investigator / Project Coordinator)
1/07/20 → 2/01/24
Project: Research
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver