TY - GEN
T1 - R2B
T2 - 59th ACM/IEEE Design Automation Conference (DAC 2022)
AU - Sun, Diansen
AU - Chai, Yunpeng
AU - Liu, Chaoyang
AU - Sun, Weihao
AU - Zhang, Qingpeng
PY - 2022
Y1 - 2022
N2 - Big data applications have differentiated requirements for I/O resources in cloud environments. For instance, data analytic and AI/ML applications usually have periodical burst I/O traffic, and data stream processing and database applications often introduce fluctuating I/O loads based on a guaranteed I/O bandwidth. However, the existing resource isolation model (i.e., RLW) and methods (e.g., Token-bucket, mClock, and cgroup) cannot support the fluctuating I/O load and differentiated I/O demands well, and thus cannot achieve fairness, high resource utilization, and high performance for applications at the same time. In this paper, we propose a novel efficient and fair I/O resource isolation model and method called R2B, which can adapt to the differentiated I/O characteristics and requirements of different applications in a shared resource environment. R2B can simultaneously satisfy the fairness and achieve both high application efficiency and high bandwidth utilization. This work aims to help the cloud provider achieve higher utilization by shifting the burden to the cloud customers to specify their type of workload.
AB - Big data applications have differentiated requirements for I/O resources in cloud environments. For instance, data analytic and AI/ML applications usually have periodical burst I/O traffic, and data stream processing and database applications often introduce fluctuating I/O loads based on a guaranteed I/O bandwidth. However, the existing resource isolation model (i.e., RLW) and methods (e.g., Token-bucket, mClock, and cgroup) cannot support the fluctuating I/O load and differentiated I/O demands well, and thus cannot achieve fairness, high resource utilization, and high performance for applications at the same time. In this paper, we propose a novel efficient and fair I/O resource isolation model and method called R2B, which can adapt to the differentiated I/O characteristics and requirements of different applications in a shared resource environment. R2B can simultaneously satisfy the fairness and achieve both high application efficiency and high bandwidth utilization. This work aims to help the cloud provider achieve higher utilization by shifting the burden to the cloud customers to specify their type of workload.
KW - fairness
KW - I/O scheduling
KW - multi-tenant
KW - QoS
UR - http://www.scopus.com/inward/record.url?scp=85137458797&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-85137458797&origin=recordpage
U2 - 10.1145/3489517.3530521
DO - 10.1145/3489517.3530521
M3 - RGC 32 - Refereed conference paper (with host publication)
SN - 9781450391429
T3 - Proceedings - Design Automation Conference
SP - 883
EP - 888
BT - DAC '22: Proceedings of the 59th ACM/IEEE Design Automation Conference
PB - IEEE
Y2 - 10 July 2022 through 14 July 2022
ER -