TY - JOUR
T1 - Availability modeling and cost optimization for the grid resource management system
AU - Dai, Yuan-Shun
AU - Xie, Min
AU - Poh, Kim-Leng
PY - 2008
Y1 - 2008
N2 - Grid computing is a recently developed technique for complex systems with large-scale resource sharing, wide-area communication, and multi-institutional collaboration. Although the development tools and techniques for the grid have been extensively investigated, the availability of the grid resource management system (RMS) has not been comprehensively studied. In order to contribute to this lacking but important field, this paper first models the grid RMS availability by considering both the failures of resource management (RM) servers and the length limitation of request queues. A hierarchical Markov reward model is implemented to evaluate the grid RMS availability. Based on the availability model, an optimization problem for designing the grid RMS is studied in order to minimize the cost by determining the best number of RM servers. Then, the sensitivity analysis is conducted, and a dynamic switching scheduling method is further presented based on the sensitivity analysis. © 2007 IEEE.
AB - Grid computing is a recently developed technique for complex systems with large-scale resource sharing, wide-area communication, and multi-institutional collaboration. Although the development tools and techniques for the grid have been extensively investigated, the availability of the grid resource management system (RMS) has not been comprehensively studied. In order to contribute to this lacking but important field, this paper first models the grid RMS availability by considering both the failures of resource management (RM) servers and the length limitation of request queues. A hierarchical Markov reward model is implemented to evaluate the grid RMS availability. Based on the availability model, an optimization problem for designing the grid RMS is studied in order to minimize the cost by determining the best number of RM servers. Then, the sensitivity analysis is conducted, and a dynamic switching scheduling method is further presented based on the sensitivity analysis. © 2007 IEEE.
KW - Availability
KW - Grid computing
KW - Markov models
KW - Optimization
KW - Reliability
KW - Resource Management System (RMS)
UR - http://www.scopus.com/inward/record.url?scp=54949134883&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-54949134883&origin=recordpage
U2 - 10.1109/TSMCA.2007.909546
DO - 10.1109/TSMCA.2007.909546
M3 - RGC 22 - Publication in policy or professional journal
SN - 1083-4427
VL - 38
SP - 170
EP - 179
JO - IEEE Transactions on Systems, Man, and Cybernetics Part A:Systems and Humans
JF - IEEE Transactions on Systems, Man, and Cybernetics Part A:Systems and Humans
IS - 1
ER -