Skip to main navigation Skip to search Skip to main content

Link/Switch Failure Analysis of Data Center Networks on Matroidal Connectivity

Wanling Lin, Xiao-Yan Li*, Jou-Ming Chang, Xiaohua Jia

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

With the surge of bandwidth demand for cloud applications and the exponential growth of data, data center networks (DCNs) are expanding rapidly, followed by the daily increasing likelihood of failures. Such failures, whether due to device or link issues, are inevitable and often lead to packet loss, transmission delays, and even system downtime. Thus, it is crucial to assess the fault-tolerant capabilities of data center networks using appropriate reliability metrics when failures occur. BCube is a well-known server-centric data center network with many advantages, such as rich low-diameter paths, high throughput, and excellent expandability. Not only do the recently proposed matroidal connectivity and conditional matroidal connectivity have reasonable fault assumptions that align well with the structural characteristics of data center networks, but they also significantly enhance the fault tolerance performance of DCNs. This paper determines the matroidal connectivity and conditional matroidal connectivity of BCube, which is the first study to apply the two reliability metrics in DCNs. Then, we extend the conclusions about (conditional) matroidal connectivity to analyze the fault tolerance of BCube in the occurrence of switch failures. In addition, we develop an efficient algorithm to identify the structural features of minimum faulty edge sets, where the cardinality of these edge sets corresponds to the conditional matroidal connectivity of BCube. Finally, we experimentally evaluate the effects of both link and switch failures on BCube’s performance under the matroidal restriction. The experimental analyses reveal that BCube DCNs exhibit high fault tolerance under matroidal constraints, with the ability to withstand both link and switch failures. © 2025 IEEE.
Original languageEnglish
Pages (from-to)1220-1235
JournalIEEE Transactions on Networking
Volume33
Issue number3
Online published13 Jun 2025
DOIs
Publication statusPublished - Jun 2025

Funding

The work of XiaoYan Li was supported in part by the NSFC under Grant 62002062 and in part by the NSFF under Grant 2022J05029. The work of Jou-Ming Chang was supported by the NSTC under Grant NSTC-112-2221-E-141-004.

Research Keywords

  • Switches
  • Data centers
  • Fault tolerant systems
  • Fault tolerance
  • Servers
  • Reliability
  • Network topology
  • Topology
  • Routing
  • Resilience
  • Matroidal connectivity
  • conditional matroidal connectivity
  • data center networks (DCNs)
  • BCube
  • link failures
  • switch failures

Fingerprint

Dive into the research topics of 'Link/Switch Failure Analysis of Data Center Networks on Matroidal Connectivity'. Together they form a unique fingerprint.

Cite this