Performance Debugging in Data Centers: Doing More with Less
Publication Date
2009
Journal or Book Title
2009 FIRST INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORKS (COMSNETS 2009)
Abstract
With the increasing scale and complexity of data centers, detecting and localizing performance faults in real-time has become both a pressing need and a challenge. While several approaches for performance debugging in data centers have been proposed, these techniques do not assume any constraints on the availability of operational data needed to detect and localize faults. We argue that collecting such operational data often requires significant instrumentation or intrusiveness, which is difficult to realize in production data centers. Such constraints complicate the deployment of existing techniques or limit their effectiveness in practice. In this paper, we argue that for performance debugging to become practical and effective in realworld systems, one needs to develop techniques that are ldquomore effectiverdquo with ldquoless instrumentation and intrusivenessrdquo. We raise several issues and challenges in realizing this vision and present some initial ideas on addressing these challenges.
DOI
https://doi.org/10.1109/COMSNETS.2009.4808877
Pages
366-374
Recommended Citation
Cecchet, E; Natu, M; Sadaphal, V; Shenoy, P; and Vin, H, "Performance Debugging in Data Centers: Doing More with Less" (2009). 2009 FIRST INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORKS (COMSNETS 2009). 1007.
https://doi.org/10.1109/COMSNETS.2009.4808877