Performance Debugging in Data Centers: Doing More with Less

Publication Date

2009

Journal or Book Title

2009 FIRST INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORKS (COMSNETS 2009)

Abstract

With the increasing scale and complexity of data centers, detecting and localizing performance faults in real time has become both a pressing need and a challenge. While several approaches for performance debugging in data centers have been proposed, these techniques do not assume any constraints on the availability of operational data needed to detect and localize faults. We argue that collecting such operational data often requires significant instrumentation or intrusiveness, which is difficult to realize in production data centers. Such constraints complicate the deployment of existing techniques or limit their effectiveness in practice. In this paper, we argue that for performance debugging to become practical and effective in real-world systems, one needs to develop techniques that are "more effective" with "less instrumentation and intrusiveness". We raise several issues and challenges in realizing this vision and present some initial ideas on addressing these challenges.

DOI

https://doi.org/10.1109/COMSNETS.2009.4808877

Pages

366-374
