Modern cloud-native systems are dynamic and distributed, making the monitoring of cloud infrastructure challenging with traditional tools meant for static environments. This has led to the development and adoption of specialized observability platforms.
Prometheus is an open-source observability tool tailored for cloud-native settings. Its integration with Kubernetes and its pull-based data collection model have made it popular in DevOps. Nonetheless, Prometheus often struggles with handling large data volumes and lacks adequate cost-optimization capabilities, raising the question of managing Prometheus deployments at a large scale.
Eric Schabell, Director of Community and Developer at Chronosphere and a CNCF Ambassador, discusses metrics collection, time series data, managing Prometheus at scale, and the trade-offs between self-hosted and managed observability with Kevin Ball.
