📊 OBSERVABILITÉ

Tout voir

Full-spectrum observability: metrics, logs, traces, profiles, and cost monitoring in a unified stack.

LES QUATRE PILIERS DE L'OBSERVABILITÉ

Métriques

Mimir

Données temporelles avec PromQL. Rétention longue durée, requêtes à vue globale, API compatible Thanos.

Logs

Loki

Agrégation de logs indexée par labels. Requêtes LogQL, empreinte stockage minimale, compatible Prometheus.

Traces

Tempo

Traçage distribué avec support Jaeger/Zipkin/OTLP. Corrélation trace-vers-log et trace-vers-métrique.

Profils

Pyroscope

Profilage continu pour CPU, mémoire, goroutines. Flame graphs, comparaison temporelle.

Pipeline de télémétrie

Grafana Alloy est l'agent unique qui collecte tout — remplaçant Promtail, Grafana Agent et OTel Collector.

graph LR
  WL["Workloads
DaemonSet on every node"]:::workloads
  ALLOY["Grafana Alloy
StatefulSet + DaemonSet"]:::alloy
  MIMIR["Mimir
Metrics"]:::storage
  LOKI["Loki
Logs"]:::storage
  TEMPO["Tempo
Traces"]:::storage
  PYRO["Pyroscope
Profiles"]:::storage
  GRAF["Grafana
SSO via Keycloak"]:::grafana

  WL -->|"scrape + forward"| ALLOY
  ALLOY -->|"metrics"| MIMIR
  ALLOY -->|"logs"| LOKI
  ALLOY -->|"traces"| TEMPO
  ALLOY -->|"profiles"| PYRO
  MIMIR --> GRAF
  LOKI --> GRAF
  TEMPO --> GRAF
  PYRO --> GRAF

  classDef workloads fill:#1e293b,stroke:#e2e8f0,color:#e2e8f0,stroke-width:2px
  classDef alloy fill:#0e3a3a,stroke:#06b6d4,color:#67e8f9,stroke-width:2px
  classDef storage fill:#2e2a0e,stroke:#facc15,color:#fde68a,stroke-width:2px
  classDef grafana fill:#14332a,stroke:#4ade80,color:#86efac,stroke-width:2px

Tous les composants

Prometheus

production

Pull-based metrics collection with multi-dimensional data model and powerful PromQL query language.

Rôle : Primary metrics scraping for all platform services via ServiceMonitors

Grafana

production

Visualization platform connecting metrics, logs, traces, and profiles in unified dashboards.

Rôle : Central observability UI with pre-built dashboards for every platform component

Mimir

production

Horizontally-scalable long-term metrics storage with Prometheus-compatible API.

Rôle : Indefinite metrics retention with high compression and fast queries

Loki

production

Log aggregation system inspired by Prometheus — indexes labels, not full log lines.

Rôle : Centralized logging with LogQL queries across all namespaces

Tempo

production

Distributed tracing backend supporting Jaeger, Zipkin, and OpenTelemetry formats.

Rôle : End-to-end request tracing across microservices

Pyroscope

production

Continuous profiling platform for CPU, memory, goroutine, and lock contention analysis.

Rôle : Runtime performance profiling with flame graph visualization

Grafana Alloy

production

Unified telemetry collector replacing Promtail, Grafana Agent, and OpenTelemetry Collector.

Rôle : Single agent collecting metrics, logs, traces, and profiles from all nodes

OpenCost

production

Real-time Kubernetes cost monitoring with per-namespace and per-workload breakdown.

Rôle : Infrastructure cost visibility and optimization recommendations