📊 Observability

Complete Visibility

Full-spectrum observability: metrics, logs, traces, profiles, and cost monitoring in a unified stack.

The Four Pillars of Observability

Metrics

Mimir

Time-series data with PromQL. Long-term retention, global-view queries, Prometheus-compatible API.

Logs

Loki

Label-indexed log aggregation. LogQL queries, a small storage footprint, Prometheus-style labels.

Traces

Tempo

Distributed tracing with Jaeger/Zipkin/OTLP support. Trace-to-log and trace-to-metric correlation.

Profiles

Pyroscope

Continuous profiling of CPU, memory, and goroutines. Flame graphs and comparison over time.
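As a rough illustration of how the query languages above are used, the sketch below builds request URLs for a PromQL instant query and a LogQL range query, and derives a LogQL filter from a trace ID for trace-to-log correlation. The host names are hypothetical placeholders, the helper names are invented for this example, and the code assumes Mimir's default `/prometheus` HTTP prefix and that log lines contain the trace ID as plain text.

```python
from urllib.parse import urlencode

# Hypothetical base URLs; replace with the endpoints of your own deployment.
MIMIR_BASE = "http://mimir.example.internal"
LOKI_BASE = "http://loki.example.internal"


def promql_instant_url(expr: str, base: str = MIMIR_BASE) -> str:
    """Build an instant-query URL for the Prometheus-compatible HTTP API.

    Assumes Mimir serves that API under its default "/prometheus" prefix.
    """
    return f"{base}/prometheus/api/v1/query?{urlencode({'query': expr})}"


def logql_range_url(expr: str, limit: int = 100, base: str = LOKI_BASE) -> str:
    """Build a range-query URL for Loki's HTTP API."""
    params = urlencode({"query": expr, "limit": limit})
    return f"{base}/loki/api/v1/query_range?{params}"


def logql_for_trace(trace_id: str, namespace: str) -> str:
    """Return a LogQL query that filters one namespace's logs by trace ID.

    Assumes the application writes the trace ID into each log line.
    """
    return f'{{namespace="{namespace}"}} |= "{trace_id}"'


if __name__ == "__main__":
    print(promql_instant_url("sum(rate(http_requests_total[5m]))"))
    print(logql_range_url(logql_for_trace("abc123", "demo")))
```

The URLs can be passed to any HTTP client. In a multi-tenant setup, additional headers may be required, which this sketch leaves out.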

Telemetry Pipeline

Grafana Alloy is the single agent that collects everything, replacing Promtail, Grafana Agent, and OTel Collector.

graph LR
  WL["Workloads<br/>DaemonSet on every node"]:::workloads
  ALLOY["Grafana Alloy<br/>StatefulSet + DaemonSet"]:::alloy
  MIMIR["Mimir<br/>Metrics"]:::storage
  LOKI["Loki<br/>Logs"]:::storage
  TEMPO["Tempo<br/>Traces"]:::storage
  PYRO["Pyroscope<br/>Profiles"]:::storage
  GRAF["Grafana<br/>SSO via Keycloak"]:::grafana

  WL -->|"scrape + forward"| ALLOY
  ALLOY -->|"metrics"| MIMIR
  ALLOY -->|"logs"| LOKI
  ALLOY -->|"traces"| TEMPO
  ALLOY -->|"profiles"| PYRO
  MIMIR --> GRAF
  LOKI --> GRAF
  TEMPO --> GRAF
  PYRO --> GRAF

  classDef workloads fill:#1e293b,stroke:#e2e8f0,color:#e2e8f0,stroke-width:2px
  classDef alloy fill:#0e3a3a,stroke:#06b6d4,color:#67e8f9,stroke-width:2px
  classDef storage fill:#2e2a0e,stroke:#facc15,color:#fde68a,stroke-width:2px
  classDef grafana fill:#14332a,stroke:#4ade80,color:#86efac,stroke-width:2px
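
The fan-out in the diagram above can be modeled as a small routing table: each signal type maps to exactly one storage backend. This is only a conceptual sketch in Python, not Alloy configuration. The `Telemetry` type and `route` function are invented for illustration.

```python
from dataclasses import dataclass

# Signal type -> storage backend, mirroring the edges of the diagram.
ROUTES = {
    "metrics": "Mimir",
    "logs": "Loki",
    "traces": "Tempo",
    "profiles": "Pyroscope",
}


@dataclass(frozen=True)
class Telemetry:
    """A single piece of telemetry (hypothetical type for this sketch)."""
    signal: str   # one of the keys in ROUTES
    payload: str  # opaque content; unused by the routing logic


def route(item: Telemetry) -> str:
    """Return the name of the backend that stores this signal type."""
    try:
        return ROUTES[item.signal]
    except KeyError:
        raise ValueError(f"unknown signal type: {item.signal!r}") from None


if __name__ == "__main__":
    for signal in ROUTES:
        print(signal, "->", route(Telemetry(signal, "example")))
```

Every backend then serves as a Grafana data source, so the routing table also describes which data source answers which kind of query.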

All Components

Prometheus

production

Pull-based metrics collection with multi-dimensional data model and powerful PromQL query language.

Role: Primary metrics scraping for all platform services via ServiceMonitors

Grafana

production

Visualization platform connecting metrics, logs, traces, and profiles in unified dashboards.

Role: Central observability UI with pre-built dashboards for every platform component

Mimir

production

Horizontally-scalable long-term metrics storage with Prometheus-compatible API.

Role: Indefinite metrics retention with high compression and fast queries

Loki

production

Log aggregation system inspired by Prometheus that indexes labels, not full log lines.

Role: Centralized logging with LogQL queries across all namespaces

Tempo

production

Distributed tracing backend supporting Jaeger, Zipkin, and OpenTelemetry formats.

Role: End-to-end request tracing across microservices

Pyroscope

production

Continuous profiling platform for CPU, memory, goroutine, and lock contention analysis.

Role: Runtime performance profiling with flame graph visualization

Grafana Alloy

production

Unified telemetry collector replacing Promtail, Grafana Agent, and OpenTelemetry Collector.

Role: Single agent collecting metrics, logs, traces, and profiles from all nodes

OpenCost

production

Real-time Kubernetes cost monitoring with per-namespace and per-workload breakdown.

Role: Infrastructure cost visibility and optimization recommendations
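
To make the per-namespace and per-workload breakdown concrete, the sketch below rolls per-workload cost records up into per-namespace totals. It does not call OpenCost. The record layout and all sample values are invented for illustration, and costs are kept in integer cents to avoid floating-point rounding.

```python
from collections import defaultdict

# Invented sample records: (namespace, workload, cost in cents per day).
SAMPLE_RECORDS = [
    ("payments", "api", 420),
    ("payments", "worker", 180),
    ("search", "indexer", 300),
]


def cost_by_namespace(records):
    """Sum per-workload costs into a per-namespace total (in cents)."""
    totals = defaultdict(int)
    for namespace, _workload, cents in records:
        totals[namespace] += cents
    return dict(totals)


if __name__ == "__main__":
    for namespace, cents in sorted(cost_by_namespace(SAMPLE_RECORDS).items()):
        print(f"{namespace}: {cents / 100:.2f} per day")
```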