CATALOGUE DES COMPOSANTS

Le Stack complet

57 composants de production répartis dans 9 catégories — tous open-source, tous éprouvés.

🏗️ Infrastructure & OS (3) 🌐 Networking & Service Mesh (5) 🛡️ Security & Identity (7) 📊 Observability (9) 🔄 GitOps & CI/CD (4) 💾 Storage & Registry (6) 🗄️ Databases & Messaging (6) 🚀 Application Platform (12) 🤖 AI & Machine Learning (5)

🏗️

Infrastructure & OS

The bare-metal foundation: immutable OS, container runtime, and cluster orchestration.

APPROFONDIR →

Talos Linux

production

Minimal, immutable Linux distribution designed specifically for Kubernetes. No SSH, no shell — managed entirely through API.

Rôle : Node operating system for all Talos cluster nodes (3 control plane + 3 workers, including 1 GPU worker)

containerd

production

Industry-standard container runtime with low overhead and broad compatibility.

Rôle : Container runtime on all nodes

Kubernetes

production

Production-grade container orchestration system for automating deployment, scaling, and management.

Rôle : Core orchestration platform running v1.35.2

🌐

Networking & Service Mesh

eBPF-powered networking, Gateway API ingress, service mesh, and DNS resolution.

APPROFONDIR →

Cilium

production

eBPF-based networking, observability, and security. Replaces kube-proxy with high-performance service load balancing.

Rôle : CNI plugin, network policy enforcement, L2 ARP announcement, Gateway API implementation

Hubble

production

Network observability platform built on Cilium eBPF data plane for deep visibility into communication and behavior.

Rôle : Network flow observability, service dependency mapping

Gateway API

production

Next-generation Kubernetes ingress API with expressive routing, TLS termination, and traffic splitting.

Rôle : Single shared gateway handling all HTTP/HTTPS traffic at 192.168.0.200

APISIX

production

High-performance, cloud-native API gateway with rich traffic management features.

Rôle : Advanced API gateway for complex routing scenarios

CoreDNS

production

Flexible, extensible DNS server for Kubernetes service discovery.

Rôle : Cluster DNS with wildcard resolution for *.exitthecloud.eu

🛡️

Security & Identity

Zero-trust security: SSO, secrets management, policy enforcement, runtime detection, and certificate automation.

APPROFONDIR →

Keycloak

production

Enterprise identity and access management with OIDC, SAML, social login, and LDAP integration.

Rôle : Centralized SSO for all platform services — Vault, Harbor, Grafana, ArgoCD, OneDev, Devtron

HashiCorp Vault

production

Secrets management, encryption as a service, and privileged access management.

Rôle : HA deployment (3-replica Raft cluster) storing all platform secrets, DNS credentials, TLS certificates

External Secrets Operator

production

Kubernetes operator that synchronizes secrets from external stores into Kubernetes secrets.

Rôle : Bridges Vault ↔ Kubernetes: syncs secrets to pods, pushes certificates back to Vault

cert-manager

production

Automatic TLS certificate management with Let's Encrypt ACME protocol support.

Rôle : Automated certificate issuance via DNS-01 challenges with Cloudflare

Kyverno

production

Kubernetes-native policy engine for validation, mutation, and generation of resources.

Rôle : Enforces security policies, injects PriorityClasses via mutation for charts that don't propagate values, image proxy rewriting to Harbor

Kubescape

deployed

Kubernetes security platform for continuous scanning against NSA, MITRE, and CIS benchmarks.

Rôle : Compliance scanning and hardening recommendations

Open AppSec

deployed

ML-based web application firewall and API security.

Rôle : WAF protection for exposed services

📊

Observability

Full-spectrum observability: metrics, logs, traces, profiles, and cost monitoring in a unified stack.

APPROFONDIR →

Prometheus

production

Pull-based metrics collection with multi-dimensional data model and powerful PromQL query language.

Rôle : Primary metrics scraping for all platform services via ServiceMonitors

Grafana

production

Visualization platform connecting metrics, logs, traces, and profiles in unified dashboards.

Rôle : Central observability UI with pre-built dashboards for every platform component

Mimir

production

Horizontally-scalable long-term metrics storage with Prometheus-compatible API.

Rôle : Indefinite metrics retention with high compression and fast queries

Loki

production

Log aggregation system inspired by Prometheus — indexes labels, not full log lines.

Rôle : Centralized logging with LogQL queries across all namespaces

HolmesGPT

production

AI-powered DevOps and Kubernetes troubleshooting assistant.

Rôle : 24/7 background operator for autonomous investigation and scheduled health checks using Minimax LLM

Tempo

production

Distributed tracing backend supporting Jaeger, Zipkin, and OpenTelemetry formats.

Rôle : End-to-end request tracing across microservices

Pyroscope

production

Continuous profiling platform for CPU, memory, goroutine, and lock contention analysis.

Rôle : Runtime performance profiling with flame graph visualization

Grafana Alloy

production

Unified telemetry collector replacing Promtail, Grafana Agent, and OpenTelemetry Collector.

Rôle : Single agent collecting metrics, logs, traces, and profiles from all nodes

OpenCost

production

Real-time Kubernetes cost monitoring with per-namespace and per-workload breakdown.

Rôle : Infrastructure cost visibility and optimization recommendations

🔄

GitOps & CI/CD

Git-driven deployment pipelines with progressive delivery and infrastructure-as-code.

APPROFONDIR →

Argo CD

production

GitOps continuous delivery tool that reconciles desired state from Git with cluster state.

Rôle : Core GitOps engine with App-of-Apps pattern managing 43+ applications

Terraform

production

Infrastructure as Code for provisioning and managing cloud-agnostic resources.

Rôle : Manages Vault secrets, Keycloak OIDC clients, Grafana dashboards, Harbor config

OneDev

production

Self-hosted Git repository manager with integrated CI/CD pipelines and code review.

Rôle : Private Git hosting with container-based CI runners

Renovate

production

Automated dependency update tool that keeps Helm charts, container images, and package versions current.

Rôle : Continuous dependency updates via automated pull requests

💾

Storage & Registry

Distributed block storage, S3-compatible object storage, and secure container registry.

APPROFONDIR →

Longhorn

production

Cloud-native distributed block storage with 3-way replication, snapshots, and backups.

Rôle : Primary storage class for all stateful workloads with automatic replication

Harbor

production

Enterprise container registry with vulnerability scanning, image signing, and RBAC.

Rôle : Private registry with Trivy scanning, OIDC auth, and replication policies

Garage

production

S3-compatible distributed object storage designed for self-hosted deployments.

Rôle : Cost-effective object storage for backups, logs, and unstructured data

RustFS

production

High-performance S3-compatible object storage written in Rust with WebUI console and OIDC SSO.

Rôle : Primary S3 object storage with Keycloak SSO, NFS-backed persistence, and admin console

Velero

deployed

Kubernetes backup and disaster recovery tool with snapshot and restore capabilities.

Rôle : Cluster-wide backup to S3 with scheduled policies

NFS CSI Driver

production

Kubernetes CSI driver for NFS file shares with dynamic provisioning support.

Rôle : NFS storage provisioner for shared file access across pods

🗄️

Databases & Messaging

Managed PostgreSQL, Redis-compatible cache, distributed KV store, Kafka streaming, and multi-model databases.

APPROFONDIR →

CloudNativePG

production

Kubernetes operator for PostgreSQL with HA clustering, automated failover, and point-in-time recovery.

Rôle : Manages PostgreSQL clusters for 5+ applications (Keycloak, Paperless-ngx, Matomo, Honcho, n8n)

Dragonfly

production

Redis-compatible in-memory data store with superior performance through modern algorithms.

Rôle : High-performance caching layer replacing Redis

Strimzi (Apache Kafka)

production

Kubernetes operator for Apache Kafka with native CRD-based management.

Rôle : Event streaming platform for asynchronous communication

TiKV

production

Distributed transactional key-value store with ACID transactions and Raft consensus.

Rôle : Backend storage engine for SurrealDB with strong consistency

SurrealDB

production

Multi-model database supporting document, graph, and key-value data models.

Rôle : Flexible database for applications needing graph + document queries

Qdrant

production

Vector database for similarity search, powering semantic search and AI applications.

Rôle : Vector embeddings store for AI/ML workloads

🚀

Application Platform

Developer portal, BaaS, workflow automation, analytics, and self-service tools.

APPROFONDIR →

Supabase

production

Open-source Firebase alternative: PostgreSQL, auth, real-time, storage, and edge functions.

Rôle : Backend-as-a-Service for rapid application development

n8n

production

Self-hosted workflow automation with 400+ integrations and visual builder.

Rôle : Event-driven automation for platform operations and notifications

Matomo

production

Privacy-focused web analytics platform — self-hosted Google Analytics alternative.

Rôle : Visitor tracking without third-party data sharing

Homepage

production

Application dashboard providing a unified start page for all platform services.

Rôle : Central dashboard linking all 30+ platform services

Paperless-ngx

production

Document management system with AI-powered tagging, OCR, and full-text search.

Rôle : Digitize, index, and archive documents with automatic classification

Firecrawl

production

AI-ready web data extraction platform that crawls and converts web pages to clean LLM-ready data.

Rôle : Web scraping and data extraction for AI pipelines

Stirling-PDF

production

Self-hosted PDF manipulation toolkit — merge, split, convert, compress, and OCR.

Rôle : PDF processing utilities for document workflows

IT-Tools

production

Collection of handy developer utilities — encoders, converters, generators, and network tools.

Rôle : Developer utility dashboard for day-to-day tasks

Honcho

production

Agent memory and state management for building context-aware AI applications.

Rôle : Persistent memory layer for AI agent workflows

Mailpit

production

Email testing tool that captures SMTP messages for development and QA environments.

Rôle : Email sandbox for testing notification and transactional emails

Devtron

production

Kubernetes application lifecycle management platform with CI/CD, debugging, and deployment automation.

Rôle : Application delivery dashboard for Kubernetes workloads with OIDC SSO

Rclone

production

Cloud storage sync and management tool supporting 70+ providers with WebUI.

Rôle : File sync and backup between local storage and S3/cloud endpoints

🤖

AI & Machine Learning

Edge AI inference on NVIDIA DGX Spark with Blackwell GPU — LLM model serving via AIBrix and vLLM on bare-metal Kubernetes.

APPROFONDIR →

NVIDIA DGX Spark

production

Desktop AI supercomputer powered by Grace Blackwell GB10 Superchip — 1 PFLOP FP4, 128GB unified LPDDR5x memory, ARM64 architecture.

Rôle : Dedicated GPU worker node (gx10) with Blackwell GPU, CUDA 13.0, and ConnectX-7 networking

AIBrix

production

Open-source Kubernetes-native AI inference platform with prefix-cache-aware routing, LLM-specific autoscaling, and distributed KV cache.

Rôle : LLM model serving control plane — 3-wave ArgoCD deployment with Envoy Gateway routing

vLLM

production

High-throughput LLM inference engine with PagedAttention, continuous batching, and OpenAI-compatible API.

Rôle : Inference runtime serving Qwen, Llama, and Mistral models via NVIDIA NGC images on ARM64

NVIDIA GPU Operator

production

Kubernetes operator automating GPU driver, container toolkit, device plugin, and DCGM exporter lifecycle.

Rôle : GPU resource management with driver-less mode for DGX OS — exposes nvidia.com/gpu to scheduler

Hindsight

production

Temporal semantic memory system for AI agents — retain, recall, and reflect operations backed by pgvector similarity search.

Rôle : Agent memory layer with GPU-accelerated local embeddings and reranking, powered by minimax LLM