🏗️ INFRASTRUCTURE

أساس المعدن العاري

The bare-metal foundation: immutable OS, container runtime, and cluster orchestration.

طوبولوجيا الكلستر

Control Plane (3 nodes — HA with Raft consensus)
  ├── controlplane-100  192.168.0.100  Ready
  ├── controlplane-101  192.168.0.101  Ready
  └── controlplane-102  192.168.0.102  Ready

API Server VIP (ARP-based failover)
  └── k8s.edgeprime.io  192.168.0.250  Active  ~10s failover

Worker Nodes (5 nodes — workload execution)
  ├── worker-103        192.168.0.103  Ready
  ├── worker-104        192.168.0.104  Ready
  ├── gpu-105           192.168.0.151  Ready  3× NVIDIA GPU
  ├── worker-106        192.168.0.150  Ready
  └── gx10              192.168.0.224  Ready  GB10 Blackwell

OS: Talos Linux v1.12.6 (immutable, API-managed, no SSH)
Runtime: containerd
Kubernetes: v1.35.3
API: https://k8s.edgeprime.io:6443 (VIP: 192.168.0.250)

جرد العقد

مواصفات الأجهزة لكل عقدة في الكلستر — من أجهزة مكتبية مُعاد استخدامها إلى NVIDIA DGX Spark.

العقدة المعالج النوى الذاكرة التخزين GPU نظام التشغيل
controlplane-100 (192.168.0.100) AMD A8 PRO-7600B 4 16 GB 256 GB SSD Talos v1.12.6
controlplane-101 (192.168.0.101) Intel Pentium G3250T 2 8 GB 256 GB SSD Talos v1.12.6
controlplane-102 (192.168.0.102) Intel i5-4690K 4 16 GB 512 GB SSD Talos v1.12.6
worker-103 (192.168.0.103) Intel i5-7500T 4 24 GB 512 GB NVMe Talos v1.12.6
worker-104 (192.168.0.104) Intel i5-7500T 4 24 GB 512 GB NVMe Talos v1.12.6
gpu-105 (192.168.0.151) AMD Ryzen 5 5600X 12 96 GB 512 GB NVMe RTX 2060 + 2× RTX 3060 Ti Talos v1.12.6
worker-106 (192.168.0.150) Intel i7-4790K 8 16 GB 128 GB SSD Talos v1.12.6
gx10 (192.168.0.224) NVIDIA Grace 20 128 GB 1 TB GB10 Blackwell Ubuntu 24.04 (DGX OS)
إجمالي النوى 58 إجمالي الذاكرة: 328 GB

المكونات الأساسية

Talos Linux

production

Minimal, immutable Linux distribution designed specifically for Kubernetes. No SSH, no shell — managed entirely through API.

الدور: Node operating system for all Talos cluster nodes (3 control plane + 3 workers, including 1 GPU worker)

containerd

production

Industry-standard container runtime with low overhead and broad compatibility.

الدور: Container runtime on all nodes

Kubernetes

production

Production-grade container orchestration system for automating deployment, scaling, and management.

الدور: Core orchestration platform running v1.35.2