2026 Homelab Infrastructure
Layered homelab platform for local AI inference, media automation, storage, platform engineering practice, and a long-term path toward MLOps engineering.
Overview
This project documents a 2026 homelab built around operational separation of concerns.
The environment is split into four layers:
- Compute for GPU-backed inference
- Control for routing and orchestration
- Data for media, downloads, and storage
- Interface for operator access and development
The goal is to keep stable services reliable while preserving room for experimentation. It is also the environment I am using to learn the operational side of AI systems, including local model serving, request routing, and the infrastructure patterns that lead toward MLOps work.
Architecture
AI Node
Dedicated GPU compute host for model serving.
- Platform: AM4
- GPU: RTX 4070 Ti Super, 16 GB VRAM
- RAM target: 64 GB
- OS: Ubuntu Server 24.04
- Services:
vLLMOllamaComfyUIplanned
- Ports:
8000forvLLM11434forOllama
Control and Lab Node
Optiplex 3080 SFF running Proxmox.
- CPU: Intel i5
- RAM target: 32 to 64 GB
- OS: Proxmox
VMs:
- Gateway VM
- FastAPI request routing
- Nginx or Caddy reverse proxy
- logging and metrics
- ports
80/443and8080
- Rocky Linux VM
- RHCSA practice
- SELinux,
systemd, storage, networking, users and groups
- Lab VM
- Docker testing
- k3s evaluation
- temporary workloads
Media and Storage Node
Stable media and storage host.
- Hardware: HP Elitedesk G4
- RAM target: 16 to 32 GB
- Storage:
2 x 4 TBHDD for media256 GBNVMe for OS
- OS: Ubuntu Server 24.04
- Services:
- Jellyfin
- Sonarr
- Radarr
- Prowlarr
- qBittorrent
- Immich
Ports:
8096Jellyfin8989Sonarr7878Radarr9696Prowlarr8081qBittorrent2283Immich
Storage paths:
/mnt/media/mnt/downloads/mnt/photos
Main Workstation
Primary operator interface into the environment.
- Platform: AM5
- GPU: RX 7800 XT or equivalent
- OS: Windows with optional dual boot
- Usage:
- SSH
- VS Code
- Parsec
- gaming
Network
IP Scheme
| Device | Hostname | IP |
|---|---|---|
| Router | gateway | 192.168.1.1 |
| AI Node | ai-node | 192.168.1.7 |
| Proxmox Host | proxmox | 192.168.1.8 |
| Gateway VM | ai-gateway | 192.168.1.20 |
| Rocky VM | rocky-lab | 192.168.1.21 |
| Media Node | media-node | 192.168.1.9 |
Request Flow
- Client traffic enters through
ai-gateway. - The gateway decides between local inference and cloud fallback.
- Local requests route to the AI node through
vLLM. - Cloud requests route to providers such as OpenAI or Anthropic.
- Logging and metrics remain centralized at the gateway layer.
- Media and storage workloads remain isolated on the media node.
flowchart TD WS[Main Workstation<br/>AM5 PC] --> GW[Gateway VM<br/>FastAPI + Reverse Proxy] GW --> AI[AI Node<br/>vLLM + Ollama] GW --> CLOUD[Cloud AI Providers<br/>OpenAI / Anthropic] GW --> MEDIA[Media Node<br/>ARR Stack + Immich] PX[Optiplex 3080<br/>Proxmox Host] --> GW PX --> ROCKY[Rocky Linux VM<br/>RHCSA Practice] PX --> LAB[Lab VM<br/>Docker / k3s]
Tech Stack
- Proxmox for virtualization
- Ubuntu Server 24.04 for bare-metal Linux hosts
- Rocky Linux for RHCSA practice
- FastAPI for request routing
- Nginx or Caddy for reverse proxying
vLLMand Ollama for local AI serving- Jellyfin, Sonarr, Radarr, Prowlarr, qBittorrent, and Immich for media services
- NFS-backed shared storage where appropriate
Current Status
Completed
- Ubuntu installed on the AI node
- NVIDIA drivers working
In Progress
- Proxmox deployment on the Optiplex
- Ubuntu deployment on the Elitedesk
Planned
vLLMdeployment- FastAPI gateway
- ARR stack deployment
- Immich deployment
- Cloud fallback routing
- Logging and metrics
Design Principles
- One machine, one role
- Local-first AI with cloud fallback
- Stable systems stay stable
- Complexity must justify itself
Future Improvements
- k3s cluster evaluation
- Monitoring stack
- CI/CD pipelines
- Auth and rate limiting
- Tailscale for external access
What This Project Demonstrates
- Infrastructure design with clear operational boundaries
- GPU-backed local model serving
- Hybrid local and cloud inference routing
- Hands-on progress toward MLOps through model-serving infrastructure
- Separation of stable services from experimental systems
- Practical systems design beyond single-host container deployment