Self-hosted · open beta · v0.8.3
LLM Cluster
Distributed GPU sharing
String idle GPUs into your own compute pool. Team-shared scheduling, priority queues, transparent costs — like Slurm, but for the LLM era.
beta · v0.8.3
› $ cluster status
› nodes: 4 online · 12× A6000 · 8× 4090
› queue: 3 pending · 2 running · 18 done
› $ cluster submit train.yml
› job-471 → node-2 (RTX A6000)
› eta 1h 24m · cost est $1.18
$
deploySelf-hosted (k8s)
GPUNVIDIA · CUDA 12+
schedpriority + fair-share
licenseopen core
§ 01How it works
Idle GPUs → shared compute pool.
§ 01
Register nodes
Install agent on any GPU box — desktop, server, or cloud. Auto-detects.
§ 02
Submit jobs
YAML config or one-line CLI. Choose GPU class, deadline, priority.
§ 03
Monitor + bill
Live dashboard, per-user cost, exportable usage reports.
§ 02Features