SeaBiscuit

Inference on Idle Hardware

SeaBiscuit Documentation

Reference for installing, operating, and tuning SeaBiscuit on‑prem.

System requirements (validated tiers)

The table below lists the lowest configurations we have tested end-to-end. Hardware below the Baseline tier may still work for smaller or quantised models, but it is not yet covered by support SLAs.

| Tier | Use case | CPU | RAM | GPU / VRAM | NIC |
|------|----------|-----|-----|------------|-----|
| Baseline | Models ≤ 13 B params, 4-bit or FP16 | 4 cores (AVX2) | 16 GB | Optional; any PCIe GPU | 10 GbE |
| Large-model | 30 B–70 B param splits | 8 cores (AVX-512) | 32 GB | ≥1 GPU with ≥16 GB VRAM (A100/H100, MI-300, etc.) | 25 GbE RDMA or HDR InfiniBand |

Both tiers assume at least 50 GB of free NVMe for model files and logs.
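As a rough sanity check, a dense checkpoint's on-disk footprint is parameter count × bytes per parameter. The sketch below (hypothetical helpers, not part of SeaBiscuit) compares a model against the 50 GB NVMe budget, reserving a few GB for logs:

```python
def checkpoint_gb(params_billion: float, bits_per_param: int) -> float:
    """Approximate checkpoint size in GB for a dense model."""
    return params_billion * 1e9 * bits_per_param / 8 / 1e9

def fits_nvme_budget(params_billion: float, bits_per_param: int,
                     budget_gb: float = 50.0, log_reserve_gb: float = 5.0) -> bool:
    """True if the model file plus a log reserve fits in the NVMe budget."""
    return checkpoint_gb(params_billion, bits_per_param) + log_reserve_gb <= budget_gb

# 13 B parameters at FP16 ≈ 26 GB: fits in 50 GB with room for logs.
print(fits_nvme_budget(13, 16))   # True
# 70 B at FP16 ≈ 140 GB: needs far more than the baseline 50 GB.
print(fits_nvme_budget(70, 16))   # False
```

The 5 GB log reserve is an illustrative assumption; size it to your retention policy.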

Network guidelines

SeaBiscuit adapts its partitioning to the available fabric. Splitting one model across hosts requires sustained bandwidth of at least 3 GB/s (roughly 25 GbE line rate) or InfiniBand FDR/HDR.

| Fabric | Minimum | Preferred |
|--------|---------|-----------|
| Ethernet | 25 GbE RDMA/RoCE | 100 GbE |
| InfiniBand | FDR 56 Gb/s | HDR 200 Gb/s |

10 GbE networks are supported, but the planner keeps any tensor larger than 1 MB on the same host.
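The planner's rule can be summarised as: split across hosts only when the fabric sustains ≥ 3 GB/s; otherwise pin large tensors locally. A minimal sketch of that decision, with hypothetical function names (the real planner is internal to SeaBiscuit):

```python
SPLIT_THRESHOLD_GBPS = 3.0    # sustained GB/s needed to split one model across hosts
LOCAL_TENSOR_LIMIT = 1 << 20  # on slower fabrics, tensors above ~1 MB stay host-local

def can_split_across_hosts(sustained_gb_per_s: float) -> bool:
    """Fast fabrics (25 GbE RDMA, FDR/HDR InfiniBand) clear this bar."""
    return sustained_gb_per_s >= SPLIT_THRESHOLD_GBPS

def place_tensor(size_bytes: int, sustained_gb_per_s: float) -> str:
    """Return 'remote-ok' or 'same-host' for a tensor, mirroring the planner rule."""
    if can_split_across_hosts(sustained_gb_per_s):
        return "remote-ok"
    return "same-host" if size_bytes > LOCAL_TENSOR_LIMIT else "remote-ok"

# 10 GbE sustains roughly 1.2 GB/s, so a 4 MB tensor is pinned to one host.
print(place_tensor(4 << 20, 1.2))   # same-host
```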

Placement config

# /etc/seabiscuit/policy.yaml
prefer_same_rack: true
max_block_memory_gb: 16
cpu_block_allow:
  - layer_norm
  - softmax
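It can be useful to sanity-check a policy before deploying it. The sketch below validates the parsed policy as a plain dict; the allowed-op set and error messages are illustrative assumptions, not SeaBiscuit's actual schema:

```python
# Assumed allowlist, based on the ops shown in the example policy above.
ALLOWED_CPU_BLOCKS = {"layer_norm", "softmax"}

def validate_policy(policy: dict) -> list[str]:
    """Return a list of problems found in a placement policy dict (empty = OK)."""
    errors = []
    if not isinstance(policy.get("prefer_same_rack"), bool):
        errors.append("prefer_same_rack must be a boolean")
    mem = policy.get("max_block_memory_gb")
    if not isinstance(mem, (int, float)) or mem <= 0:
        errors.append("max_block_memory_gb must be a positive number")
    for op in policy.get("cpu_block_allow", []):
        if op not in ALLOWED_CPU_BLOCKS:
            errors.append(f"unknown CPU block: {op}")
    return errors

policy = {
    "prefer_same_rack": True,
    "max_block_memory_gb": 16,
    "cpu_block_allow": ["layer_norm", "softmax"],
}
print(validate_policy(policy))  # []
```

In practice you would load the dict from /etc/seabiscuit/policy.yaml with a YAML parser such as PyYAML before validating.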

Monitoring

SeaBiscuit exposes Prometheus-format metrics at /metrics. A prebuilt Grafana dashboard (JSON) ships in dashboards/.
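For quick checks without Prometheus, the text scraped from /metrics can be parsed directly. A minimal sketch for label-free gauges and counters; the metric name is hypothetical:

```python
def parse_prometheus(text: str) -> dict[str, float]:
    """Parse simple Prometheus text-exposition lines into {metric: value}.

    Skips comments and labelled series, which is enough for plain gauges/counters.
    """
    metrics = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        name, _, value = line.rpartition(" ")
        if name and "{" not in name:
            try:
                metrics[name] = float(value)
            except ValueError:
                pass  # ignore NaN-like or malformed values we don't handle
    return metrics

sample = """\
# HELP seabiscuit_tokens_total Tokens generated (hypothetical metric name)
# TYPE seabiscuit_tokens_total counter
seabiscuit_tokens_total 12345
"""
print(parse_prometheus(sample))  # {'seabiscuit_tokens_total': 12345.0}
```

In practice, fetch the text with urllib.request.urlopen("http://HOST:PORT/metrics") and feed it to the parser.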

Glossary

| Term | Definition |
|------|------------|
| AVX2 / AVX-512 | CPU SIMD extensions; speed up FP16 math on CPUs. |
| GbE | Gigabit Ethernet; link speed rated in gigabits per second. |
| FDR / HDR | InfiniBand generations at 56 Gb/s and 200 Gb/s. |
| Min-cut | Graph-partitioning algorithm SeaBiscuit uses to split models. |
| RoCE | RDMA over Converged Ethernet. |
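To make the min-cut idea concrete: treat model blocks as graph nodes, activation traffic between them as edge weights, and look for the two-host split that crosses the least traffic. A toy brute-force illustration (block names and traffic figures are invented; SeaBiscuit's actual partitioner is not shown here):

```python
from itertools import combinations

# Toy layer graph: edges (a, b, activation_mb) between consecutive blocks.
edges = [("embed", "attn0", 64), ("attn0", "mlp0", 8),
         ("mlp0", "attn1", 64), ("attn1", "head", 8)]
nodes = ["embed", "attn0", "mlp0", "attn1", "head"]

def cut_cost(group_a) -> int:
    """Total activation traffic (MB) crossing the partition boundary."""
    return sum(w for a, b, w in edges if (a in group_a) != (b in group_a))

# Brute-force the minimum cut over all non-trivial splits into two hosts.
best = min((frozenset(c) for r in range(1, len(nodes))
            for c in combinations(nodes, r)), key=cut_cost)
print(sorted(best), cut_cost(best))  # the cheapest split crosses 8 MB
```

Brute force is exponential in the number of blocks; real partitioners use polynomial-time min-cut/max-flow algorithms instead.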

FAQ

Can I run on a single laptop?

Yes, for small (≤ 7 B) models, but performance will be limited and single-laptop deployments are not covered by support.

Does SeaBiscuit support training?

Not yet; SeaBiscuit currently supports inference only.