AMD Strix Halo RDMA Cluster Setup Guide
- AI
- Hardware
- Open Source
- Infrastructure
The post is a hands-on guide for wiring up a two-node AMD Strix Halo cluster over RDMA, using 100Gb network cards so two 128GB unified-memory machines can share work on larger local models. The appeal is simple: prosumer hardware can now reach 256GB of pooled memory, which puts models that were previously server-only within reach of a home lab or small team. People also connected it to antirez’s DS4 work on DeepSeek and other big local models, where unified memory on Strix Halo or Macs is attractive because it can hold models that do not fit on ordinary consumer GPUs.
If you are evaluating local AI hardware, this guide is a useful proof that multi-box Strix Halo clustering is now practical, not just theoretical. But buying decisions should hinge on total system cost, bandwidth bottlenecks, and thermals, because the cluster only makes sense if you value unified-memory capacity more than raw tokens per second.
- github.com
- Discuss on HN