Launch HN: Expanse (YC P26) – Unlock Wasted GPU Capacity
- AI
- Infrastructure
- Developer Tools
- Startups
Expanse says GPU and HPC clusters waste huge amounts of capacity because users ask for far more walltime, memory, and compute than their jobs usually need. The company installs alongside SLURM or Kubernetes, reads submission scripts, source code, and live node telemetry, then predicts resource needs, likely failures, and code-level fixes before the job runs. Their key claim is that this is not an LLM wrapper. It is a cluster-specific multimodal model that learns how a particular environment behaves, because the same workload can perform very differently across hardware topologies.
If you operate expensive shared compute, the biggest near-term win may be better prediction and visibility around memory, walltime, and bursty job phases rather than smarter scheduling alone. If you build in this market, security posture and deployment model need to be legible up front or buyers will dismiss you before they get to the technical value.
- news.ycombinator.com
- Discuss on HN