Local Qwen isn't a worse Opus, it's a different tool
- AI
- Developer Tools
- Open Source
- Infrastructure
- Privacy
The post described one team’s experience running local coding models, mainly Qwen, on self-hosted GPU boxes. The core claim was simple: a local 27B or 35B-class model is not close to Opus on long, messy coding tasks, but it still earns its keep because it gives you privacy, fixed behavior, low marginal cost, and tight control over workflows that cloud models cannot safely touch. The author framed local models as especially good at codebase reading, repetitive tool use, and work in regulated or air-gapped environments, while admitting they still loop, lose the plot on bigger tasks, and need careful setup.
If you use LLMs seriously, stop treating model choice as a simple leaderboard problem and build evals around your own workflow, harness, and privacy constraints. For local deployments, the winning pattern looks less like “replace Claude” and more like “use a fast, controllable model for cheap repetitive work, codebase understanding, and sensitive data, then escalate harder tasks.”
- blog.alexellis.io
- Discuss on HN