AI, Ashby Engineering, and the future

AI
Programming
Developer Tools
Startups

Ashby’s post lays out a pragmatic AI-assisted engineering model rather than a pure “replace programmers with agents” pitch. The company says engineers should use LLMs in two modes: a close-in “sidekick” mode for interactive help, and a more hands-off “delegate” mode for scoped tasks. It also insists that humans remain responsible for every line shipped, and tries to show that rising AI usage has not caused an obvious blowup in customer-reported issues.

That basic operating model landed better than the slogan sitting on top of it. Many readers hated the claim that the “cost of producing code is heading towards zero,” because it collapses the expensive parts of software into typing. The stronger reading of the post was that keystrokes are getting cheaper while the hard parts stay put: deciding what to build, communicating intent, reviewing changes, checking correctness, and absorbing the cost of failure. That is why the most useful part of the piece for experienced engineers was the split between sidekick and delegate workflows. Several people said sidekick mode matches reality better because code review and understanding quickly become the bottleneck when an agent can spray out large diffs. The biggest skepticism was aimed at Ashby’s evidence. The chart tying AI adoption to stable customer issue rates looked too thin to support the thesis, especially because it covered a short time window, bundled usage into crude buckets, and waved away a visible seasonal spike as a recurring blip. The author conceded that the data does not prove AI keeps quality stable. At best it suggests there was no immediate bug apocalypse. That admission sharpened the real takeaway: if AI is helping, the proof is not in a high-level vanity chart. It is in whether a team can keep review quality high while output rises. That led straight into the incentive problem. Readers did not dispute that “you are responsible for what you ship” sounds good. They doubted most organizations will enforce it when promotion systems still reward visible velocity more than legible quality. Ashby’s leaders replied that they inspect PRs during performance reviews and call out both weak code structure and strong communication. That answer reassured some people, but it also underlined how management-heavy this model is. AI-assisted engineering only works if technical managers are close enough to the work to spot when people are using the tools to think better versus using them to dump review debt on everyone else. A second thread widened the lens from coding to software businesses. Even if code gets much cheaper, several readers argued that this does not mean vertical SaaS markets suddenly fragment into thousands of winners. Enterprise buyers care about trust, depth, support, vendor risk, and operational maturity, not just feature checklists. Ashby’s own product became the example. Some hiring managers said it feels merely adequate or bloated, while the founders answered that they intentionally optimized for flexibility and coverage first because simple, elegant products keep losing in ATS buying processes. That exchange made the broader point more concrete than the AI memo itself: cheap code may lower entry costs, but it does not erase the messy commercial work of building serious software companies.

Treat AI coding policy as an org design problem, not a tooling rollout. If you cannot measure review load, defect rates, and performance incentives clearly, faster code generation will just move the bottleneck into product thinking, validation, and cleanup.

June 4, 2026
ashbyhq.com
Discuss on HN

Key insights

Sidekick mode matches real engineering work

Used this way, AI helps with drafting and exploration while the engineer stays inside the loop on intent, tradeoffs, and review. That framing is more credible than full delegation because the expensive work in software is still planning, communication, validation, maintenance, and managing failure modes like leaking customer PII.

Default to AI as an interactive assistant for most production work. Reserve autonomous generation for tightly bounded tasks where review is genuinely cheap and success is easy to verify.

Attribution:

samstokes #1 #2

Accountability only works with review-backed performance management

“You own every line” is empty unless managers inspect actual artifacts and penalize low-quality output even when it looks fast. Ashby’s answer was not more policy language. It was sampling PRs and specs in performance reviews so code structure, explanation quality, and debugging clarity show up in promotion decisions.

If you want responsible AI use, bake code quality and explanation quality into reviews with real examples from shipped work. Otherwise teams will optimize for visible throughput and bury the cleanup cost.

Attribution:

annoyingcyclist #1
colinhowe #1 #2

Review bandwidth is the hard limit

The bottleneck is not generating more code. It is whether humans actually read and understand what is being merged. One commenter described teams shipping worse bugs right after heavy AI adoption, and Ashby’s own response focused on technical managers and added tests rather than pretending review scales automatically with output.

Track how large AI-assisted diffs are, who reviews them, and how often defects escape afterward. If review quality drops as output rises, cut scope before you add more generation.

Attribution:

colinhowe #1
SpicyLemonZest #1

Enterprise software wins on coverage and flexibility first

Ashby’s founders and users sketched the same market reality from opposite sides. Hiring managers may see Ashby as just another acceptable ATS, or even bloated, but the company says it had to ship broad, customizable functionality fast because simpler and more elegant entrants keep losing to buyer requirements.

Do not confuse cheaper code with an easier go-to-market. In enterprise categories, depth, configurability, and support still decide deals long after the MVP is trivial to build.

Attribution:

Ben-G #1
darkwater #1
abhikp #1

Against the grain

AI output may just create janitorial work

This view rejects the article’s productivity framing entirely. Faster generation can turn a small group into cleanup staff for a larger group that now produces more low-quality code, while others comply performatively by routing normal work through paid AI tools to look aligned with management fashion.

Look for asymmetric review and maintenance load across the team after AI rollout. If a few strong engineers are absorbing the mess, your reported productivity gains are fake.

Attribution:

conartist6 #1

The bigger problem is missing engineering standards

This argument says the industry is overfocused on whether individuals think harder with AI, when the deeper failure is that software work remains too ad hoc. Mature fields reduce judgment calls with strong standards and reusable patterns. Software still relies on improvised craft, which makes both human and AI output unreliable.

Use AI rollout as a forcing function to standardize architecture, interfaces, and implementation patterns. Without that groundwork, better generation tools will mostly accelerate inconsistency.

Attribution:

0xbadcafebee #1

AI-written policy weakens trust in the message

Some readers were put off before they even got to the substance because the article read like it may have been AI-assisted. For a post asking engineers to trust management’s judgment about responsible AI use, that presentation choice undercuts credibility instead of reinforcing it.

For internal or public policy on AI adoption, write in a voice that clearly reflects human judgment. If the memo itself feels machine-smoothed, people will doubt the rigor behind the policy.

Attribution:

weakfish #1
pullshark91 #1

In plain english

ATS ↩

Applicant Tracking System, software companies use to collect applications and manage hiring workflows.

PII ↩

Personally identifiable information, data that can identify a specific person.

SaaS ↩

Software as a Service, meaning software you use over the internet that runs on someone else’s servers.

vendor risk ↩

The business and operational risk of relying on an outside software supplier that may fail, change terms, or not meet requirements.

Reference links

Original story

AI, Ashby Engineering, and the future
The post under discussion about how Ashby is using AI in engineering and how it thinks the workflow will change.

AI, Ashby Engineering, and the future

Discussion mood

Key insights

Against the grain

In plain english

Reference links

Original story