Tokenomics: Quantifying Where Tokens Are Used in Agentic Software Engineering
- AI
- Developer Tools
- Economics
The paper tries to put numbers on a question most teams are still hand-waving away: where do tokens actually get burned when you use agentic coding systems. Across 30 software engineering tasks, it reports that code review and debugging are the biggest sinks and that input tokens make up the majority of usage on average. That lines up with what many people are seeing in practice. Agents spend far more time reading code, tool outputs, and prior context than producing code. For large codebases, that imbalance can get extreme. Several people said they regularly see input-heavy ratios closer to 10:1, with agents ingesting huge amounts of context to make tiny edits.
If you are deploying coding agents, start tracking token spend by workflow stage now instead of treating it as one blended bill. The obvious levers are reducing context bloat, caching repeated prefixes, and pushing cheap models into refinement, review, and other grunt work before you negotiate anything with vendors.
- arxiv.org
- Discuss on HN