Transformers are inherently succinct
- AI
- Machine Learning
- Research
- Verification
The paper is a complexity-theory result about the transformer architecture, not a benchmark on chatbots. Its core claim is that transformers can be exponentially more succinct than linear temporal logic, meaning they can describe some languages with far smaller representations than older symbolic formalisms. The sting is in the corollary. Once a model class gets that compact, even basic questions like whether it accepts anything at all or whether two models are equivalent become EXPSPACE-complete in the worst case. That landed as the practical headline for most readers: if you want a system whose behavior you can prove, a transformer is a bad substrate for the thing being proved.
If you are betting on LLMs in safety-critical or formally verified systems, assume verification will stay a major bottleneck and design hard boundaries around what the model is allowed to do. Also do not overread a complexity-theory result as a direct statement about today's trained models or product behavior.
- openreview.net
- Discuss on HN