Show HN: Overfitted a 900KB Transformer to Compress a 100MB CSV into 7MB
- AI
- Infrastructure
- Open Source
- Developer Tools
The post shows a file-specific lossless compressor: train a 900 KB transformer to overfit a single file, use its byte predictions as probabilities for arithmetic coding, and ship both the coded residual and the model weights. On a 100 MB NYC taxi CSV, the author reports about 7 MB output. On a 100 MB slice of enwik9, the result is about 21 MB including the model. Runtime is the obvious catch. Training takes 20 to 30 minutes and compression and decompression each take about 45 minutes on a consumer GPU.
Treat this as a research curiosity, not a deployable codec. If you work on compression, the practical benchmark is still against strong domain codecs like ZPAQ and LZMA with published datasets, fixed settings, and total size including model weights.
- news.ycombinator.com
- Discuss on HN