Open Reproduction of DeepSeek-R1
- AI
- Open Source
- Machine Learning
- Developer Tools
The submitted repo is Hugging Face's Open-R1 project, an attempt to recreate DeepSeek-R1 in the open. The page sounds ambitious, but the key update, buried in the repo, is that only step 1 is done: a 350,000-example reasoning dataset called Mixture-of-Thoughts and a recipe for a 7B distilled model meant to match DeepSeek-R1-Distill-Qwen-7B. Readers who looked closely pointed out that this is not the same as reproducing the full R1 model or its training pipeline.
If you care about genuinely reproducible LLM training, treat Open-R1 as a partial artifact and compare it against projects like OLMo, Nemotron, and OpenThoughts that expose more of the stack. For strategy and budgeting, assume that any claim of "open" or "reproduced" still needs to be checked line by line, especially around datasets, validators, and training recipes.
- github.com