Reviving Papers with Code

AI
Open Source
Developer Tools
Research

The post introduced a new Papers with Code site built at Hugging Face to restore what the original service did for AI research: connect papers to code, organize work by task and method, and surface benchmark results in one place. The current version covers AI domains only and already includes trending papers, task pages, methods, conference pages, citation counts, linked repositories, support for papers outside arXiv, and manually verified leaderboard entries that are first extracted with AI agents. The creator framed it as a practical revival of a resource that many people felt Meta had left to decay.

If you track AI research, this looks worth bookmarking again, especially for benchmark-heavy areas where “state of the art” claims are noisy. The product gap is now less about raw paper discovery and more about structured metadata, better search, alerts, and trustworthy result verification.

June 10, 2026
paperswithcode.co
Discuss on HN

Discussion mood

Mostly enthusiastic and relieved. People missed the original utility of Papers with Code, see a real need for structured benchmark tracking, and view Hugging Face as a more credible home for a community research index than Meta was. The main complaints were about missing product features and the generic AI-generated visual design, not the core idea.

Key insights

Why Papers with Code mattered operationally

The old service did more than collect links. It imposed structure on a chaotic AI research workflow by tying papers, code, tasks, and reproducibility together in a way that saved practitioners from rebuilding the map themselves. That context sharpens why its return matters now. The volume of AI output is higher, the tooling is better, and the need for a maintained shared index has not gone away.

Treat this category of product as research infrastructure, not content. If you build for technical teams, the durable value is in structured metadata and verification layers that cut decision time.

Attribution:

jeffreysmith #1

The next win is better metadata

What users are asking for is not more paper pages. They want dataset-aware filtering, benchmark selection, semantic search, related-paper discovery, and notifications that fit an actual research workflow. That shifts the product challenge from scraping papers to building a usable decision surface over the literature, where the hard part is linking claims to the exact datasets, tasks, and eval setups people care about.

If you work on research discovery, prioritize normalized metadata over homepage ranking tweaks. Dataset, benchmark, and eval-schema extraction will likely create more user value than adding more raw coverage.

Attribution:

somethingsome #1
addandsubtract #1
caldarons #1
steinvakt2 #1

Reproducibility breaks outside idealized benchmarks

Calls to reject papers without code ran into a harder reality in medicine and other expensive-data fields, where the data itself is closed and results cannot be independently checked even after peer review. That exposes a limit of the whole Papers with Code model. Code availability helps, but it does not solve verification when the underlying dataset is inaccessible.

Do not treat “has code” as a full reproducibility signal. In any evaluation or procurement workflow, separate code availability from data access and independent replication.

Attribution:

nicce #1
lalaland1125 #1

Researchers want completeness, not just trending

The RSS discussion exposed a split between discovery for what is hot and discovery for field coverage. For some users, a trending page is useful. For others, especially people trying to map a domain, the key need is a complete feed of papers with code in a category, even with some false positives. That is a different product from a ranked homepage.

Offer both ranked discovery and exhaustive feeds if you build research tools. Teams monitoring a space for strategy or diligence need full coverage, not only popularity signals.

Attribution:

marcindulak #1 #2

Against the grain

The name suggests a broader science catalog

Several readers bounced off the AI-only scope because “Papers with Code” sounds like a general index of reproducible research, not a vertical AI property. That criticism changes the framing of the launch. For people outside ML, the product feels narrower than the brand promise, even if the implementation is good.

If this stays AI-focused, make that obvious in navigation and positioning. Otherwise users from other fields will keep arriving with the wrong expectations and leaving disappointed.

Attribution:

cyril_st_john #1
Sharlin #1

The design looks like generic AI product slop

The harshest pushback was not about functionality but about aesthetics. The site’s visual style read to some people as the same interchangeable AI-generated startup design now showing up everywhere, which undermines trust for a product that is supposed to feel like durable research infrastructure.

For developer and research audiences, visual polish is less important than a distinctive, stable interface. If the product is meant to become a daily tool, generic marketing-site cues can work against credibility.

Attribution:

imadr #1
Zopieux #1

In plain english

AI ↩

Artificial intelligence, software systems that perform tasks such as generating text, code, images, or decisions.

arXiv ↩

A public online repository where researchers post preprints of papers before or alongside formal publication.

Atom ↩

A web feed format similar to RSS that lets users subscribe to site updates.

Hugging Face ↩

A company and open source platform known for tools, models, datasets, and community infrastructure for machine learning and AI research.

LLM ↩

Large language model, a type of AI trained on huge amounts of text that can generate and edit language and code.

Meta ↩

The parent company of Facebook that also does major AI research and product development.

ML ↩

Machine learning, a branch of artificial intelligence where systems learn patterns from data instead of being programmed with fixed rules.

Papers with Code ↩

A website that links research papers to their source code and organizes them by tasks, methods, and benchmark results.

RSS ↩

Really Simple Syndication, a web feed format used to subscribe to updates from websites.

SOTA ↩

State of the art, meaning the best-performing system currently available on a given benchmark or task.

Reference links

Project and product links

Revived Papers with Code
The new site being launched and discussed.
Hugging Face Trending Papers
Referenced as a related Hugging Face paper discovery page and possible overlap with the new site.

Feature requests and adjacent tools

Papers with Code data issue requesting RSS or Atom
Used to show that feed support was requested before and disappeared with the original service.
Code Available Feed
An independent Atom feed of arXiv papers with code offered as an alternative for complete category monitoring.
Turing project on GitHub
Shared as an unfinished attempt at a broader reproducible paper index beyond AI.
planetlambert GitHub profile
Linked as a better entry point to related repositories and similar sites.

Reading and commentary on papers

The Morning Paper
Named as the remembered blog that distilled one technical paper at a time for practitioners.
Two Minute Papers YouTube channel
Mentioned as another paper-summary format, though with criticism that it changed over time.

Design criticism references

Performative UI satire
Shared to mock the generic AI product design style some people felt the site used.
Related Hacker News design discussion
Linked as another discussion about the same generic AI web design aesthetic.