I found 10k GitHub repositories distributing Trojan malware

Security
Open Source
Developer Tools
AI

The post lays out a large malware campaign on GitHub. Attackers clone or mimic real repositories, tweak names and README text for search ranking, then send users to external ZIP archives that contain Trojan malware. The repos are kept artificially fresh by deleting and re-pushing commits, which helps them float to the top in GitHub and web search results. Several people said they had seen their own projects copied this way, or had reported near-identical repos over the past year, which makes the 10,000-repo count feel less like a freak find and more like one visible slice of an industrialized operation.

Treat GitHub discovery as untrusted distribution, not as a safety signal. If your team installs tools from repos found through search, add isolation, provenance checks, and malware scanning before anything touches a developer workstation or secrets.

June 18, 2026
orchidfiles.com
Discuss on HN

Key insights

Malware is spilling into repo-indexing sites

The campaign is not confined to GitHub search results. Copycat projects are also showing up on sites like decision.ai, Lobehub, and MCP Market under real developers’ names, which extends the abuse into the new layer of AI tool and plugin directories. The browser-verification interstitials on some of those sites make the whole flow look even more like a social engineering funnel than a simple mirror problem.

If your product surfaces GitHub projects, plugins, or MCP tools, assume your index can become part of the infection path. Add ownership verification and aggressive abuse review before you let third-party project pages inherit trust from a real maintainer name.

Attribution:

Jimmc414 #1
schrodinger #1

The malware family looks trackable

One sample was matched through Genus Codes and VirusTotal to the Disco Trojan family, which suggests this campaign is not just random junk dropped into ZIP files. There are enough recurring artifacts that outside defenders can cluster samples and build detections, and maintainers can strengthen authenticity with public identity proofs like Keyoxide even if that does not stop every victim.

Security teams should collect and cluster samples instead of treating each repo as an isolated incident. Maintainers who publish software directly should add verifiable identity links from their domain or other controlled accounts so users have a second way to confirm provenance.

Attribution:

jp0001 #1
beej71 #1

Simple heuristics probably catch a lot

Fresh accounts, star bursts from equally fresh accounts, repeated repo recreation, and obvious README SEO stuffing are all signals people were already using by hand to spot bad repos. The sharp point here is not that perfect detection exists. It is that a large chunk of this abuse looks cheap to triage and expensive only at the review stage, yet the current system seems optimized to avoid signup friction rather than stop bad distribution early.

If you run a code marketplace, package index, or internal artifact portal, start with crude heuristics and human review instead of waiting for a perfect classifier. Blocking the low-effort flood will do more than polishing post-abuse reporting flows.

Attribution:

mustaphah #1 #2 #3
gleenn #1
pixl97 #1

Sandboxing only helps before trust is granted

Flatpak or similar isolation can reduce damage, but only if the malware runs inside a real sandbox and the user does not punch holes through it with broad permissions or sudo-driven setup steps. Once a machine is already compromised, hardware secrets and password tools still leave exposed browser sessions and cookies in reach. The practical line is that desktop sandboxing helps with casual malware, but it is not a substitute for separate environments when you run untrusted code.

Use disposable VMs or throwaway machines for unknown repos and installers. Keep desktop sandboxes turned on, but do not count on them to protect the same browser session and secrets you use for finance, email, or production access.

Attribution:

embedding-shape #1 #2 #3
criddell #1
Gigachad #1

Fake technical tests are a live delivery channel

The same malware pattern is showing up in developer hiring scams. A recruiter sends an attractive remote job, fast-tracks the candidate, then hands over a codebase or dependency set that infects the machine during review or install. One commenter said they now detonate these tests in cloned VMs because infection on first run is common enough to expect.

Warn engineering and recruiting teams that take-home assignments and unsolicited code reviews are now a malware vector. Route any external code exercise through disposable infrastructure and make that policy explicit before candidates or employees touch it locally.

Attribution:

ForOldHack #1 #2

Against the grain

GitHub may be choosing speed over friction

The more credible defense of GitHub was not that it is ignoring malware, but that it removes reported repos while refusing to make account creation and onboarding much harder. That reframes the failure as an incentive decision. Abuse prevention is competing with growth and ease of use, and ease of use appears to be winning.

If you depend on a large platform to police malware for you, check whether its business incentives line up with your risk tolerance. For high-trust workflows, build your own gates instead of assuming the platform will absorb the tradeoff.

Attribution:

mustaphah #1
pixl97 #1

Open source is not the thing failing

A few people pushed back on the claim that this story disproves open source safety. The attack worked because GitHub and search engines were used as trust shortcuts for binaries, scripts, and external downloads, not because source visibility itself made malware easier. In that framing, openness is how the abuse was discovered at all. The broken piece is distribution hygiene and platform enforcement.

Do not turn this into a blanket argument against open source in procurement or policy. Focus controls on provenance, release verification, and execution paths, which is where this campaign actually wins.

Attribution:

Yokohiii #1 #2
toofy #1

In plain english

Disco Trojan ↩

A named malware family that security tools use to group related malicious samples with similar behavior.

Flatpak ↩

A Linux app packaging and sandboxing system that can isolate desktop applications from the rest of the system.

Keyoxide ↩

A service that helps people publish and verify links between their cryptographic identities, domains, and social accounts.

MCP ↩

Model Context Protocol, a way for AI models and tools to connect to external resources and actions through a standard interface.

README ↩

A top-level project document that explains what a software project is, how to use it, and how to work with it.

SEO ↩

Search engine optimization, the practice of shaping web content to rank higher in search results.

TOTP ↩

Time-Based One-Time Password, a rotating login code generated by an authenticator app or hardware token.

VirusTotal ↩

A service that scans files and URLs with many security tools and reports whether any of them detect malware.

Reference links

Related incident reports

Someone used my repo to distribute malware
Another firsthand write-up of a repository being copied and used as a malware lure.
GitHub scam investigation thousands of mods and cracks stealing your data
A prior large-scale investigation into similar GitHub-hosted malware campaigns.
OpenSnitch discussion on a similar attack
Earlier report describing linked malicious accounts and repo networks on GitHub.

Malware analysis and detection

Genus Codes malware analysis result
Shared as an analysis of one sample from the campaign.
VirusTotal sample result
Shared to show external malware detection results for a related sample.
Warning to developers about technical test scams
Explains how malware is being delivered through fake recruiting assignments.
Analysis of a malicious technical test
Detailed reverse engineering of a scam codebase mentioned as another example of this delivery method.

Platform defenses and tooling

GitHub Security Lab scanning framework post
Cited as evidence that GitHub already has scanning tooling, though people argued it is noisy or insufficient for this abuse class.
gh-trends
Project used by one commenter who said they repeatedly encountered malware repos in trend data.
GitHub trends growth-based approach
Background on the trend-detection logic used to spot suspicious recency patterns.

Authentication and trust references

Ente Auth
Suggested as a separate authenticator app for keeping TOTP outside a password manager.
The Cathedral and the Bazaar
Referenced in the debate about whether open source visibility should be treated as a safety signal.
Linus's Law
Linked to clarify the classic claim that enough reviewers make bugs easier to find, which people argued is often misapplied to malware trust.

I found 10k GitHub repositories distributing Trojan malware

Discussion mood

Key insights

Against the grain

In plain english

Reference links

Related incident reports

Malware analysis and detection

Platform defenses and tooling

Authentication and trust references