Claude Code is steganographically marking requests

AI
Developer Tools
Security
Privacy
Open Source

The post reverse engineers Claude Code and argues that the client is embedding low-visibility markers into requests. The apparent goal is to detect traffic coming through reseller gateways or environments that look linked to labs trying to collect outputs for distillation. The concrete signals discussed in the comments were not exotic. They included timezone checks, hostname and base URL pattern matching, and a built-in list of domains tied to Chinese companies, AI labs, and Claude resellers. The key point was not that Anthropic can see prompt traffic. Everyone already knows a cloud coding agent sees a lot. The point was that the client appears to be quietly shaping traffic for abuse detection in ways users were not plainly told about.

Most people reading it as a technical mechanism thought the intent was obvious and not very surprising. If a reseller forwards Claude traffic to Anthropic, hidden prompt markers let Anthropic spot which sessions came through that path even when the gateway strips ordinary telemetry. Several commenters said this kind of trap is weak against a sophisticated adversary but still useful because the game is not perfect prevention. It is forcing resellers and distillers to constantly diff releases, patch clients, and avoid accidental leaks. A cheap marker that survives for even a few days can still identify accounts, providers, or flows worth banning or watching. Where sentiment turned negative was trust. Claude Code runs on the user’s machine, reads repos, and executes commands. People kept coming back to the same standard for a tool with that level of access: the shipped client should be boring. Hidden fingerprinting crossed a line for them even if the payload here was low resolution. That concern was amplified by Anthropic’s recent reputation for silent or poorly signposted behavior changes, especially earlier complaints that some models could quietly degrade or sabotage responses for certain use cases. The result was a broader loss-of-confidence story, not just a narrow reaction to one steganography trick. A smaller but important thread pushed back on the privacy panic. Those commenters argued this was closer to anti-abuse telemetry than spyware, and that anyone sending code and prompts to a hosted coding agent has already accepted much larger privacy exposure. They saw the blog post as overstating harm unless there is proof Anthropic is using these markers to poison outputs or degrade legitimate users behind custom proxies. That unresolved question ended up being the practical hinge. People largely accepted that Anthropic wants to catch resellers and distillers. What they wanted next was evidence of what happens after a request gets tagged: account action, rate limits, weaker models, or nothing beyond analytics.

If you rely on closed-source coding agents, treat them like untrusted software with broad repo and shell access, not like a dumb client. Teams using proxies, custom harnesses, or sensitive code should audit what their agent sends and decide whether subscription convenience is worth giving the vendor this much hidden control.

June 30, 2026
thereallo.dev
Discuss on HN

Discussion mood

Mostly negative toward Anthropic. Even many people who accepted the anti-distillation rationale disliked the hidden client behavior, saw it as another trust-eroding surprise from Claude Code, and worried about silent degradation or future expansion of the tactic. A smaller camp thought the outrage was overheated because the signals appeared low-resolution and the product already requires sending far more sensitive data to Anthropic.

Key insights

Cheap markers still work in practice

Even a clumsy client-side marker can be useful because the goal is not to beat a determined lab forever. It is to catch unsophisticated resellers, create operational drag for serious actors, and exploit the fact that someone eventually forgets to patch a fresh release. That framing makes the sloppiness look less like incompetence and more like a fast-burn tripwire layered on top of heavier server-side detection.

Do not assume an easily reversed client check is pointless. If you run abuse prevention or anti-fraud systems, low-cost tripwires can pay off when they force continuous maintenance on the other side.

Attribution:

meowface #1 #2 #3
superfrank #1
hn_throwaway_99 #1

The hard part is keeping patches current

Once a vendor can require current versions, beating one fingerprint is not the problem. Beating every new fingerprint on every release is. That shifts the economics in the vendor's favor because distillers need a standing reverse-engineering loop, and any employee who updates without the patched binary can expose the whole operation.

When you evaluate whether a client-side defense is worth shipping, model the maintenance burden it creates, not just whether today's mechanism is technically bypassable. Repeated forced updates can turn weak checks into real leverage.

Attribution:

hhh #1
_alternator_ #1
SubiculumCode #1
charcircuit #1

The real risk is post-detection behavior

The highest-value unanswered question is not whether the marker exists. It is what Anthropic does with a tagged session. Several people connected this to earlier complaints about Claude Fable silently changing behavior and argued the serious failure mode would be covert output degradation, rate limiting, or model downgrades that are hard to prove and easy to misdiagnose as ordinary model variance.

If your business depends on a hosted model, monitor output quality and routing like a production dependency. Build canaries and comparison tests so you can detect silent service changes instead of arguing about them after the fact.

Attribution:

drdexebtjl #1
verdverm #1
bakugo #1
croemer #1
tgsovlerkhgsel #1

Open source helps, but provenance still matters

People reached for Codex CLI and other open tools as the obvious alternative, but one commenter sharpened that point. Open source lowers the chance of hidden client behavior only if users can trust the shipped binary actually corresponds to the repo. Signed releases and reproducible or at least auditable builds matter as much as source visibility.

If you want teams to trust a local agent, publish source and tighten the build story. Signed artifacts without reproducible builds are better than nothing, but they do not fully close the trust gap.

Attribution:

VortexLain #1
dannyw #1
__msh__ #1

Treat coding agents like hostile local software

Several comments broadened the lesson beyond Anthropic. A coding agent has enough access to become a supply chain and data exfiltration problem in its own right. The practical advice was to sandbox IDE plugins, LSP servers, and agent CLIs, run them under unprivileged accounts, and avoid leaking useful identity or environment metadata through usernames, hostnames, and unrestricted filesystem access.

Put coding agents in the same risk bucket as browser extensions and build tools. Use least privilege, isolate them from your main account, and assume anything visible to the process can end up in vendor telemetry.

Attribution:

codedokode #1
drnick1 #1
epistasis #1
puttycat #1

Custom harnesses are now mainstream fallback

A lot of practitioners no longer see the harness as proprietary magic. They described coding agents as mostly a system prompt, tool wiring, and a shell around file edits. That makes building or adopting your own harness a credible escape hatch, especially when cheaper models like GLM or DeepSeek are good enough for many workflows. The real thing Anthropic sells here is subsidized access to premium models, not an irreplaceable client.

Map how much of your dependency is the model versus the vendor's client. If the harness is the only lock-in, you likely have more negotiating power and migration options than you think.

Attribution:

wolttam #1
kolinko #1
andai #1 #2
echelon #1

Against the grain

This looks like narrow anti-abuse telemetry

The calmer read is that the blog post overstates the danger. The visible checks appear to capture very coarse signals tied to suspected proxying, not a rich fingerprint of the machine. From that view, comparing this to malware or broad surveillance muddies the distinction between targeted abuse detection and real privacy invasion.

Keep your threat model specific. Before overhauling tooling, separate evidence of limited anti-abuse logic from evidence of broader data harvesting or harmful downstream actions.

Attribution:

mrshadowgoose #1
jfreds #1
Terr_ #1
nomel #1

Cloud coding agents already break privacy purity

Another pushback was that anyone using a hosted coding assistant has already handed over code, prompts, and environment context to a third party. Hidden prompt markers may be distasteful, but they are not the first or largest privacy concession in this stack. For these commenters, the stronger critique is product trust and disclosure, not privacy in the absolute sense.

If privacy is the deciding issue, the clean answer is self-hosted or tightly controlled models. If you stay with hosted agents, focus your procurement and controls on transparency, retention, and auditability rather than assuming the client is the main exposure.

Attribution:

edude03 #1 #2
coolfox #1

Anthropic's worldview explains the tactic

A few commenters argued the behavior makes sense once you take Anthropic's AI safety and geopolitical claims at face value. If the company really believes frontier models are proto-superintelligence and that rival states must be denied access, then stealthy anti-distillation measures are not hypocrisy to them. They are standard containment policy. That does not make the tactic user-friendly, but it does make it predictable.

Do not judge frontier labs only by product docs. Their policy ideology will leak into product behavior, so treat governance posture as part of vendor selection.

Attribution:

solenoid0937 #1 #2 #3 #4

In plain english

CLI ↩

Command-line interface, a text-based way to use software from a terminal.

distillation ↩

In artificial intelligence, using the outputs of one model to train another model to imitate it.

IDE ↩

Integrated development environment, a software application that combines code editing, debugging, and other development tools.

LSP ↩

Language Server Protocol, a standard used by editors and IDEs to provide coding features like autocomplete and diagnostics.

steganography ↩

A way of hiding information inside another message so the hidden data is not obvious to the recipient or observer.

Reference links

Primary sources and reverse engineering

Anthropic GitHub issue response about Claude Code
Official response referenced in the discussion
List of domains found in Claude Code analysis
The extracted domain list at the center of the reverse-engineering claims
Archived copy of the blog post
Mirror of the original post for readers blocked by the source site

Prior Anthropic behavior and related debates

Anthropic engineering postmortem from April 23
Used as evidence that Claude Code changes and regressions have affected users before
Claude Fable and Mythos announcement
Referenced in debate over whether Anthropic disclosed model behavior changes clearly
Blog post arguing Claude Fable could sabotage competitor use cases
Brought in as prior evidence of silent or poorly surfaced model degradation concerns

Tools and alternatives

Codex CLI repository
Referenced as an open source alternative with more inspectable client behavior
Minimal Agent tutorial
Suggested starting point for building a personal coding-agent harness
Homunctor repository
Example open source personal harness shared in the comments
How to build an agent
Another practical guide for building a simple agent harness

Background and analogies

Underhanded C Contest
Referenced to frame the client behavior as a form of intentionally deceptive code
Trap street
Analogy for hidden markers used to trace copying or misuse
ChinaTalk on buying cheap Claude tokens in China
Explains the reseller market that many commenters believe Anthropic was targeting