HN Debrief: The signal in the discussion

AI Agent Guidelines for CS336 at Stanford

AI
Education
Developer Tools
Programming

The post links to Stanford CS336's agent instructions for assignment repos. The file tells tools like Claude Code to behave as learning assistants rather than solution generators. It says they should explain, ask guiding questions, avoid writing full answers, and avoid taking actions like directly running commands on the student's behalf. In practice, this is a repo-level prompt that tries to turn an AI coding harness from an automation tool into a tutor.

Most people treated it as a sensible first step, not a complete solution. The positive read was that universities have already lost the option of pretending students will not use coding agents. So the useful move is to model good use explicitly, put the guidance where tools can read it, and make AI part of the course contract instead of forcing everyone into fake prohibition. Several commenters said this is the first approach they have seen that acknowledges how software work is actually done now while still trying to preserve learning.

The skepticism was blunt. A markdown file is easy to edit, bypass, or ignore. If a student wants answers, another model or another tool will happily give them. That pushed the conversation toward assessment rather than prompting. Repeatedly, people landed on oral exams, in-person tests, and other controlled evaluations as the only credible way to tell whether a student actually learned the material. The file can signal intent and shape behavior for honest students. It cannot serve as enforcement.

A second theme was that the hardest educational problem is not classic cheating but false fluency. AI can make students feel productive while putting them in a passive role. Professors and learners in the comments described a real drop in help-session attendance and a pattern where weaker students seem to think they understand because the assistant is so smooth and accommodating. That is why some defended restrictions like "don't run bash commands". The point is not shell purism. It is making students hit the error, inspect it, and build the mental model instead of watching an agent silently recover.

There was also a tooling subthread. The Stanford files are duplicated as both CLAUDE.md and AGENTS.md because different coding harnesses look for different filenames. Commenters called out that this guidance builds on Carson Gross's earlier agent.md gist, and some saw the CLAUDE.md convention itself as product branding more than an open standard.

Even so, the underlying pattern got broad approval. Repo-local agent instructions are becoming a real interface layer. Education may end up being one of the clearest places where that interface gets formalized first.
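Because the same guidance is shipped under two filenames, the copies can drift apart. One way a course repo might guard against that is a small consistency check. This is a minimal sketch, not something the Stanford repo is known to include: the two filenames come from the discussion, while the function names and the idea of running this as a pre-commit or CI step are assumptions made for illustration.

```python
import hashlib
from pathlib import Path

# Filenames are the two mentioned in the discussion.
# Everything else here (helper names, usage) is hypothetical.
AGENT_FILES = ("CLAUDE.md", "AGENTS.md")


def file_digest(path: Path) -> str:
    """Return the SHA-256 hex digest of a file's bytes."""
    return hashlib.sha256(path.read_bytes()).hexdigest()


def instruction_files_in_sync(repo_root: Path, names=AGENT_FILES) -> bool:
    """True only when every listed file exists and all copies are byte-identical."""
    paths = [repo_root / name for name in names]
    if not all(p.is_file() for p in paths):
        return False
    return len({file_digest(p) for p in paths}) == 1


if __name__ == "__main__":
    ok = instruction_files_in_sync(Path("."))
    print("agent instruction files in sync" if ok else "agent instruction files differ or are missing")
```

A check like this only keeps the copies identical. It does nothing to stop a student from editing both files, which is the limitation the skeptics raised.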

The interesting move is not the prompt file itself but the shift toward embedding AI usage policy directly into repos and tooling. That could become standard in education and eventually enterprise training, even if real enforcement still has to happen through assessment design.

26 May, 2026
github.com

Discussion mood

Cautiously positive about the intent, but unconvinced that prompt files alone can protect learning. People liked the attempt to treat AI as a tool to be guided rather than banned, yet kept returning to the same point: only assessment design and student incentives determine whether this works.

Key insights

01 Prompt files are weaker than the environment they run in.
People using agents in production said long system prompts are often less important than the user's task, tool feedback, and hard hooks that force behavior. If you truly need transcript capture, restricted actions, or other guarantees, implement them in the harness instead of hoping the model keeps obeying prose. That reframes Stanford's file as orientation, not control.
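The difference between asking a model to behave and enforcing behavior in the harness can be sketched in a few lines. The example below is illustrative only: `TutorHarness`, the tool names, and the refusal message are all hypothetical and do not correspond to any real coding agent's API. It shows the general shape of a hard hook, where blocked actions never execute and every attempt lands in a transcript regardless of what the prompt says.

```python
import json
from dataclasses import dataclass, field
from typing import Callable

# Hypothetical tool names; a real harness defines its own tool registry.
BLOCKED_TOOLS = {"run_shell", "write_file"}


@dataclass
class TutorHarness:
    """Wraps tool dispatch so policy is enforced in code, not in prompt prose."""

    tools: dict[str, Callable[..., str]]
    transcript: list[dict] = field(default_factory=list)

    def call_tool(self, name: str, **kwargs) -> str:
        allowed = name in self.tools and name not in BLOCKED_TOOLS
        if allowed:
            result = self.tools[name](**kwargs)
        else:
            # The tool is never invoked; the model only sees this refusal.
            result = f"Tool '{name}' is disabled; ask the student to do this step."
        # Every attempt is recorded, including refused ones.
        self.transcript.append(
            {"tool": name, "args": kwargs, "allowed": allowed, "result": result}
        )
        return result

    def dump_transcript(self) -> str:
        """Serialize the captured tool calls as JSON."""
        return json.dumps(self.transcript, indent=2)


if __name__ == "__main__":
    harness = TutorHarness(
        tools={
            "read_file": lambda path: f"(contents of {path})",
            "run_shell": lambda cmd: f"(ran {cmd})",
        }
    )
    harness.call_tool("read_file", path="README.md")
    harness.call_tool("run_shell", cmd="make test")  # refused by the hook
    print(harness.dump_transcript())
```

The design choice is that the refusal lives in `call_tool`, so no amount of prompt editing by the student or drift by the model changes which actions can run.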

Treat AGENTS.md and CLAUDE.md as a bootloader. The real policy surface is the harness, the tools, and the evaluation setup.
- wrs #1
- bob1029 #1
- weird-eye-issue #1

02 The bigger risk is false confidence, not just plagiarism.
A physics professor reported that help-session attendance collapsed and weaker students' grades fell hardest, which fits the claim that AI tutors can make students feel like they are practicing when they are mostly consuming polished explanations. Smooth answers can suppress the productive struggle that actually builds recall and transfer.

AI can hide learning failure behind a feeling of fluency. The students most in need of friction may be the first to lose it.
- hibikir #1
- MengerSponge #1 #2

03 The ban on agent-run shell commands is pedagogical, not arbitrary.
Commenters unpacked the example of a server failing because a port is busy. An agent will often diagnose, kill the process, and rerun everything before the student even notices the problem. Making the student execute commands keeps the error visible and turns debugging into part of the lesson.
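For readers who have not hit the busy-port failure themselves, it can be reproduced with the standard library alone. This is a minimal sketch, assuming a TCP server on localhost; `try_bind` is a hypothetical helper, not code from the course. It demonstrates the error a student would see, and have to interpret, if the agent did not clean it up first.

```python
import errno
import socket


def try_bind(port: int, host: str = "127.0.0.1") -> socket.socket:
    """Bind a TCP listening socket, raising a readable error if the port is taken."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    try:
        sock.bind((host, port))
        sock.listen()
    except OSError as exc:
        sock.close()
        if exc.errno == errno.EADDRINUSE:
            raise RuntimeError(
                f"Port {port} is already in use; find and stop the other process."
            ) from exc
        raise
    return sock


if __name__ == "__main__":
    first = try_bind(0)  # port 0 asks the OS for any free port
    port = first.getsockname()[1]
    try:
        try_bind(port)  # second bind to the same port fails
    except RuntimeError as err:
        print(err)
    finally:
        first.close()
```

Finding which process holds the port and deciding whether to stop it is the step the guidelines want the student to perform by hand.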

Manual execution preserves the feedback loop. If the agent cleans up every mess, the student misses the mechanism.
- ohrus #1
- alfonsodev #1
- charlie90 #1

04 This is already becoming a reusable pattern for courseware.
Commenters confirmed Stanford borrowed from Carson Gross's earlier gist, packaged the policy into repo files that coding harnesses can ingest automatically, and sparked immediate interest from other instructors planning similar files. That makes the story less about one class policy and more about a new educational artifact. Courses may start shipping an agent contract alongside the syllabus and starter code.

AI policy is moving from PDF rules into machine-readable repo defaults. That is a durable shift even if the first versions are rough.
- philipportner #1
- brunborg #1
- ohmahjong #1
- abahgat #1

Against the grain

01 Restricting the agent may be the wrong abstraction.
Some argued universities should allow full AI use and put all responsibility on students to learn, then verify understanding separately. In that view, guided prompts infantilize capable students and blunt a legitimate way to learn by studying, reverse-engineering, and extending generated solutions.

A tutor-only policy may optimize for preventing abuse at the cost of limiting high-agency learners.
- baddash #1
- jltsiren #1

02 Soft guidance looks naive when incentives are this distorted.
Critics argued that elite universities should stop pretending honor-based norms will hold when students face credential pressure and already have abundant cheating channels. Their alternative was sharper separation. Either design modules around audited AI-heavy work, or go back to tightly controlled non-AI assessment.

Middle-ground policies can become expensive theater. If institutions want credibility, they may need cleaner splits between AI-allowed and AI-banned work.
- gaiagraphia #1
- chalupa-supreme #1
- djeastm #1

03 Punishing 'over-reliance' is harder to defend than it sounds.
One commenter pushed back on the idea that using AI too much should affect grades later, comparing it to other external study aids. That exposes a governance problem. Once AI is allowed in some form, instructors need a clear theory of what counts as misuse beyond a vibe that the student leaned on it too hard.

If AI is permitted, grading policy has to define unacceptable dependency precisely. Otherwise enforcement will feel arbitrary.
- j_french #1
- aaaronic #1
- knollimar #1

Reference links

Stanford course materials

Stanford CS336 course site
Referenced as the course website that mentions the agent guidelines and attribution to the earlier gist.

Agent instruction patterns and tooling

Carson Gross agent.md gist
The earlier agent instruction file that Stanford's version was said to build on.

Education and AI commentary

New York Times opinion piece on AI and cheating at Stanford
Quoted to argue that honor-code assumptions may no longer match actual student behavior.
Charles Petzold, Does Visual Studio Rot The Mind?
Used as a historical analogy for debates about abstractions and generated code weakening understanding.

Related discussion and references

CS336: Language Modeling from Scratch on Hacker News
Linked as related context about the course itself.
Know Your Meme: Chat, Is This Real?
Shared during a side discussion about students using 'chat' to refer to LLMs or an audience.
YouTube clip referenced as a joke about 'guidelines'
Posted as a humorous reaction to the idea that guidelines are not strict rules.