Professor denounces mass AI fraud on an exam at Brown

AI
Education
Hiring
Developer Tools

The article centers on a Brown economics professor who says a take-home, closed-book midterm was swamped by AI use. The signal was hard to miss. Enrollment jumped after the easier format was announced, the midterm average hit 96 with 40 perfect scores, then the in-person final average fell to 48 and many students who had aced the midterm either failed badly or did not show up. The professor’s bigger complaint was not just cheating but institutional drift. He says senior leadership largely left faculty to deal with it, which many readers saw as the more important story.

The strongest consensus was that this is less an “AI suddenly broke education” story than a collision between new tools and already fragile assessment practices. Plenty of people said take-home, closed-book exams at elite schools had always relied on a culture of trust that was already weakening before LLMs. AI changed the scale, speed, and accessibility of cheating. It turned what used to require a friend, paid help, or serious effort into a one-window workflow. That is why many commenters thought the old honor-code model is done for foundational courses, especially in large classes. Where the discussion landed was practical. For core skills, universities are drifting back to proctored environments, either pen-and-paper or locked-down computer labs with restricted networks and standard software. Several professors and students described this as routine already, not futuristic. Others argued the better redesign is not “ban computers” but “verify ownership”: oral defenses, short interviews about submitted work, structured presentations, frequent in-person quizzes, and project reviews where students must explain choices under questioning. That keeps AI from substituting for understanding even when it helps produce output. A second thread went past exam mechanics to incentives. Many commenters were blunt that students are not primarily optimizing for learning. They are optimizing for grades, credential value, graduate school, finance or consulting jobs, and avoiding the financial cost of failure. In curved or competitive environments, widespread cheating creates a prisoner’s-dilemma dynamic where honest students feel punished for staying honest. That did not excuse cheating, but it did explain why appeals to integrity alone look weak when universities inflate grades, treat students like customers, and avoid enforcement that could hurt revenue or reputation. The result is a narrower, more realistic view of what higher education can still certify. Closed, supervised assessments can still verify fundamentals. Open-ended assignments can still be useful, but only if schools stop pretending the final artifact proves independent work. Once AI can generate plausible essays, code, and explanations, the proof has to come from process, interaction, and demonstrated command under constraints.

If you hire new grads or run training programs, assume transcripts now reveal less about actual ability unless the institution can explain how it authenticates work. For educators, the practical move is to split assessment into two buckets now: closed, identity-verified checks for fundamentals and separate AI-allowed work that explicitly tests tool use and judgment.

June 29, 2026
english.elpais.com
Discuss on HN

Key insights

Verify ownership instead of authorship

The workable shift is from asking whether a student personally typed every line to asking whether they can explain, defend, and modify what they submitted. That is why some CS instructors are adding one-on-one check-ins after assignments and asking pointed questions about libraries, design choices, failed approaches, and code behavior. This does not eliminate AI assistance. It makes unsupported outsourcing much harder to hide, which is the part that matters for certifying competence.

If your team reviews candidate projects, add a short defense step and ask for concrete reasoning about tradeoffs and implementation details. In courses or internal training, treat submitted work as a starting point for verification, not the proof itself.

Attribution:

bkallus #1
Cthulhu_ #1
liendolucas #1

Locked-down labs already exist

The infrastructure answer is not hypothetical. Multiple people described computer-based testing facilities that boot into restricted environments, allow only the exam platform, and support subjects like programming and security where paper is a poor fit. The constraint is not technical possibility. It is space, staffing, maintenance, and institutional will. That makes this less a software problem than a resource allocation problem.

If you run an institution or training program, stop treating secure digital assessment as a moonshot and budget for it like any other lab function. If you cannot support it at scale, reserve it for the courses where paper distorts what you are trying to measure.

Attribution:

nneonneo #1
StrauXX #1
robertlagrant #1
recursivedoubts #1 #2

The artifact is no longer credible evidence

Once AI can produce polished code, reports, and essays, the final file stops being trustworthy as evidence of learning. Several commenters pointed to alternatives that restore signal without fully abandoning digital work, including video walkthroughs, live Q&A, and oral or whiteboard defenses. The key change is that the evaluator watches the student think through the work, not just consume the finished product.

When evaluating external candidates, ask for a walkthrough or live critique of past work instead of judging the document alone. If you rely on portfolio reviews, expect more polished fraud and adjust the process now.

Attribution:

aabajian #1
tpoacher #1
cm2187 #1

Curved grading turns cheating into a prisoner’s dilemma

The strongest incentive argument was not that cheating is morally fine, but that relative grading makes abstaining costly when others cheat. In a curved class, one student’s fraudulent A can become another student’s honest B or worse. That changes cheating from private misconduct into a competitive arms race. Several people argued that absolute standards are easier to defend because they remove some of the pressure to defect.

If you control evaluation systems, avoid curves where cheating risk is high. In hiring and promotion, be careful with forced ranking when measurement quality is degraded. It pushes people toward gaming instead of skill building.

Attribution:

pants2 #1
haritha-j #1
Yossarrian22 #1
odyssey7 #1

University leadership incentives are misaligned

Many readers thought the silence from Brown’s administration was entirely believable because institutions have been drifting toward customer-service logic for years. Strict enforcement creates angry students, parents, donors, and worse completion metrics. That makes academic integrity a cost center unless senior leadership is willing to absorb the pain. The complaint was not that faculty lack ideas. It was that they are being asked to police a structural problem alone.

When a credential issuer cannot explain how it handles fraud, discount the credential more heavily. Inside organizations, do not hand frontline managers a fraud problem without backing, budget, and clear escalation rules.

Attribution:

cycomanic #1
bonoboTP #1
intended #1

Ungraded or low-stakes homework may regain value

A useful adaptation is to stop pretending homework proves independent mastery and instead make it practice. One commenter described giving full credit for on-time completion while moving the real accountability into in-class handwritten exams. Another noted that once homework no longer determines rank, the incentive to use an LLM drops and the assignment can return to its original job of rehearsal and feedback.

Separate practice from certification in your own training systems. Use low-stakes exercises to build fluency, then verify competence in a setting where identity and independent performance are clear.

Attribution:

lmc #1
nneonneo #1
jdshaffer #1

Against the grain

AI use exposes an older pedagogy failure

One contrarian line held that the cheating panic is mixing two issues. AI is real, but it is landing on an education system that already overtrained students to solve narrow, preformatted problems and chase memorized test logic. From that view, the deeper risk is producing graduates who freeze when a problem is underspecified and then outsource the missing thinking to a model. AI did not create that habit. It amplified it.

When you design coursework or interviews, include messier problems with incomplete instructions and watch how people frame the task. That will tell you more about judgment than another polished output ever will.

Attribution:

Royce-CMR #1
tedd4u #1

Presentations can scale better than full oral exams

While many people dismissed one-on-one assessment as too expensive, some argued that short presentations with Q&A hit a useful middle ground. They preserve the ability to probe understanding, tolerate AI-assisted drafting, and can be graded in coarse bands rather than fake precision. That is still slower than batch grading. It is much more scalable than a dissertation-style oral exam for every student.

If you need more authentication but cannot afford full interviews, pilot brief recorded or live defenses for selected assignments. You may recover most of the signal with far less staffing cost.

Attribution:

parpfish #1
JoshTriplett #1 #2

The article overstates the certainty of guilt

A minority pushed back on the article’s level of certainty. They argued that high scores, answer similarity, and later collapse on an in-person final are strong indicators, but still circumstantial if the school lacks direct evidence for each student. That matters because false accusations in an AI panic can be career-changing, and commenters were wary of universities overreaching with shaky proof or unreliable detection tools.

Treat abrupt performance shifts as a trigger for secondary verification, not as a complete evidence trail by themselves. In any anti-fraud system, build a human review and appeal path before attaching severe penalties.

Attribution:

meerita #1
tty456 #1

In plain english

AI ↩

Artificial intelligence, software systems that generate or analyze content in ways that mimic tasks usually associated with human intelligence.

LLM ↩

Large language model, an AI system trained on large amounts of text that can generate and transform language and code.

Reference links

Assessment and exam infrastructure

Universities and AI
A professor’s essay on redesigning university courses and assessments for the AI era that anchored much of the practical discussion.
University of Illinois Computer-Based Testing Facility
Example of a scaled institutional testing center used to support frequent supervised digital assessments.
ICPC
Referenced as a real-world model for restricted-network computer-based technical testing.

Alternative teaching and anti-cheating approaches

Exam with chatbots
Shared as a perspective on designing exams around AI rather than pretending it is absent.
Creating Meaningful Homework Assignments in the Age of AI and LLMs
A professor’s write-up of tactics for preserving homework as practice despite easy AI assistance.
Deschooling Society
Invoked in a broader critique of schooling centered on testing and certification rather than learning.

Cases and reporting on cheating

Cheating at Purdue University
Cited as a similar recent case of widespread AI-related cheating in a university course.
USMLE cheating scandal report
Used as an example that even heavily proctored credentialing systems can be gamed through long-horizon question harvesting.

Honor codes and grade inflation

Honor system at the University of Virginia
Referenced to show how even strict honor-code systems have softened sanctions over time.
Harvard admits that grades have lost their meaning
Brought in as evidence for grade inflation weakening the value of transcripts.
Reed College grade inflation data
Used to illustrate a school that resists grade inflation and de-emphasizes grades.

Writing by hand and related research

Correlation between handwriting, drawing skills and dental skills of junior dental students
Shared in support of claims that handwriting practice may connect to fine motor skill development.
ResearchGate copy of the handwriting and dental skills study
Posted so readers could inspect the study directly and critique its strength.
Frontiers study on cursive and handwriting
Used to argue that handwriting matters more than cursive specifically and that the evidence for cursive is modest.

Technology and products mentioned

LEAP Low-Bandwidth Educational Access Platform
Mentioned as a potentially useful low-cost educational computing setup.
LEAP GitHub repository
The implementation repository for the LEAP platform referenced in the same suggestion.
AP report on a professor using typewriters to curb AI use
Cited as an example of instructors reverting to older tools to reduce AI-assisted work.

Professor denounces mass AI fraud on an exam at Brown

Discussion mood

Key insights

Against the grain

In plain english

Reference links

Assessment and exam infrastructure

Alternative teaching and anti-cheating approaches

Cases and reporting on cheating

Honor codes and grade inflation

Writing by hand and related research

Technology and products mentioned