7 June 2026 · 7 min read

The hidden cost of AI detection false positives

Teacher checking and marking student test papers — Photo by Andy Barbour via Pexels

The public conversation about AI detection in schools is almost entirely about the problem of students who cheat and go undetected. The other side of the equation gets far less attention: the students who are flagged when they shouldn't be, and what happens to them when they are.

What a false positive actually means

In statistical terms, a false positive is a case where the test returns a positive result but the underlying condition is absent - in this context, a student who is flagged as having used AI but who didn't. This can happen for several reasons: the student writes in formal, careful language; they are a non-native English speaker; their writing style happens to share statistical characteristics with AI-generated text.

In practical terms, a false positive can mean an accusation of academic misconduct. Depending on institutional policy, this might involve a formal interview, a mark reduction, a permanent record, or in severe cases suspension or expulsion. For international students whose visa status is tied to academic standing, even an unresolved misconduct allegation can have serious consequences. These are not minor inconveniences.

Who gets flagged most often

Research into AI detector accuracy consistently identifies the same groups as most likely to receive false positives. Non-native English speakers - including international students, EAL students, and heritage language speakers writing in their non-dominant language - are flagged at significantly higher rates than native speakers. Students who write carefully and formally, adopting academic conventions, produce text patterns that detectors associate with AI generation.

Students with certain cognitive profiles may also be affected. Those who write with unusual regularity or predictability - including some students with autism or other conditions that affect writing style - may produce text that reads as statistically AI-like without any AI involvement. The detector has no mechanism to account for this variation in human writing.

The research and its implications

A 2023 Stanford study tested seven commonly used AI detectors against essays written by non-native English speakers and found these tools flagged non-native writing as AI-generated at rates reaching 61%. The same texts written by native speakers were flagged far less often. Subsequent studies have replicated the finding across different tools, different language backgrounds, and different proficiency levels.

This isn't a fringe concern. In many UK schools and universities, international and EAL students form a substantial proportion of the student body. A detection method that systematically flags this group at disproportionate rates isn't a neutral tool applied equitably - it's a tool that creates discriminatory outcomes, regardless of intent. Several higher education institutions in the UK and US have suspended or discontinued text-based AI detectors as a direct response to this evidence.

The legal and institutional risk

In the UK, the Equality Act 2010 places a public sector equality duty on schools and universities requiring them to consider actively whether their policies produce equitable outcomes across protected characteristics. A detection tool that produces disparate outcomes for international students creates exactly the kind of indirect discrimination the duty is designed to prevent.

Several higher education institutions in the UK and US have suspended or discontinued text-based AI detectors entirely after reviewing the ESL false positive evidence. The practical risk of a high-profile false accusation is often greater than the risk of a missed detection.

Beyond formal legal exposure, there are reputational and relational risks. Schools that have publicly accused students of AI misconduct and later found the evidence was weak have faced significant reputational damage.

**Equity Act exposure** — disparate impact on international and EAL students may constitute indirect discrimination under the Equality Act 2010 (UK) or Title VI (US)
**Formal misconduct risk** — a false accusation that reaches formal proceedings can affect a student's academic record and, for international students, visa status
**Reputational damage** — high-profile wrongful accusations have caused significant institutional reputational harm when made public
**Teacher-student trust** — an unjust accusation is one of the most damaging things that can happen to that relationship, regardless of formal outcome

Designing for fairness

The most effective way to reduce false positive risk is to change the detection method. Text-based analysis produces ESL false positives because it evaluates writing style, and writing style correlates with language background. Process-based detection doesn't evaluate writing style at all - it evaluates writing behaviour, which doesn't correlate with language background in the same way.

The second most effective intervention is procedural: treating any detection signal as the beginning of an enquiry, not the end of one. No automated tool - text-based or process-based - should be the sole basis for a formal misconduct finding. A conversation with the student that addresses their specific process, combined with documented evidence, is substantially fairer and more reliable than any tool used alone.

Try Learnaway with your next homework

Set an assignment free Live demo

Why AI detectors flag ESL students as cheaters - and how teachers can avoid it
Text-based AI detectors are systematically biased against non-native English writers. Here's the research, the legal risk, and a fairer detection approach.
How to talk to a student you suspect used AI on an assignment
Accusations can backfire badly. Here's how to turn AI suspicion into a fair, evidence-based conversation - and when to leave it at that.
The best AI detectors for teachers in 2026: an honest guide
The landscape of AI detection tools is fast-moving and full of conflicting claims. This guide cuts through to help teachers make an informed choice for their classroom.