What It Means
A measurement of how "surprised" a language model is by a piece of text. Low perplexity means the text is predictable - each word follows naturally from the last. High perplexity means the text contains unexpected word choices, unusual phrasing, or creative language.
Why writers should care: Many detectors flag low-perplexity text as AI-generated. But some human writers naturally write in clear, predictable prose - and get punished for it.
In Context
Perplexity is the statistical backbone of most AI detection tools. Low perplexity means predictable text - words that follow expected patterns. High perplexity means surprising text. The assumption: AI writes predictably, humans write surprisingly. The problem: many humans write clearly and predictably. Technical writers, legal professionals, and journalists trained to write clean, direct prose naturally produce low-perplexity text. The detector sees a machine. The writer knows better.
Related Terms
- AI Detection - Software that attempts to determine whether a piece of text was written by a human or generated by an artificial intelligence.
- Algorithmic Bias - Systematic errors in AI systems that produce unfair outcomes for certain groups.
- Burstiness - A measure of how much variation exists in the complexity and length of sentences within a piece of writing.
- C2PA - The Coalition for Content Provenance and Authenticity - an open standard for certifying the origin and history of digital content.
- Content Provenance - The documented history of a piece of content from its creation through every edit, save, and publication.