What It Means
A technique where AI-generated text is subtly marked during generation by biasing the model's word choices toward specific patterns invisible to human readers but detectable by specialized tools. Watermarking embeds a statistical signature in the text itself.
Why writers should care: Watermarking could eventually solve the detection problem - but only if all AI providers agree to use it, and only if the marks survive editing.
In Context
Watermarking is the most promising and most contentious proposed solution to the detection problem. If AI providers embedded invisible statistical patterns in every piece of generated text, detection could become near-certain. But watermarks can be removed by paraphrasing, and voluntary adoption means only some providers would participate. OpenAI delayed deploying watermarking for two years, citing competitive concerns. Until watermarking is universal and mandatory, the detection problem remains fundamentally unsolvable.
Related Terms
- AI Detection - Software that attempts to determine whether a piece of text was written by a human or generated by an artificial intelligence.
- Algorithmic Bias - Systematic errors in AI systems that produce unfair outcomes for certain groups.
- Burstiness - A measure of how much variation exists in the complexity and length of sentences within a piece of writing.
- C2PA - The Coalition for Content Provenance and Authenticity - an open standard for certifying the origin and history of digital content.
- Content Provenance - The documented history of a piece of content from its creation through every edit, save, and publication.