Quick Answer
Most readers can learn to recognize ChatGPT in a single paragraph after seeing the pattern named once. This guide lays out the 12 specific signatures, why they appear, how Claude and Gemini compare, and where the tells mislead. To test a paragraph against the same signals teachers use, paste it into our AI Detector tool.
Why ChatGPT Has a Style Signature
ChatGPT writes the way it does for a reason. OpenAI trained the model on a broad slice of internet and book text, then fine-tuned it with Reinforcement Learning from Human Feedback (RLHF). Human raters consistently preferred responses that were polite, balanced, structured, and hedged. The model learned to write that way by default.
The same RLHF preference for helpful and harmless output trained the model to use safe vocabulary, careful symmetry, and a polite register. The result is text that reads competently but predictably. The fingerprint shows up across GPT-3.5, GPT-4, and GPT-4o, with each newer version slightly less obvious but never invisible.
The 12 ChatGPT Tells
1. Em-Dash Addiction
ChatGPT loves the em-dash character. Independent samples of GPT-4 output average two to four em-dashes per 500 words. Typical undergraduate writing has fewer than one. Look for parenthetical phrases set off with em-dashes where a comma or period would do the job. Example: The project was delayed, set off with em-dashes around despite the team's best efforts, until late August.
2. Opening With Certainly! or I'm Here to Help
ChatGPT cannot stop being polite. When given a task, it often opens with Certainly!, Of course!, Absolutely!, or I'm here to help. Even when the prompt asks for a draft and not a chat reply, residue of the assistant register slips in. A formal essay that opens with a one-line affirmation is almost always a paste-from-ChatGPT.
3. Delve Into
The verb delve is rare in casual English. By 2024 it was so over-represented in ChatGPT output that researchers used its frequency alone as a weak AI signal. If a 500-word piece uses delve into once, it could be coincidence. Twice is suspicious. Three times is a fingerprint.
4. Tapestry Of
A close cousin of delve. ChatGPT reaches for tapestry to describe anything complex or interconnected: a tapestry of experiences, a tapestry of cultures, a rich tapestry of voices. The metaphor is functional but the model leans on it far harder than human writers do.
5. Navigating the Complexities
The phrase navigating the complexities (or navigating the challenges, navigating the nuances) appears in roughly one in twenty ChatGPT essays on social topics. It is a hedge that signals depth without committing to a specific claim. Native human writers usually pick a concrete verb instead.
6. In Today's Digital Age
Almost any prompt about technology, society, or culture triggers in today's digital age as an opener. Variants include in our increasingly connected world and in the modern era. The phrase carries no information. It exists to soften the model into its opening paragraph.
7. Hedging Language
ChatGPT hedges constantly. It could be argued that, one might say, some experts believe, this could be seen as. The hedging is RLHF residue: human raters preferred answers that did not commit to strong claims. The result is prose that sounds careful but rarely takes a position.
8. Uniform Paragraph Length
ChatGPT paragraphs cluster around three to five sentences each. Open a five-paragraph response and measure: most paragraphs will land within one sentence of each other. Human writers swing from one-line paragraphs to ten-line ones depending on emphasis. The metronome rhythm of ChatGPT is one of the loudest non-vocabulary tells.
9. List Structures
When in doubt, ChatGPT bullets. Even prose responses are interrupted by numbered or bulleted lists, sometimes for three items that would read better as a single sentence. Gemini is worse on this dimension, but ChatGPT is still far above human baseline.
10. Symmetric Arguments
Every claim gets a counter. On one hand, on the other hand, while X is true, Y must also be considered. The symmetry is so consistent that researchers have used balance-of-argument as a model fingerprint. Human writers more often pick a side.
11. Closing With In Conclusion
A real human essay rarely uses the literal phrase in conclusion. ChatGPT uses it as a default closing transition. Variants include to sum up, in summary, ultimately. The closing paragraph then restates the introduction rather than adding new insight.
12. Overly Polite Tone in Every Register
Whether the prompt asks for a snarky tweet, a formal cover letter, or a casual blog post, ChatGPT defaults to a polite, neutral, professional register. Genuine snark, sharp opinion, and unhedged frustration are rare in default output. The flatness is the giveaway. Real writers have moods. ChatGPT has one mood.
ChatGPT vs Claude vs Gemini: Style Differences
The big three frontier models share a low burstiness baseline and vocabulary repetition, but each has its own fingerprint.
- ChatGPT: Verbose, em-dash heavy, hedging-prone, polite openings, delve and tapestry, in conclusion endings.
- Claude: Thoughtful pacing, fewer fixed cliches, more self-correction (e.g., actually, on reflection), still uniform paragraph length, prefers commas over em-dashes.
- Gemini: Enumerated, table-heavy, list-driven, often opens with a one-line direct answer then bullets, strong structural fingerprint.
- All three share: Low burstiness, vocabulary repetition within 500 words, symmetric arguments, generic examples.
Detecting GPT-4 vs GPT-3.5
GPT-3.5 wears its tells on its sleeve. Five-paragraph essay structure, two delves, four em-dashes, and an in conclusion ending appear in a typical 500-word draft. GPT-4 is harder. Em-dash use drops slightly, hedging is more nuanced, and cliches appear less frequently. GPT-4o softens the polite openings further.
But the deep statistical signal stays. Burstiness stays low. Paragraph length stays uniform. Em-dash use stays elevated. Certainly! still slips in. Researchers running benchmark detection on GPT-4o (Mitchell et al. and follow-up work in 2024) found that the strongest detectors still flag GPT-4o text well above human baseline. The tells migrated, but they did not vanish.
Live Demo: Run This Text Through Our Detector
Here is a 110-word sample written by GPT-4 from the prompt write a short paragraph about remote work:
In today's digital age, remote work has fundamentally transformed the way professionals navigate the complexities of their careers. While it offers a tapestry of benefits, including increased flexibility and the ability to delve into a healthier work-life balance, it also presents challenges, such as feelings of isolation and difficulty separating personal and professional life. Companies must adopt robust frameworks to support their distributed teams, leveraging communication tools and fostering a culture of trust. Ultimately, the future of work is ever-evolving, and organizations that embrace this shift while addressing its complexities will thrive in an increasingly interconnected world. In conclusion, remote work is here to stay.
Paste that paragraph into our AI Detectorand the verdict comes back Almost Certainly AI in under a second. The detector flags six cliche phrases (in today's digital age, navigating the complexities, tapestry of, delve into, robust framework, leveraging, ever-evolving, in conclusion), low burstiness (sentences cluster between 18 and 28 words), and structural symmetry (introduce, list benefits, list challenges, conclude). Each is a tell from the list above.
When These Tells Are Misleading
The 12 tells produce false positives in three notable genres.
- Academic abstracts. Journal abstracts are formal, hedged, symmetric, and packed with safe vocabulary. They look like ChatGPT because they are written to the same constraints: brevity, balance, and neutrality.
- Technical and legal documentation. Specifications, policy briefs, and legal memos prize hedging and symmetry on purpose. A high AI score on a contract is usually a false positive.
- Non-native English writers. Liang et al. (Stanford 2023) found that GPT detectors flagged 61% of TOEFL essays by non-native English speakers as AI-generated. Formal vocabulary, careful symmetry, and hedging are characteristic of second-language academic English.
The 12 tells are most reliable on student essays, blog drafts, social posts, marketing copy, and personal correspondence. Use them as signals to investigate, not as proof on their own. If you are a teacher or editor, combine multiple tells before drawing a conclusion. If you are a writer worried about false positives, read our companion guide on how to humanize AI text for the techniques that fix the underlying signal, not just the surface words.
The 30-Second Check
When you suspect ChatGPT wrote something, run this in order.
- Count em-dash characters in the first 200 words.
- Scan for delve, tapestry, navigating, in today's digital age.
- Measure paragraph length variance. Tight cluster around 3 to 5 sentences is a signal.
- Look for symmetric arguments and the phrase in conclusion.
- Paste the text into our free AI Detector to score the same signals automatically.
Two or three matches in 30 seconds is enough to identify ChatGPT output reliably. Add a high detector score and the case is essentially closed.
Sources
- Mitchell, E., Lee, K., Khazatsky, A., Manning, C.D., & Finn, C. (2023). DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature. Stanford University.
- Gehrmann, S., Strobelt, H., & Rush, A.M. (2019). GLTR: Statistical Detection and Visualization of Generated Text. Harvard NLP / MIT-IBM Watson AI Lab.
- OpenAI (2024). GPT-4 System Card and GPT-4o Technical Report.
- Anthropic (2024). Claude 3 Model Family Documentation and Constitutional AI Principles.