AI Output QA Checklist
AI accelerates production but introduces specific, predictable failure modes — invented statistics, hedging language, the smooth-but-empty tone — and one of them in a client deliverable can destroy months of trust. The AI Output QA Checklist is the last review pass before anything AI-assisted leaves the agency.
What this skill does
AI-assisted work fails in a predictable pattern. The most dangerous failure is the confident wrong fact — an invented stat, a misattributed quote, a feature claim that's a version behind. Style issues are recoverable; a fabricated McKinsey citation in a board paper is not. The skill prioritises factual integrity first, AI language patterns second, tone and structure third — in that order, because that's the order things actually damage trust.
The factual integrity scan flags specific statistics without sources, named studies or reports, historical claims, company-specific numbers, and technical specifics like version numbers or pricing that go stale fast. Each gets a verification tag with the exact claim and why it's suspicious, not a vague "some claims may need checking." Then the AI language pattern pass catches the hedging openers, the exhaustive lists, the false-balance constructions, the hollow transitions, the non-committal conclusions, the delve-and-leverage family, and the sycophantic softeners. Each gets a location and a suggested replacement, not just a complaint.
The tone and structure checks catch the things that make experienced readers feel "AI wrote this" without being able to name why — register shifts within the same piece, perfectly parallel sections, suspiciously comprehensive coverage that hits every angle instead of the angles that matter, missing point of view, the obligatory restating-the-topic intro. The client-fit pass then verifies the work uses the client's terminology, references their specific situation, doesn't contradict last month's deliverables, and is actually actionable for this audience.
The output puts the verdict at the top — PASS, PASS WITH EDITS, or NEEDS REWORK — followed by a flagged table with locations and suggested fixes, plus five quick fixes you can apply in five minutes before sending. Different content types get different standards: a blog post tolerates more AI patterns than a board paper, a transcription needs less style review than a thought leadership piece. The skill asks about audience first because reviewing without that context produces generic advice and misses the actual risks.
When this triggers
- ·You used AI to draft a client deliverable and need a forensic pass before sending
- ·A junior on the team has been shipping AI-assisted work and you're not sure what's slipping through
- ·You suspect a stat or attribution in a draft is invented and need it flagged for verification
- ·You're building a review SOP for AI-assisted work and need the checklist to back it
- ·A client deliverable is going to a senior stakeholder and the standard is 'no AI tells, no shortcuts'
Example
Trigger
User pastes a 900-word thought leadership post for a FTSE 250 client. Wants a QA pass before sending.
Output
Overall: PASS WITH EDITS. Factual claims to verify: · [VERIFY — line 14] "73% of enterprises have adopted AI per a 2024 McKinsey study" — specific stat, specific source. AI frequently invents both. Confirm or remove before sending. · [VERIFY — line 27] "Gartner predicts $X by 2027" — likely real prediction, possibly misquoted figure. Check. AI language patterns detected: · Line 1: "In today's rapidly evolving landscape" — hedging opener. Cut and start with the actual claim. · Line 8, 19, 22: "leverage", "harness", "unlock" — the delve family. Replace with concrete verbs. · Line 42: "Ultimately, the best approach depends on your specific needs" — non-committal close. The client hired you for a position. Take one. Tone: · Reads generic-corporate, not like [Client]'s usual voice. Their previous posts use sentence fragments and direct address. None of those here. Quick fixes (5 minutes): 1. Delete the opener, start at "The reason most..." 2. Replace 3 verbs flagged above with concrete alternatives 3. Verify or remove the McKinsey stat 4. Rewrite the conclusion to take a position
Get this skill + 15 more
Included in the The Agency Owner Stack — scale delivery without scaling headcount. Save $130+ vs buying individually.
Get The Agency Owner Stack — $149What you get
- 134-line SKILL.md, ready to drop into ~/.claude/skills/
- Tested through 3 Karpathy-loop iterations (versions v1.0.0 → v1.3.0)
- Triggers automatically when relevant — no command to remember
- Lifetime updates as the skill is refined further
More from Agency Operations
Takes a team roster, current projects, and pipeline, then produce a clear capacity model that shows where people are over-allocated, where there's slack, and what happens when pipeline converts
Runs the full process: extracting the right information, building the story arc, formatting for multiple distribution channels, and creating a distribution plan
Takes an agency's services, clients, strengths, and values, then craft a differentiated positioning statement and elevator pitch that makes them stand out in a crowded market
Takes 3 months of performance data and client context, then build a QBR document that demonstrates value delivered, frames performance honestly, identifies wins to celebrate, and positions natural…
Helps agencies and consultants maintain clear, honest records of what was AI-assisted, what was human-created, and what review process was applied
Builds the argument — both internal conviction and external messaging — for why a human agency is worth paying for when AI can generate content, analyse data, and build websites for nearly free
Browse the full library
297 skills across 31 categories. One purchase, lifetime updates.
See all bundles