v1.3.03 loop iterations

AI Output QA Checklist

AI accelerates production but introduces specific, predictable failure modes — invented statistics, hedging language, the smooth-but-empty tone — and one of them in a client deliverable can destroy months of trust. The AI Output QA Checklist is the last review pass before anything AI-assisted leaves the agency.

What this skill does

AI-assisted work fails in a predictable pattern. The most dangerous failure is the confident wrong fact — an invented stat, a misattributed quote, a feature claim that's a version behind. Style issues are recoverable; a fabricated McKinsey citation in a board paper is not. The skill prioritises factual integrity first, AI language patterns second, tone and structure third — in that order, because that's the order things actually damage trust.

The factual integrity scan flags specific statistics without sources, named studies or reports, historical claims, company-specific numbers, and technical specifics like version numbers or pricing that go stale fast. Each gets a verification tag with the exact claim and why it's suspicious, not a vague "some claims may need checking." Then the AI language pattern pass catches the hedging openers, the exhaustive lists, the false-balance constructions, the hollow transitions, the non-committal conclusions, the delve-and-leverage family, and the sycophantic softeners. Each gets a location and a suggested replacement, not just a complaint.

The tone and structure checks catch the things that make experienced readers feel "AI wrote this" without being able to name why — register shifts within the same piece, perfectly parallel sections, suspiciously comprehensive coverage that hits every angle instead of the angles that matter, missing point of view, the obligatory restating-the-topic intro. The client-fit pass then verifies the work uses the client's terminology, references their specific situation, doesn't contradict last month's deliverables, and is actually actionable for this audience.

The output puts the verdict at the top — PASS, PASS WITH EDITS, or NEEDS REWORK — followed by a flagged table with locations and suggested fixes, plus five quick fixes you can apply in five minutes before sending. Different content types get different standards: a blog post tolerates more AI patterns than a board paper, a transcription needs less style review than a thought leadership piece. The skill asks about audience first because reviewing without that context produces generic advice and misses the actual risks.

When this triggers

·You used AI to draft a client deliverable and need a forensic pass before sending
·A junior on the team has been shipping AI-assisted work and you're not sure what's slipping through
·You suspect a stat or attribution in a draft is invented and need it flagged for verification
·You're building a review SOP for AI-assisted work and need the checklist to back it
·A client deliverable is going to a senior stakeholder and the standard is 'no AI tells, no shortcuts'

Example

Trigger

User pastes a 900-word thought leadership post for a FTSE 250 client. Wants a QA pass before sending.

Output

Overall: PASS WITH EDITS. Factual claims to verify: · [VERIFY — line 14] "73% of enterprises have adopted AI per a 2024 McKinsey study" — specific stat, specific source. AI frequently invents both. Confirm or remove before sending. · [VERIFY — line 27] "Gartner predicts $X by 2027" — likely real prediction, possibly misquoted figure. Check. AI language patterns detected: · Line 1: "In today's rapidly evolving landscape" — hedging opener. Cut and start with the actual claim. · Line 8, 19, 22: "leverage", "harness", "unlock" — the delve family. Replace with concrete verbs. · Line 42: "Ultimately, the best approach depends on your specific needs" — non-committal close. The client hired you for a position. Take one. Tone: · Reads generic-corporate, not like [Client]'s usual voice. Their previous posts use sentence fragments and direct address. None of those here. Quick fixes (5 minutes): 1. Delete the opener, start at "The reason most..." 2. Replace 3 verbs flagged above with concrete alternatives 3. Verify or remove the McKinsey stat 4. Rewrite the conclusion to take a position

Get this skill + 15 more

Included in the The Agency Owner Stack — scale delivery without scaling headcount. Save $130+ vs buying individually.

Get The Agency Owner Stack — $149

What you get

134-line SKILL.md, ready to drop into ~/.claude/skills/
Tested through 3 Karpathy-loop iterations (versions v1.0.0 → v1.3.0)
Triggers automatically when relevant — no command to remember
Lifetime updates as the skill is refined further