The Next Input by Cylentis AI
The Next Input — Issue #101
When AI Lies About Real Events

Looking at you, Grok…
⚡ The Briefing — 60 sec
Grok gets the facts wrong about Bondi Beach shooting
Here at The Next Input, being based in Sydney, this one hits differently when it's local. To all those affected and the greater community, our thoughts and our hearts are with you. On a lighter note, this is EXACTLY why we don't promote Grok all that much.
Google Translate now lets you hear real-time translations in your headphones
So why did I spend five whole years studying Japanese?
Triple Zero call service needs AI tech upgrade, survey finds
The tech is coming—and in a genuinely good way. I’ve seen this work up close. When it’s done right, it saves time and lives.
🛠️ The Playbook — The High-Stakes Accuracy Layer
Mission: Add a mandatory accuracy and verification layer for AI outputs used in sensitive, real-world contexts.
Difficulty: Advanced
Build time: 3 hours
ROI: Prevents misinformation, protects trust, and avoids catastrophic “AI said it” moments.
0) Why This Matters
When AI is used around public safety, emergencies, or real people, accuracy isn’t a feature—it’s the baseline.
The Bondi example shows what happens when models speculate instead of verify.
This layer forces AI systems to slow down, cross-check, and escalate uncertainty instead of bluffing.
1) Architecture
| Component | Tool | Purpose |
|---|---|---|
| Input Gate | API Gateway / App Layer | Routes sensitive requests |
| Verifier | Claude 4.5 Haiku | Fact-checking and uncertainty detection |
| Cross-Check | GPT-5-mini | Validates claims against known data |
| Confidence Scorer | Rules Engine | Assigns confidence thresholds |
| Escalation | Human-in-the-loop Queue | Hands off low-confidence outputs |
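One way to picture the Input Gate is a small classifier sitting in front of the model. A minimal sketch, assuming a keyword match is good enough for a first pass (the topic list and names here are illustrative, not a production taxonomy — a real gate would use a trained classifier):

```python
from dataclasses import dataclass

# Illustrative sensitive-topic list; extend for your domain.
SENSITIVE_TOPICS = {"shooting", "emergency", "triple zero", "evacuation", "outbreak"}

@dataclass
class Route:
    sensitive: bool
    reason: str

def route_request(query: str) -> Route:
    # Route sensitive queries into the accuracy layer instead of
    # returning model output straight to the user.
    q = query.lower()
    for topic in SENSITIVE_TOPICS:
        if topic in q:
            return Route(sensitive=True, reason=f"matched topic: {topic}")
    return Route(sensitive=False, reason="no sensitive topic matched")
```

False positives here are cheap (the query just takes the slow path), so err on the side of matching too much.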
2) Workflow
1. Sensitive query is detected (news, emergencies, incidents, health, safety).
2. Output is not returned immediately to the user.
3. Claude 4.5 Haiku checks for:
   - unverifiable claims
   - speculation
   - missing sources
4. GPT-5-mini runs a secondary pass to confirm factual consistency.
5. System assigns a confidence score.
6. If confidence is below the threshold:
   - the response is softened,
   - uncertainty is stated clearly, or
   - human review is triggered.
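Steps 5 and 6 can be sketched as a simple rules engine: each flag raised by the verifier passes subtracts from a confidence score, and the score picks one of three outcomes. The weights and thresholds below are assumptions to tune during the pilot, not recommendations:

```python
from enum import Enum

class Verdict(Enum):
    PUBLISH = "publish"
    SOFTEN = "soften"
    ESCALATE = "escalate"

# Illustrative penalties and thresholds -- tune against your pilot data.
PENALTY = {"unverifiable_claim": 0.4, "speculation": 0.3, "missing_source": 0.2}
PUBLISH_AT = 0.8
SOFTEN_AT = 0.5

def confidence(flags: list[str]) -> float:
    # Start fully confident; each verifier flag subtracts its penalty
    # (unknown flag types cost a default 0.1). Floor at zero.
    c = 1.0
    for f in flags:
        c -= PENALTY.get(f, 0.1)
    return max(c, 0.0)

def decide(flags: list[str]) -> Verdict:
    c = confidence(flags)
    if c >= PUBLISH_AT:
        return Verdict.PUBLISH
    if c >= SOFTEN_AT:
        return Verdict.SOFTEN
    return Verdict.ESCALATE
```

A clean draft publishes, one speculation flag gets softened, and a stack of flags goes to a human.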
3) Example Prompts
Primary Accuracy Check (Claude 4.5 Haiku)
Review this response for:
- factual accuracy
- speculative language
- missing verification
Return:
- Safe to publish / Needs caution / Do not publish
Include reasoning in one sentence.
Cross-Validation Prompt (GPT-5-mini)
Cross-check all factual claims.
If a claim cannot be verified, flag it clearly.
Do not infer or guess missing details.
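Wiring both prompts into the pipeline only needs one thin wrapper. A sketch assuming a generic `call_model(model, prompt)` function that stands in for your provider's SDK — the model identifier strings are placeholders for however your client names the models from the architecture table:

```python
ACCURACY_PROMPT = """Review this response for:
- factual accuracy
- speculative language
- missing verification
Return: Safe to publish / Needs caution / Do not publish
Include reasoning in one sentence.

Response:
"""

CROSS_CHECK_PROMPT = """Cross-check all factual claims.
If a claim cannot be verified, flag it clearly.
Do not infer or guess missing details.

Response:
"""

def verify(draft: str, call_model) -> dict:
    # Two independent passes over the same draft; disagreement
    # between them is itself a signal worth escalating.
    primary = call_model("claude-4.5-haiku", ACCURACY_PROMPT + draft)
    secondary = call_model("gpt-5-mini", CROSS_CHECK_PROMPT + draft)
    return {"primary": primary, "secondary": secondary}
```

Keeping the checks independent (separate calls, separate models) is the point: a single model grading its own work will grade generously.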
4) Guardrails
Never allow confident language when facts are uncertain.
Default to “we don’t know yet” in unfolding events.
Escalate instead of filling gaps.
Log every flagged response for audit and tuning.
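The first two guardrails can also be enforced mechanically as a last line of defence after the model passes. A deliberately naive sketch — the phrase list and hedge wording are assumptions to adapt to your house style:

```python
HEDGE = "This is still developing and key details are unconfirmed. "

# Confident framings to neutralise when confidence is low.
CONFIDENT_PHRASES = ("it is confirmed", "we know that", "definitely")

def soften(response: str, conf: float, threshold: float = 0.8) -> str:
    # Above the threshold, pass the response through untouched.
    # Below it, strip confident framing and lead with explicit uncertainty.
    if conf >= threshold:
        return response
    out = response
    for phrase in CONFIDENT_PHRASES:
        out = out.replace(phrase, "reports suggest")
    return HEDGE + out
```

This is a backstop, not a substitute for the verifier passes: string matching will miss plenty, but it guarantees low-confidence output never ships without an uncertainty statement.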
5) Pilot Rollout — 3 hours
Define what counts as “high-stakes” content.
Insert the accuracy layer before final output.
Test against 20 historical news scenarios.
Review false positives and tune thresholds.
Add human escalation for lowest-confidence cases.
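Step 4 above (reviewing false positives) is easier with a tiny replay harness. A sketch assuming each of your 20 historical scenarios is labelled with whether it should have been flagged:

```python
def evaluate(scenarios, should_flag_fn):
    # scenarios: list of (text, expected_flag) pairs.
    # should_flag_fn: the accuracy layer's flag decision under test.
    fp = fn = 0
    for text, expected in scenarios:
        got = should_flag_fn(text)
        if got and not expected:
            fp += 1
        elif expected and not got:
            fn += 1
    return {"false_positives": fp, "false_negatives": fn, "total": len(scenarios)}
```

In a high-stakes layer, tune so false negatives approach zero first; false positives only cost latency.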
6) Metrics
Number of responses flagged before release
Reduction in factual corrections
Average confidence score over time
Human escalation rate
Trust score from end users
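If every response is logged per the guardrails above, most of these metrics fall out of one aggregation pass. A sketch over an assumed audit-log shape (the field names are illustrative):

```python
from statistics import mean

def summarize(audit_log: list[dict]) -> dict:
    # Assumed entry shape:
    # {"confidence": 0.72, "flagged": True, "escalated": False}
    return {
        "flagged_before_release": sum(1 for e in audit_log if e["flagged"]),
        "avg_confidence": round(mean(e["confidence"] for e in audit_log), 2),
        "escalation_rate": round(
            sum(1 for e in audit_log if e["escalated"]) / len(audit_log), 2
        ),
    }
```

Track these weekly: average confidence should drift up and escalation rate down as thresholds get tuned.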
Pro Tip: Silence is better than speed when stakes are high. Make that a system rule.
🎯 The Arsenal — Tools & Platforms
OpenAI Moderation API · Early detection of sensitive scenarios · https://platform.openai.com
LangSmith Evaluations · Track accuracy and failure patterns · https://smith.langchain.com
Supabase Edge Logs · Immutable audit trail for responses · https://supabase.com
Humanloop · Human-in-the-loop review workflows · https://humanloop.com
Copy-paste prompt block:
You are operating in a high-stakes context.
If facts are uncertain:
- state uncertainty
- do not speculate
- escalate when needed
Accuracy over speed.
💡 Free Office Hours
Want help implementing anything? Book a free 15-minute Office Hours slot—no sales pitch, just workflows solved.
Attention spans are shrinking. Get proven tips on how to adapt:
Mobile attention is collapsing.
In 2018, mobile ads held attention for 3.4 seconds on average.
Today, it’s just 2.2 seconds.
That’s a 35% drop in only 7 years. And a massive challenge for marketers.
The State of Advertising 2025 shows what’s happening and how to adapt.
Get science-backed insights from a year of neuroscience research and top industry trends from 300+ marketing leaders. For free.
🕹️ Game Over
When it matters most, the right answer beats the fast one.
— Aaron
Automating the boring. Amplifying the brilliant.
Subscribe: https://cylentisai.beehiiv.com/subscribe

