🎮 The Next Input — Issue #056

81% of Your Employees Are Leaking Data to AI

Aaron Bost
September 17, 2025

In partnership with

Giphy

⚡ The Briefing — 60 sec

Gemini tops the App Store with its new “nano banana” image model. All I can think of is Ludacris and that track Number One Spot.
Google launches AP2 — a new protocol for agent-driven purchases. MCP? That’s old news. Say hello to AP2.
81% of employees admit to sharing confidential info with free AI tools. Yikes. Don’t tell HR…

🛠️ The Playbook — Confidential AI Guardrail System

Mission Deploy a layered system that detects, blocks, and educates employees when they paste sensitive business data into external AI tools.
Difficulty Advanced | Build time 2–3 hours (pilot)
ROI Reduces risk of data leaks by ≈ 70% and saves compliance/legal teams 10+ hours of incident handling each month.

0) Why This Matters

81% of employees have already admitted to doing it. Once sensitive info is pasted into a free AI tool, it’s gone forever. Guardrails need to:

Stop leaks before they happen (prevention).
Catch edge cases when prevention fails (detection).
Teach employees why it matters (education).

1) Reference Architecture

Layer	Tooling	Purpose
Trigger	Browser extension / Slack interceptor / API gateway	Catch outbound prompts before they hit public models.
Scanner	TrellisGuard / RegEx + DLP rules	Detect PII, client names, contract terms, code snippets.
Policy Engine	Supabase / Airtable	Store rules: “Never paste X”, risk tiers, approved exceptions.
Advisor (LLM)	Claude 3.5 Sonnet	Explain to user: why it was blocked and safer alternatives.
Audit Log	BigQuery + Looker Studio	Track violations, trends, departments with highest risk.
Human Review	Slack queue for Compliance	Escalate edge cases to humans within minutes.

2) Workflow

Trigger
- Employee copies text into browser/Slack → extension intercepts payload.
Scan
- Regex/DLP rules check for PII, financials, client domains, contract clauses.
Policy Check
- Lookup against “Policy DB” (vendor → allowed? keyword → risk score).
Decision
- If clean: forward request to AI.
- If medium risk: show warning + require user confirmation.
- If high risk: block + show reason.
Educate
- Claude generates a short “why this was blocked” note with safer alternatives:
  “This snippet included a client’s contract clause. Instead, paste only the summary.”
Log
- Every event saved to BigQuery with {user, dept, content_hash, rule_triggered, outcome}.
Review
- Weekly Slack digest to Compliance with top 10 blocked attempts + trends.

3) Example Prompts

A) Block Message (Claude)

SYSTEM: You are a corporate compliance advisor.
INPUT: {blocked_text_snippet}, {rule_triggered}, {policy_explanation}.
TASK: Write a 2-sentence educational note:
1) Explain in plain English why the snippet was blocked.
2) Suggest a safer way to use AI without leaking data.
Tone: Helpful, non-punitive.

B) Slack Compliance Digest

## AI Guardrail Weekly Digest
- Total Intercepts: {n}
- High Risk Blocks: {x}
- Top Rule Triggers: PII (40%), Client Names (35%), Source Code (25%)
- Dept w/ Most Attempts: {dept}
### Recommended Actions
- Targeted training for {dept}
- Review whitelist vendors list

4) Guardrails & Policy Examples

Never Paste: PII (passport, SSN), client names, contracts, unreleased code.
Medium Risk: Generic source code, anonymized financials.
Safe: Summaries, public-facing marketing copy.
Thresholds:
- Risk ≤ 0.3 → Allow.
- 0.3–0.7 → Warn.
- ≥ 0.7 → Block + educate.

5) Pilot Rollout — 90 Minutes

Deploy browser extension on IT-managed Chrome with regex for “@client.com ” + “Confidential.”
Store blocked attempts in Airtable for quick visibility.
Add Slack app to intercept /chatgpt or /ai commands.
Run 1-week pilot in Finance & Legal teams.
Collect logs → measure:
blocks, # warnings, # escalations.
- Training effectiveness (drop in repeat offenders).

6) Metrics That Matter

Blocked Attempts / Week.
% Repeated Offenders.
Response to Education: How many users adopt safe workflows after 2 warnings.
Mean Time to Escalation: From violation to compliance review.

Pro tip: Make the education useful. A block is annoying; a block with a safe alternative builds trust and adoption.

🎯 The Arsenal — Tools & Prompts

Asset	What it does	Link
TrellisGuard	Detects PII/code in AI prompts.	https://trellisguard.ai
BigQuery	Centralised risk/audit logs.	https://cloud.google.com/bigquery
Looker Studio	Visual dashboards for compliance.	https://lookerstudio.google.com
Prompt · Guardrail Educator	Block → explanation → safe alternative.

This snippet triggered {rule}. Explain to the user why it was blocked and give a safer alternative in ≤3 sentences. Tone = helpful, not scary.

💡 Free Office Hours

Need an AI guardrail system for your org?
Book a free 15-minute Office Hours slot—no sales pitch, just workflows solved.

→ Grab a slot: https://calendly.com/aaron-cylentis/the-next-input-office-hours

Hear from leaders at Anthropic, Rocket Money, and more at Pioneer

Pioneer is a summit for the brightest minds in AI customer service to connect, learn, and inspire one another, exploring the latest opportunities and challenges transforming service with AI Agents.

Hear directly from leaders at Anthropic, [solidcore], Rocket Money, and more about how their teams customize, test, and continuously improve Fin across every channel. You’ll take away proven best practices and practical playbooks you can put into action immediately.

See how today’s service leaders are cultivating smarter support systems, and why the future of customer service will never be the same.

🕹️ Game Over

Ship one guardrail today—tomorrow’s HR report won’t include “81% of staff leaked confidential data.”
Share your win; you could headline Issue #057.

— Aaron
Automating the boring. Amplifying the brilliant.

Forwarded this? Subscribe here