🎮 The Next Input - Issue #120

Claude Is Now Your Slack Intern

⚡ The Briefing - 60 sec

🛠️ The Playbook - The Workplace Agent Mesh

Mission: Turn Slack into a controlled agent layer that triages, drafts, and executes work across your tools, without “bot sprawl.”

Difficulty: Advanced

Build time: 3 hours

ROI: 6 hrs/week saved → 2 hrs/week (measured via Slack + ticket timestamps) with fewer dropped handoffs.

0) Why This Matters

Agents inside workplace tools are inevitable, and Slack is becoming the control plane. The risk is letting 15 “helpful bots” quietly turn into 15 unowned systems making untracked changes. This mesh gives you speed and accountability: who did what, where, and under which rules.

1) Architecture

| Component | Tool | Purpose | Owner | Failure mode |
|---|---|---|---|---|
| Entry point | Slack (channels + slash commands) | Where work is requested and confirmed | Ops lead | Requests scatter across DMs, no audit trail |
| Router | GPT-5-mini | Decide which agent handles the request | Eng lead | Wrong routing → duplicated work or missed tasks |
| Executors | Claude 4.5 Sonnet | Perform scoped tasks (draft, summarise, extract, propose) | Function owners | “Helpful” outputs that exceed scope |
| Permissions | Open Policy Agent | Enforce role-based actions + approvals | Security | Permission creep / agents doing writes |
| Observability | Langfuse | Trace prompts, actions, failures, drift | Eng | Silent failures and repeated mistakes |
| System bridge | Internal APIs / webhooks | Read/write to tools (Jira, CRM, docs) | Platform | Write actions without confirmation |
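
The Permissions component above boils down to a deny-by-default check with an extra approval requirement for writes. Here is a minimal Python sketch of that idea. It is not OPA or Rego; the role names, action names, and the `is_allowed` helper are assumptions made up for illustration, and real rules would live in your policy engine.

```python
# Hypothetical role -> allowed actions map (illustrative only).
ROLE_ACTIONS = {
    "research_agent": {"read"},
    "comms_agent": {"read", "draft"},
    "ops_triage_agent": {"read", "draft", "write"},
}


def is_allowed(role: str, action: str, approved: bool = False) -> bool:
    """Deny by default; a write also needs an explicit human approval."""
    if action not in ROLE_ACTIONS.get(role, set()):
        return False  # unknown role or action outside the role's scope
    if action == "write":
        return approved  # in scope, but still gated on approval
    return True


print(is_allowed("comms_agent", "draft"))           # in scope, no write
print(is_allowed("comms_agent", "write", True))     # out of scope, even if approved
print(is_allowed("ops_triage_agent", "write"))      # in scope, not yet approved
```

The point of the shape: an approval never widens scope. It only unlocks a write the role was already allowed to request.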

2) Workflow

  1. Request intake (Slack): User posts in #ops-requests or runs /agent help with the request.

  2. Scope check: Router labels the request type: research, draft, ops-change, finance, people, unknown.

  3. Decision gate:

    • If unknown → ask one clarification question and stop.

    • If finance or people → default to draft-only + require approval.

  4. Routing: GPT-5-mini selects the correct agent (single agent only).

  5. Execution: Claude 4.5 Sonnet agent completes one task: draft, summarise, extract, or propose steps.

  6. Write gate (critical): If the task touches a system change (ticket updates, record edits, deployments):

    • If confidence < 0.8 → do not write; return a proposed patch + ask for approval.

    • If confidence ≥ 0.8 → still require explicit /approve for writes.

  7. Action + log: Once approved, the bridge executes and logs: input, output, diff, actor, timestamp.

  8. Close loop: Agent posts a final Slack summary with: what changed, where, and rollback path.
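
The decision gate (step 3) and write gate (step 6) can be sketched as one small function. This is a hedged illustration, not a reference implementation: the `Request` fields, the returned step names, and the idea that the executor reports a numeric confidence are all assumptions. Only the categories and the 0.8 threshold come from the workflow above.

```python
from dataclasses import dataclass

SENSITIVE = {"finance", "people"}
CONFIDENCE_THRESHOLD = 0.8  # threshold from step 6


@dataclass
class Request:
    category: str           # router label from step 2
    touches_system: bool    # would change a ticket, record, or deployment
    confidence: float       # assumed executor confidence score in [0, 1]
    approved: bool = False  # True once a human has run /approve


def next_step(req: Request) -> str:
    """Return the next action for a request (step names are hypothetical)."""
    if req.category == "unknown":
        return "ask_clarification"
    if req.category in SENSITIVE and not req.approved:
        return "draft_and_request_approval"
    if req.touches_system:
        if req.approved:
            return "execute_and_log"
        if req.confidence < CONFIDENCE_THRESHOLD:
            return "propose_patch_and_request_approval"
        return "await_approve"  # high confidence still needs /approve
    return "deliver_draft"


print(next_step(Request("ops-change", touches_system=True, confidence=0.9)))
```

Note that there is no code path that writes without `approved` being true, which is the property the write gate exists to protect.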

3) Example Prompts

Router (GPT-5-mini)

Classify and route this Slack request.

Return JSON:
{
  "category": "research|draft|ops-change|finance|people|unknown",
  "best_agent": "name",
  "needs_clarification": true/false,
  "one_question_if_needed": "",
  "requires_approval": true/false,
  "risk_level": "low|medium|high"
}

Rules:
- If category is finance or people, requires_approval = true.
- If uncertain, category = unknown and ask one question.
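
A model can ignore its own rules, so it may be worth re-applying them in code before acting on the router's reply. A minimal sketch, assuming the reply arrives as a raw string; the function name `parse_router_reply` and the fail-closed defaults (unknown category, high risk, approval required) are assumptions, not part of the prompt.

```python
import json

CATEGORIES = {"research", "draft", "ops-change", "finance", "people", "unknown"}
RISK_LEVELS = {"low", "medium", "high"}


def parse_router_reply(raw: str) -> dict:
    """Parse the router JSON and enforce the prompt's rules, failing closed."""
    try:
        data = json.loads(raw)
    except json.JSONDecodeError:
        data = {}
    if not isinstance(data, dict):
        data = {}

    category = data.get("category")
    if category not in CATEGORIES:
        category = "unknown"  # anything unrecognised is treated as uncertain
    risk = data.get("risk_level")
    if risk not in RISK_LEVELS:
        risk = "high"  # assumption: missing risk is treated as high

    best_agent = data.get("best_agent")
    return {
        "category": category,
        "best_agent": best_agent if isinstance(best_agent, str) else None,
        "needs_clarification": data.get("needs_clarification") is True
        or category == "unknown",
        "one_question_if_needed": str(data.get("one_question_if_needed") or ""),
        # Approval is skipped only when the model says false AND the
        # category is not finance or people.
        "requires_approval": data.get("requires_approval") is not False
        or category in {"finance", "people"},
        "risk_level": risk,
    }


print(parse_router_reply('{"category": "finance", "requires_approval": false}'))
```

In the usage line, the model's `"requires_approval": false` is overridden because the category is finance.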

Executor (Claude 4.5 Sonnet)

You are a scoped workplace agent. Complete ONE task only.

Task: {task}
Constraints:
- Do not write to external systems.
- If information is missing, ask one question and stop.
Output:
1) Result
2) Assumptions (if any)
3) Next action + whether approval is required
Tone: concise, operator-friendly.

Eval (Claude 4.5 Haiku)

Evaluate the agent output for:
- scope compliance
- missing steps
- risky assumptions
Return: PASS / FLAG / FAIL
If FLAG/FAIL, provide the minimal fix.
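
To act on the eval result, the verdict has to be pulled out of free text. A small sketch of one way to do that, assuming the evaluator replies in plain text with the verdict word somewhere near the start; `parse_verdict` is a hypothetical helper, and treating an unreadable reply as FAIL is a design assumption.

```python
import re


def parse_verdict(reply: str) -> tuple[str, str]:
    """Return (verdict, minimal_fix); unrecognised replies count as FAIL."""
    match = re.search(r"\b(PASS|FLAG|FAIL)\b", reply)
    if match is None:
        return "FAIL", ""
    verdict = match.group(1)
    if verdict == "PASS":
        return verdict, ""
    # Whatever follows the verdict word is taken as the suggested fix.
    fix = reply[match.end():].strip(" \t\r\n:-.")
    return verdict, fix


verdict, fix = parse_verdict("FLAG: add a rollback step")
print(verdict, "|", fix)
```

Only a PASS should let the draft through to the requester; FLAG and FAIL go back to the agent or to a human with the fix attached.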

4) Guardrails

  • Draft-only by default. Writes require explicit approval.

  • One agent, one job. No “general assistant” agents.

  • No silent permissions. Any new permission requires a human owner sign-off.

  • Immutable audit trail for every action and diff.

  • Hard block on sensitive domains (HR, health, legal) unless configured + approved.
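
One common way to approximate an immutable audit trail is a hash chain, where each record stores the hash of the one before it. The sketch below is tamper-evident rather than truly immutable, and it is an assumption about how you might build it, not how any particular tool does. The entry fields mirror the log fields from the workflow (input, output, diff, actor, timestamp); the helper names are hypothetical.

```python
import hashlib
import json

GENESIS = "0" * 64  # placeholder hash for the first record


def _digest(body: dict) -> str:
    """Stable SHA-256 over a record body (keys sorted for determinism)."""
    return hashlib.sha256(json.dumps(body, sort_keys=True).encode("utf-8")).hexdigest()


def append_entry(log: list, entry: dict) -> dict:
    """Append an entry, chaining it to the previous record's hash."""
    prev_hash = log[-1]["hash"] if log else GENESIS
    body = {**entry, "prev_hash": prev_hash}
    record = {**body, "hash": _digest(body)}
    log.append(record)
    return record


def verify(log: list) -> bool:
    """Recompute every hash; any edited or reordered record breaks the chain."""
    prev_hash = GENESIS
    for record in log:
        body = {k: v for k, v in record.items() if k != "hash"}
        if body.get("prev_hash") != prev_hash or _digest(body) != record["hash"]:
            return False
        prev_hash = record["hash"]
    return True


log = []
append_entry(log, {"input": "close ticket", "output": "closed", "diff": "status: open -> done",
                   "actor": "ops_triage_agent", "timestamp": "2025-01-01T10:00:00Z"})
print(verify(log))
```

Editing any field of an earlier record after the fact makes `verify` return False, which is the signal to raise in #agent-incidents.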

5) Pilot Rollout - 3 hours

  1. Create Slack channels: #agent-requests, #agent-audit, #agent-incidents.

  2. Stand up 3 agents only: Research, Comms Drafting, Ops Triage.

  3. Implement routing + eval prompts and force draft-only.

  4. Run 20 real requests through the system.

  5. Success criteria: ≥80% correct routing, ≤2 minutes to first draft, 0 unapproved writes, rollback tested once.

  6. Expand to one write-capable workflow with /approve (e.g., Jira ticket updates).
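
The success criteria in step 5 are easy to check mechanically once the 20 requests are logged. A rough sketch, with two assumptions to flag: the `PilotRun` record shape is invented for illustration, and "≤2 minutes to first draft" is read here as applying to every request rather than to an average.

```python
from dataclasses import dataclass


@dataclass
class PilotRun:
    routed_correctly: bool
    seconds_to_first_draft: float
    unapproved_write: bool


def pilot_passed(runs: list[PilotRun], rollback_tested: bool) -> bool:
    """Apply the step 5 criteria to a batch of pilot requests."""
    if not runs:
        return False
    routing_rate = sum(r.routed_correctly for r in runs) / len(runs)
    slowest = max(r.seconds_to_first_draft for r in runs)
    unapproved = sum(r.unapproved_write for r in runs)
    return (
        routing_rate >= 0.80
        and slowest <= 120      # assumption: every request, not the mean
        and unapproved == 0
        and rollback_tested
    )


# 20 requests, 16 routed correctly, all drafted in 90 seconds.
runs = [PilotRun(i < 16, 90.0, False) for i in range(20)]
print(pilot_passed(runs, rollback_tested=True))
```

If the check fails, stay in draft-only mode and fix routing before moving on to step 6.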

6) Metrics

  • Time to first useful output: 10 min → 2 min (Slack timestamps)

  • Correct routing rate: 60% → 85% (manual sample of 20/week)

  • Unapproved write attempts: baseline unknown → 0 (OPA policy logs)

  • Task completion time (ops triage): 45 min → 20 min (ticket lifecycle)

  • Repeat failure rate: 5/week → 1/week (Langfuse error tags)
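
To make the targets above comparable, it helps to express each as a relative change. The short sketch below does only that arithmetic on the before and after figures listed; the helper name `percent_reduction` and the dictionary keys are illustrative, and unapproved writes are left out because the baseline is unknown.

```python
def percent_reduction(before: float, after: float) -> float:
    """Relative drop from before to after, in percent, to one decimal."""
    return round((before - after) * 100 / before, 1)


# (before, after) pairs taken from the metrics list above.
targets = {
    "time_to_first_output_min": (10, 2),
    "ops_triage_completion_min": (45, 20),
    "repeat_failures_per_week": (5, 1),
}

for name, (before, after) in targets.items():
    print(f"{name}: {percent_reduction(before, after)}% lower")

# Routing rate is already a percentage, so report the gain in points.
routing_gain_points = 85 - 60
print(f"correct_routing_rate: +{routing_gain_points} points")
```

That works out to an 80% cut in time to first output, about a 55.6% cut in triage time, an 80% cut in repeat failures, and a 25-point gain in routing accuracy.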

Pro Tip: If an agent asks for “just one more permission,” that’s not progress; that’s scope creep.

🎯 The Arsenal - Tools & Prompts

Copy-paste prompt block:

You are a workplace agent operating under strict controls.

Rules:
- Draft-only unless explicitly approved.
- One task at a time.
- If unclear, ask ONE question and stop.
- Never assume permissions.
- Always return: result, assumptions, next action, rollback.

If the task touches money, people, or policy: require approval.

💡 Free Office Hours

Want help implementing anything? Book a free 15-minute Office Hours slot: no sales pitch, just workflows solved.

🕹️ Game Over

More agents don’t mean more chaos, unless you let them.

Aaron
Automating the boring. Amplifying the brilliant.