EP 08May 28, 2026Enterprise AI SecurityPrompt InjectionAI Agent RiskSecurity Governance

The AI Agent Security Audit You Haven't Done

Enterprise AI agents with tool use — web browsing, email, database access, code execution — are in production across organizations that reviewed them as chat models. This episode dissects why prompt injection is the primary unsolved threat, how tool permission creep happens silently across development cycles, and what a structured AI agent security program actually looks like to build.

The Deployment Debrief · Host: Elise · AI Insight Lab

Read the memo →View slide deck →All episodes →

ShareLinkedIn X

Key takeaways

1
An AI agent with tool use is not the same security problem as an AI model without it — the threat surface, incident attribution model, and governance requirements are categorically different.
2
Prompt injection is unsolved at scale: OWASP LLM Top 10 documents that over 90% of major frameworks lack reliable defenses, and your production agents are not exceptions.
3
Tool permission creep happens silently across development iterations — the permissions granted at deployment rarely match the permissions in use six months later.
4
The EU AI Act's 72-hour incident reporting window requires a workflow most enterprises have not built — and AI agent incidents are the most likely trigger for that obligation.

The Deployment Memo

One enterprise AI deployment, dissected every Tuesday.

Every issue covers the same format as this episode: what broke, why it broke, and how to avoid it before it happens to you.

Episode sections

Hook & Context

Why enterprise AI security reviews written for chat models don't cover the threat surface that tool-using agents create — and why most organizations don't realize the gap yet.

The Shift from Chat to Agents

What changed between 2023 and 2024: from models that generate text to agents that execute SQL, send email, browse the web, and write files — and why that distinction changes the entire security model.

Prompt Injection: The Primary Threat

Why over 90% of major LLM frameworks lack reliable prompt injection defenses, how it works operationally, and what a successful attack looks like in a production enterprise environment.

How the Governance Gap Formed

Why the security review that approved your LLM deployment didn't include tool-use, service account permissions, or incident attribution — and how that gap compounds with each new agent deployment.

The Hidden Dynamics

Tool permission creep across development cycles, the absence of standard service account review frameworks for AI agents, and why EU AI Act incident reporting requirements have no corresponding workflow at most enterprises.

Three Audit Postures

Reactive (post-incident), compliance-driven (regulation-first), and proactive (threat-model-first) — what each looks like operationally and which one your organization is implicitly running right now.

The Five-Step Security Program

The specific audit framework: agent inventory, tool permission review, prompt injection testing, incident attribution workflow, and insurance coverage gap analysis — what each step requires and who owns it.

Closing Question

The six questions your CISO cannot currently answer about your enterprise AI agents — and why the inability to answer them is itself a material security finding.

← Previous

EP07

The Open-Source Sovereign AI Decision

EP09

The AI Clinical Note Your Physician Didn't Write — and Signed Anyway

The AI Agent Security Audit You Haven't Done

The Deployment Debrief · Host: Elise · AI Insight Lab

Key takeaways

An AI agent with tool use is not the same security problem as an AI model without it — the threat surface, incident attribution model, and governance requirements are categorically different.

Prompt injection is unsolved at scale: OWASP LLM Top 10 documents that over 90% of major frameworks lack reliable defenses, and your production agents are not exceptions.

Tool permission creep happens silently across development iterations — the permissions granted at deployment rarely match the permissions in use six months later.

The EU AI Act's 72-hour incident reporting window requires a workflow most enterprises have not built — and AI agent incidents are the most likely trigger for that obligation.

Episode sections

Hook & Context

Why enterprise AI security reviews written for chat models don't cover the threat surface that tool-using agents create — and why most organizations don't realize the gap yet.

The Shift from Chat to Agents

Prompt Injection: The Primary Threat

Why over 90% of major LLM frameworks lack reliable prompt injection defenses, how it works operationally, and what a successful attack looks like in a production enterprise environment.

How the Governance Gap Formed

Why the security review that approved your LLM deployment didn't include tool-use, service account permissions, or incident attribution — and how that gap compounds with each new agent deployment.

The Hidden Dynamics

Three Audit Postures

The Five-Step Security Program

Closing Question

The six questions your CISO cannot currently answer about your enterprise AI agents — and why the inability to answer them is itself a material security finding.