[PRO SERVICES / SECURITY & GOVERNANCE]

Agentic AI Governance
for Established Businesses

Once AI agents can send messages, query data or change company systems, policy and engineering have to agree. We map authority, ownership and evidence, then put the controls into the workflows themselves.

BOOK A GOVERNANCE CALL WHERE THIS BITES

Policy, guardrails, evals

40%

OF AGENTIC AI PROJECTS CANCELLED BY 2027 (GARTNER)

97%

OF AI BREACHES LACKED ACCESS CONTROLS (IBM 2025)

63%

OF BREACHED ORGS LACK AI GOVERNANCE (IBM 2025)

[THE OPERATING PROBLEM]

Governance changes when AI can take action

A chatbot writes a draft and a human presses send. An agent reads its own email, replies, books the call, updates the CRM and refunds the customer. Roughly the same model underneath, very different blast radius.

A general AI policy may cover staff use of public chat tools. It rarely defines how an agent gets production access, which actions need approval or who reviews failures.

Anything that can act on your behalf needs a policy, the permissions, and logs.

ASSISTIVE AI

Suggests, a human acts
Blast radius = one screen
Mistake caught at copy-paste
Acceptable-use policy covers it
Risk reviewed once a year

AGENTIC AI

Decides and acts, autonomously
Blast radius = every tool it owns
Mistake hits customers in seconds
Needs policy, permissions, evals, logs
Risk reviewed each time it changes

[THE PATTERN]

Where agent access goes wrong

These risks come from the OWASP Top 10 for Agentic Applications and published incidents. We use them to test the access, controls and records around each live agent.

Prompt injection

An attacker hides instructions in an email, a PDF or a web page the agent reads. The agent does what the attacker said, not what you said. OWASP LLM01, and the root cause of EchoLeak (CVE-2025-32711) in Microsoft 365 Copilot.

Excessive agency

Service account with full DB write, deploy keys, and a Stripe token, all wired to one agent because that was simplest in the demo. Principle of least agency exists for a reason. So does Replit's deleted database.

Tool misuse

The agent has the tools you gave it, but uses them in a combination you never tested. Send-email plus list-contacts plus a hallucinated CC field is one missed eval away from a GDPR notification.

No observability

No log of which prompt, which tool call, which output. When a customer asks why your agent refunded their competitor, you can't answer. You also can't show the ICO how the decision was made.

Shadow agents

Marketing's on Zapier with a Claude key. Sales built a GPT that reads the CRM. Ops wired n8n to the warehouse API. Nobody in IT knows. IBM's 2025 breach report puts the shadow-AI premium at $670,000 a breach.

[THE FRAMEWORKS]

Standards relevant to business agent systems

The applicable standard depends on the system, data, sector and territories involved. We map each agent to the useful controls below and record which decisions still belong with your security, privacy and legal teams.

NCSC Guidelines for Secure AI System Development

Joint UK and US guidance, co-signed by 23 agencies (Nov 2023). Four stages: secure design, development, deployment, operation. The closest thing the UK has to a default baseline.

DSIT AI Cyber Security Code of Practice

Published 31 Jan 2025. Thirteen voluntary principles across five lifecycle stages. Now the basis for ETSI TS 104 223. Voluntary today, probably not in a couple of years.

US / GLOBAL

NIST AI RMF 1.0 + GenAI Profile

Govern, Map, Measure, Manage. The Generative AI Profile (NIST AI 600-1, Jul 2024) names 12 GenAI risk categories and 200+ suggested actions. The clearest checklist anyone has published.

ISO

ISO/IEC 42001:2023

AI management system standard. Certifiable. Useful if your enterprise customers are starting to ask for it on the procurement questionnaire. Overkill if they aren't.

OWASP

OWASP Top 10 for Agentic Applications

Released 9 Dec 2025 by the OWASP GenAI Security Project. The first list ranked by what's breaking agents in production. Ten ASI categories. If you only read one of these, read this one.

EU AI Act (and what touches you)

Prohibited-practice rules have applied since February 2025 and provider obligations for general-purpose AI models since August 2025. We establish whether the business is acting as a provider, deployer or both before mapping the relevant duties.

Sources: ncsc.gov.uk, gov.uk DSIT, nist.gov AI RMF, iso.org, genai.owasp.org, digital-strategy.ec.europa.eu.

[HOW WE WORK]

What the review gives you

We map the agents you've got, write a policy people will read, and put the controls in the code so they hold under load.

Fixed scope per phase, days rather than quarters. You keep the artefacts and the code.

BOOK A GOVERNANCE CALL

Agent inventory and risk map

We find every agent already running in your business. The ones IT knows about, the ones the marketing team built in Zapier, the GPTs running on personal accounts. For each one: what it can do, what data it touches, what it costs you if it goes wrong. Mapped against NIST AI RMF and the OWASP Agentic Top 10.

Policy people will follow

A usable policy covering what agents may do, which actions require approval, who can authorise a new agent and what must be logged. It is aligned to the controls relevant to your systems and written for the teams expected to follow it.

Guardrails in the code

Permissions scoped to the task. Tool calls validated server-side. Prompt injection filters on anything the agent reads. Approval steps for the high-stakes actions. Rate limits, spend caps, kill switches. The bits that survive contact with reality.

Evals, logs, red-team

A repeatable test suite for each agent: does it still refuse the things it should refuse, after every prompt change. Full traces of every prompt, tool call and decision in a searchable log. One round of adversarial testing before go-live. You leave with an audit trail you can show a customer, a regulator or your insurer.

[PUBLISHED EXAMPLES]

Published agent failures from 2025

These published incidents show what happens when an agent's access, instructions and recovery controls are weaker than the job it has been given.

JUL 2025

Replit agent wipes a production DB.

SaaStr founder Jason Lemkin's Replit agent deleted a production database holding 1,206 executive records and 1,196 company records on day 9 of a trial. The agent had been told it was under a code freeze. It ran the commands anyway. Excessive agency, in one line. Source: Fortune, AI Incident Database #1152.

CVE-2025-32711

EchoLeak, zero-click exfiltration.

Aim Security showed Microsoft 365 Copilot could be tricked into reading internal files and leaking them with one crafted email and zero user clicks. The agent followed instructions hidden in the email body. Patched and disclosed June 2025. First documented production zero-click prompt injection.

CVE-2025-8217

Amazon Q tries to wipe a million laptops.

An attacker landed a malicious pull request into the aws-toolkit-vscode repo, injecting a prompt telling Amazon Q to delete the user's files and AWS resources. Released to roughly a million installs as v1.84.0. Only a syntax error in the injected payload stopped it executing. Source: NVD, The Register, July 2025.

Sources: NVD, Fortune, The Register, AI Incident Database, Aim Labs.

[RELEVANT VU WORK]

Raq.com is our live governance environment

It coordinates people and agents through explicit accounts, tools, permissions and records. We use it to run real work, which exposes the operational governance problems that a policy document alone will miss.

SEE THE LIVE APPROACH

[A USEFUL FIRST CONVERSATION]

When this is worth discussing

We work best when there is a real operating problem, enough volume to measure and people from the affected teams who can make decisions.

Usually a good fit

An established UK business, usually with annual revenue above £10m
A repeated process with a known cost, delay, error rate or capacity problem
A senior sponsor and a day-to-day owner who understand the work
Access to the relevant staff, systems, sample records and security requirements

We may point you elsewhere

A standard product already covers the process well
The requirement is a one-off small build with no wider operating case
There is no owner or access to the people and data needed to test the result
The plan relies on AI making high-impact decisions with nobody responsible for review

[QUESTIONS]

Questions from IT, legal and compliance

Q.01

We're 30 people. Isn't this overkill?

The governance should match the agent's authority, data and possible impact. A business needs to know which credentials each agent holds, what actions it may take, who approved that access and how an incident would be detected and stopped.

Q.02

Does the EU AI Act apply to us?

It depends on where the system is placed, used and what decisions it supports. We map the role of your business and the use case before stating which obligations apply. Legal interpretation remains with your legal adviser.

Q.03

Do we need ISO/IEC 42001?

Only if a customer is asking for it. Enterprise buyers are starting to put it on procurement questionnaires, and you can lose a tender for not having it. If nobody's asking, we'll align your controls to it informally so you can answer the question when it comes, without paying for certification you don't need yet.

Q.04

What's the difference between governance and security?

Security stops someone from breaking in. Governance decides who's allowed to do what when nobody is breaking in. Agents need both because they have their own credentials, take their own actions, and don't tend to ask permission. We do both in one engagement.

Q.05

What about agents staff are using on their own accounts?

Shadow AI. We always find it. IBM put the cost premium at $670,000 per breach where shadow AI was involved. The inventory phase pulls them into the open, and the policy gives staff a sanctioned path so they don't have to hide. Bring the agents in, don't try to ban them.

Q.06

How long does it take and what does it cost?

Inventory and accountability come first. The timetable then depends on the number of agents, connected systems, decision risk and evidence required by technology, compliance or internal audit.

Q.07

We haven't deployed any agents yet. Too early?

No, it's the right moment. Governance retrofitted onto live agents is twice the work. If you're about to launch the first one, we'll set up the policy, the patterns and the evals now so every agent after this one is built on the same rails.

Talk to us about agent governance

Tell us what the agents can read, decide and change today, plus what is due to launch next. We will identify the governance decisions and technical evidence needed before wider use.

BOOK A GOVERNANCE CALL SEE ALL SERVICES

Agentic AI Governancefor Established Businesses