You've deployed AI agents. Now who's governing them?

← Back to Perspectives

By mid-2026, 54% of enterprises have at least one AI agent running in production. The agents are booking meetings, triaging cases, processing invoices, handling procurement approvals. They are making decisions at a pace and volume that no human team could match.

Only 21% of those enterprises have a mature governance model for them.

This is not a regulatory compliance problem. Regulation for autonomous AI is still emerging in most jurisdictions. This is an operating model problem — and it is the single largest unaddressed risk in most enterprise AI programmes right now.

What governing an AI agent actually means

Most organisations have conflated AI governance with AI compliance. They have ethics frameworks, they have data protection addenda, they have vendor contracts with liability clauses. None of that answers the operational question: when this agent makes a bad decision at 3 a.m. on a Friday, who knows, who is responsible, and what do they do?

Governing an AI agent in production requires four things, and they need to be designed before deployment, not retrofitted after the first incident.

Decision rights

Every action the agent can take should have a defined authority level. Some decisions — customer communication, internal routing, data retrieval — can be fully autonomous. Others — refunds above a certain value, external commitments, exceptions to policy — should require human approval. The boundary needs to be explicit, documented, and enforced in the system design, not left to the agent's discretion.

Escalation thresholds

Agents encounter situations their training did not cover. What happens then? Most deployments handle this poorly: the agent either fails silently, produces a hallucinated response, or loops. A well-governed agent has defined escalation paths — to a human, to a supervisor queue, to a rejection state — that trigger when the agent's confidence or authority is insufficient. These thresholds need calibration against real operational data, not vendor defaults.

Audit trails

An AI agent that cannot explain what it did and why is ungovernable. Every material action should be logged with sufficient context to reconstruct the decision: what inputs the agent received, what tools it used, what output it produced, and what the confidence level was. This is not about blame allocation after failures. It is about continuous improvement, regulatory readiness, and the ability to identify drift before it becomes a crisis.

Drift monitoring

AI agents degrade over time. The world changes, your data changes, your processes change, and an agent calibrated in January is not the same agent in October. Most organisations do not have systematic monitoring for agent performance drift. They notice it when customer complaints spike or when someone in the business realises the agent has been routing incorrectly for weeks. By then, the problem is already embedded.

The cancellation paradox

Gartner projects that over 40% of agentic AI projects will be at risk of cancellation by 2027. This sits alongside a separate finding that 80% of enterprises with deployed AI agents report measurable return on investment.

These numbers are not contradictory. They describe the same situation from different vantage points.

The ROI is real but fragile. Organisations are seeing genuine efficiency gains — faster processing, fewer handoffs, lower cost per transaction. But the governance infrastructure to sustain those gains is absent. When the first significant failure occurs — a mis-handled customer, a regulatory audit, an edge case that produces a wrong output at scale — organisations without governance infrastructure have no choice but to pull the agent back. The ROI disappears with it.

Projects that survive are not necessarily the ones with better technology. They are the ones where someone designed the operating model before they deployed the agent.

Why governance gets skipped

The pattern is consistent across the programmes we see. Governance gets designed out of the build for one of three reasons.

The first is time pressure. The pilot worked and the business wants it scaled. The six-week governance design phase gets compressed into a two-week addendum that produces documentation nobody reads.

The second is ownership ambiguity. The technology team built the agent. The business operations team runs the process. The risk team owns the framework. Nobody owns the intersection. When the governance design starts requiring decisions about authority levels and escalation paths, everyone defers to everyone else.

The third is the misplaced assumption that compliance equals governance. An approved vendor, a signed DPA, and an ethics checklist give organisations a sense of oversight that does not translate into operational control of a deployed agent.

What to do about it

If you have AI agents in production without the four governance elements above, the priority order is: audit trails first, escalation thresholds second, decision rights third, drift monitoring fourth. Audit trails are retrospective, so you can build them quickly without touching the agent's behaviour. Escalation thresholds come next because they protect against the worst failure modes. Decision rights and drift monitoring are longer builds but prevent the structural failures that cause cancellation.

If you are planning an agentic deployment, build the governance design in parallel with the technical architecture. The governance model should be in a first draft before you write the first line of agent code. It will change — the operational reality of a deployed agent always reveals gaps in the design — but you need a baseline to change from.

The organisations that sustain their AI agent programmes through the next two years will not necessarily have deployed first or moved fastest. They will be the ones that treated governance as a design problem, not a compliance afterthought.