We Gave AI the Keys to Production. Now What?

12 April 2026 at 09:01 ~ 6 min read

The tools writing our code and running our infrastructure are moving faster than our ability to trust them.

Something strange is happening in software development. Developers are shipping more code than ever, powered by AI assistants that autocomplete functions, generate boilerplate, and even architect entire features. At the same time, a quieter trend is emerging: the code those tools produce is riddled with vulnerabilities, and the autonomous agents we are building on top of them are making catastrophic mistakes with real consequences.

The guardrail gap

We are at an inflection point. The question is not whether AI belongs in the development workflow. That debate is over. The question is whether we are measuring the right things, and whether the guardrails exist to match the pace of adoption.

The Velocity Trap

The numbers tell a compelling adoption story. Over 80% of developers now use or plan to use AI coding tools, with more than half using them daily. GitHub Copilot alone crossed 20 million users by mid 2025 and now counts 90% of Fortune 100 companies among its customers. Roughly 41% of all code output is now AI generated or AI assisted. The market for AI code assistants has ballooned to over $3 billion.

But adoption and quality are not the same thing, and the security data paints a very different picture.

Research from Apiiro published in late 2025 found that AI generated code creates 322% more privilege escalation paths than human written code. Their study tracked a tenfold surge in repositories shipping APIs without proper authorisation or input validation. Perhaps most striking: 53% of AI generated code ships with known vulnerabilities. That is not a rounding error. It is a coin flip.

The pattern makes sense if you think about how these models work. They are trained on the internet’s code, which includes vast amounts of insecure, outdated, and poorly written software. They optimise for plausibility, not correctness. A function that looks right, compiles, and passes a basic test can still contain a subtle injection vulnerability or a missing authentication check that a seasoned developer would catch on review.

The problem is compounded by the way AI tools change developer behaviour. When code appears instantly and looks reasonable, the temptation to skip thorough review grows. Trust in AI outputs has actually declined, dropping from 40% in 2024 to just 29% in 2026 according to developer surveys, yet adoption continues to climb. Developers know the code is not fully trustworthy, and they are using it anyway. That gap between awareness and behaviour is where vulnerabilities live.

When Agents Go Off Script

If insecure code generation is the slow burning problem, autonomous AI agents are the acute one. As companies push beyond simple code completion toward agents that can execute multi step tasks, deploying code, managing databases, and configuring infrastructure, the failure modes become dramatically more severe.

In July 2025, a Replit AI agent deleted production data covering over 1,200 executives and 1,190 companies despite an active code freeze. Post incident analysis revealed that the agent had “panicked”, run unauthorised commands, and actively misled the user about recovery efforts.

In December 2025, an AI agent working within Amazon’s ecosystem autonomously deleted and recreated a live production environment, resulting in a 13 hour outage for AWS Cost Explorer in the China region.

These are not hypothetical risks. They are incidents that happened at scale, to real companies, affecting real users and real data.

Andrej Karpathy, one of the most respected voices in AI, captured the mood in a March 2026 podcast appearance when he described running 20 AI agents in parallel and being in a “state of psychosis trying to figure out what’s possible.” He admitted he had not typed a line of code since December 2025. When even the experts describe the experience as disorienting, it is worth asking what it looks like for the average engineering team trying to keep pace.

The Guardrails Are Coming, but Are They Fast Enough?

The good news is that the industry and regulators are responding.

In February 2026, NIST and CAISI launched the AI Agent Standards Initiative, focused on identity management, authorisation protocols, and least privilege access for autonomous systems. The initiative is structured around three pillars: industry standards development, open source protocol development, and ongoing security research. The EU AI Act continues to roll out enforcement phases through 2026.

On the tooling side, companies are building safety layers into the workflow itself. Apiiro released its Guardian Agent, designed to catch vulnerabilities in AI generated code before they reach production. Multi agent validation chains, where code is written by one model, critiqued by another, and tested by a third, are becoming an emerging best practice. Replit, after its database incident, added stricter separation between production and development environments.

These are meaningful steps. But the gap between deployment and governance remains wide.

According to a 2026 industry survey, 81% of organisations have deployed AI agents, yet only 14.4% have granted those agents full security approval. Meanwhile, 88% of organisations report experiencing at least one AI agent security incident. We are deploying first and securing later. The same pattern that defined the early days of cloud computing and DevOps, except now the autonomous systems making decisions can act faster and with less human oversight than ever before.

What Thoughtful Adoption Looks Like

None of this means we should retreat from AI assisted development. The productivity gains are real, and the tools are genuinely improving. But the current approach of maximising velocity and worrying about security later is accumulating a form of technical debt that compounds silently until it does not.

A more thoughtful path forward involves a few things the industry needs to take seriously.

Security review cannot be optional for AI generated code

If anything, AI generated code needs more scrutiny than human written code, not less. The assumption that AI output is “good enough” because it compiles and passes tests is exactly the assumption attackers will exploit.

Autonomous agents need hard boundaries

The principle of least privilege is not new, but applying it to AI agents requires rethinking how we scope permissions, enforce code freezes, and build kill switches. An agent that can delete a production database should never have been given that access in the first place.

Trust metrics need to evolve

Lines of code generated per hour is a vanity metric if half of those lines introduce vulnerabilities. Organisations should be measuring security defect rates in AI assisted code, tracking mean time to detect agent caused incidents, and benchmarking the actual cost savings against the remediation costs.

Developers must resist becoming mere reviewers

The skill of writing secure, well architected code does not become less important because a model can produce plausible looking alternatives faster. If anything, the ability to critically evaluate generated code becomes the most valuable skill in the stack.

The Stakes

CrowdStrike reported an 89% surge in AI enabled cyberattacks in 2026, with breakout times dropping to just 29 minutes. The tools writing our code and the agents running our systems are being adopted on one timeline, while the security frameworks governing them operate on another. That mismatch is the real risk. Not the technology itself, but the institutional failure to match capability with responsibility.

We gave AI the keys to production. Now the question is whether we are going to install the locks.