Home > Security and Compliance Blog > Cybersecurity Risk Management > AI Agents Are the Biggest Data Security Threat You’re Not Governing

AI Agents Are the Biggest Data Security Threat You’re Not Governing

by Patrick Spencer updated February 25, 2026 Cybersecurity Risk Management

Reading Time: 12 minutes

Something quietly changed in enterprise security over the past year, and most organizations missed it.

AI agents — autonomous systems that reason, act, and interact with enterprise resources independently — moved from pilot programs into production workflows. They’re adjusting cloud resource allocations, executing multi-step business processes, querying databases, and interacting with external APIs. They’re doing this with substantial independence, at machine speed, and often without continuous human oversight.

Table of Contents

And here’s the part that should concern every CISO: the security infrastructure governing these agents hasn’t kept pace with their deployment. Not even close.

Microsoft’s Cyber Pulse report confirms that more than 80% of Fortune 500 companies now deploy active AI agents, many built with low-code and no-code tools that put agent creation in the hands of business users, not just developers. The same report delivers a stark warning: these agents are “scaling faster than some companies can see them.” That visibility gap isn’t a minor inconvenience. Microsoft calls it a business risk requiring urgent governance and security.

The question is no longer whether your organization will use AI agents. It’s whether you’ll govern them before they create a breach you can’t contain.

5 Key Takeaways

AI Agents Are Scaling Faster Than Security Teams Can See Them. More than 80% of Fortune 500 companies now deploy active AI agents built with low-code and no-code tools. Microsoft’s Cyber Pulse report warns these agents are “scaling faster than some companies can see them,” creating a visibility gap the report explicitly calls a business risk. When autonomous systems can reason, act, and access enterprise resources independently, the absence of governance is not a gap — it’s an open door.
One in Three Organizations Considers Unsupervised AI Agent Data Access a Critical Threat. According to Proofpoint’s 2025 Data Security Landscape report, 32% of organizations identify unsupervised data access by AI agents as a critical threat. These agents often operate as highly privileged “superusers,” accessing sensitive data across cloud and hybrid environments with far less oversight than any human employee would receive.
AI Agents Can Be Weaponized Through Hidden Instructions — With Zero User Interaction. Trend Micro demonstrated that multi-modal AI agents can be manipulated through hidden instructions embedded in images or documents, causing sensitive data exfiltration with zero clicks from the user. Researchers on arXiv built an end-to-end exploit where a malicious blog page’s hidden instructions caused a RAG-based agent to retrieve secrets from its knowledge base and send them to an attacker-controlled server.
44% of Organizations Lack Adequate Oversight of Generative AI Use. Proofpoint found that 44% of organizations admit they do not have adequate oversight of generative AI use, including tools and agents. The Cloud Security Alliance reports a sharp awareness gap: 52% of C-suite executives self-report familiarity with AI technologies compared to just 11% of frontline staff — the very people using these tools daily.
The Governance Gap Is a Financial Time Bomb. The average data breach now costs $4.88 million (IBM Cost of a Data Breach Report, 2024). EU AI Act fines reach up to €35 million or 7% of global annual revenue. GDPR penalties hit €20 million or 4% of revenue. With AI privacy incidents surging 56.4% year-over-year (Stanford 2025 AI Index Report), organizations that lack demonstrable AI governance face compounding financial and legal exposure.

The Agentic Workspace Is Here — and Most Organizations Can’t See What’s Happening Inside It

Proofpoint’s 2025 Data Security Landscape report introduced a term that captures the current moment perfectly: the “agentic workspace.” It describes an operating environment where AI-driven productivity tools and autonomous agents handle sensitive data alongside humans. The report’s conclusion is blunt — many organizations “lack the visibility and controls to govern this emerging agentic workspace.”

The numbers behind that conclusion tell a sobering story. Nearly half of respondents identify data sprawl across cloud and hybrid environments as a top concern. Two in five organizations cite data loss via public or enterprise generative AI tools as a primary worry. Over one-third are concerned about sensitive data being used in AI model training. And 32% of organizations see unsupervised data access by AI agents — often operating as highly privileged “superusers” — as a critical threat.

What makes this especially dangerous is the gap between perception and reality. The Cloud Security Alliance found that 52% of C-suite executives self-report familiarity with AI technologies, but only 11% of staff say the same. That’s a governance chasm. The people making decisions about AI policy often overestimate their organization’s readiness, while the people actually using AI tools lack the understanding to use them safely.

Meanwhile, 44% of organizations admit they do not have adequate oversight of generative AI use. That includes the tools employees use daily and the agents increasingly embedded in enterprise workflows.

You Trust Your Organization is Secure. But Can You Verify It?

Read Now

How AI Agents Get Weaponized: Prompt Injection and Data Exfiltration

If the visibility gap were the only problem, it would be manageable. But AI agents don’t just create passive exposure. They can be actively weaponized — and the research proving it is both recent and alarming.

Trend Micro’s research on AI agent vulnerabilities demonstrated that multi-modal agents can be manipulated through hidden instructions embedded in images or documents. The attack requires zero user interaction. A document or image containing concealed prompts can cause an agent to exfiltrate sensitive data — including personal data like names, Social Security numbers, and contact details, financial information, protected health information, business secrets, authentication credentials, and confidential uploaded documents — without the user ever knowing it happened.

The attack surface extends to the web. Trend Micro showed that web-parsing agents can read embedded malicious prompts on websites, which instruct the agent to exfiltrate memory-stored data — such as API keys or contact details — to attacker-controlled destinations. If outbound traffic is allowed, the agent becomes an unwitting data pipeline to the attacker.

Researchers publishing on arXiv took this further. They built a complete end-to-end exploit against a retrieval-augmented generation (RAG) based AI agent that used tools to query an internal knowledge base containing sensitive project secrets and to retrieve external web content. A malicious blog page embedded hidden white-on-white instructions that human users couldn’t see. When the agent was asked to summarize the page, it ingested these instructions and executed them — retrieving the secret from its knowledge base, embedding it into a URL parameter, and sending it to an attacker-controlled server using the same web search tool it was designed to use for legitimate purposes.

The paper’s conclusion deserves attention from every security leader: current LLM agents with tool use and RAG exhibit a “fundamental vulnerability” to indirect prompt injection attacks, and built-in model safety features are insufficient without additional defensive layers.

This is not theoretical. These are demonstrated exploits against the exact type of agent architecture that enterprises are deploying today.

Three Vulnerabilities That Compound Into a Crisis

The AI agent risk landscape can be understood through three compounding vulnerabilities that together create unprecedented enterprise exposure.

The first vulnerability is excessive data access — the sheer amount of data agents can reach. AI agents require broad data access to function effectively. But that access creates a massive exposure surface. Organizations average 15,000 stale-but-enabled accounts with over 31,000 stale permissions (Varonis 2025 State of Data Security Report), and every AI agent deployed adds a non-human identity to this already sprawling attack surface. Traditional identity and access management was built for people, not machines. It often lacks the granularity to enforce least-privilege access for autonomous systems that can request, process, and transmit data at speeds no human user ever could.

The second vulnerability is uncontrolled data usage — the types of data agents can process and where that data ends up. Research shows that the largest group of companies — 27% — admit more than 30% of information sent to AI tools contains private data, including Social Security numbers, medical records, credit card information, and protected intellectual property. Another 17% have no visibility into what employees share at all. Once this data enters a public model’s training set, it cannot be retrieved, deleted, or controlled. The contamination is permanent.

The third vulnerability is agent manipulation — the exploitation of agents through prompt injection, malicious skills, and supply chain attacks. AI agents that interact with external services, plugins, and agent networks are vulnerable to the same kind of indirect prompt injection attacks that Trend Micro and the arXiv researchers demonstrated. A compromised AI skill can spread across agent networks within hours. Autonomous agents can be manipulated into exfiltrating credentials, accessing sensitive files, and creating compliance violations — all at machine speed, far faster than any human insider threat.

These three vulnerabilities don’t exist in isolation. They compound. An agent with excessive access that processes uncontrolled data and is vulnerable to manipulation isn’t just a risk — it’s a breach waiting to happen.

Shadow AI Makes Everything Worse

Layered on top of these structural vulnerabilities is the shadow AI problem. Nearly half of generative AI users rely on personal, unsanctioned AI applications that operate entirely outside organizational visibility. Employees routinely upload source code, regulated data, and intellectual property to these tools for summarization, debugging, and content generation — often without realizing the data may be used to train public models.

The scale is staggering. The average organization experiences 223 AI-related data policy violations per month, with source code accounting for 42% of incidents and regulated data representing 32% (Netskope Cloud and Threat Report 2026). AI privacy incidents increased 56.4% year-over-year (Stanford 2025 AI Index Report). And 98% of companies have employees using unsanctioned applications, averaging 1,200 unofficial apps per organization (Varonis 2025 State of Data Security Report).

Only 17% of organizations have technical controls that block access to public AI tools combined with DLP scanning. Another 40% rely on training and audits alone. And 13% lack any policies at all.

Blocking AI entirely is not the answer — and every organization that has tried knows it. Employees find workarounds. They use personal accounts. They upload data to free-tier tools from their phones. The result isn’t less AI use. It’s invisible AI use, which is far more dangerous.

The Governance Gap Is Wider Than Organizations Admit

Despite these escalating risks, most organizations remain unprepared. Only 12% have dedicated AI governance structures, while 55% lack any framework whatsoever. Only 9% achieve what analysts consider a “ready” level of AI governance maturity, despite 23% claiming to be “highly prepared” — a 14-point overconfidence gap that itself represents a risk.

Meanwhile, 86% of organizations lack visibility into AI data flows, and 45% cite pressure to deploy quickly as their biggest obstacle to governance. Among technical leaders, that figure jumps to 56%. The result is a pattern playing out across industries: organizations deploying AI agents into production while their governance, security, and compliance infrastructure is still designed for a world where humans were the only ones accessing sensitive data.

The regulatory environment isn’t waiting for organizations to catch up. Fifty-nine new data privacy regulations were enacted in the past year alone. The EU AI Act imposes fines of up to €35 million or 7% of global annual revenue for high-risk violations. GDPR penalties can reach €20 million or 4% of revenue. Sector-specific mandates under HIPAA, SOX, GLBA, and CMMC add further compliance obligations that intersect directly with how AI agents access, process, and transmit sensitive data.

Zero Trust for AI Agents: The Security Model That Fits

Microsoft’s Cyber Pulse report frames the solution in terms most security professionals already understand: Zero Trust. The report applies the same Zero Trust principles to agents that organizations apply to human users — least-privilege access, explicit verification of “who or what” is requesting access, and an assumption of compromise as a design principle.

This framework makes intuitive sense. AI agents are identities. They authenticate, they request access, they take actions. The fact that they aren’t human doesn’t make them less dangerous — it makes them more dangerous, because they operate at speeds and scales that human-focused security controls were never designed to handle.

Applying Zero Trust to the agentic workspace means every AI agent must be treated as a distinct identity requiring authentication and authorization. Access must be scoped to the minimum permissions required for each specific task. Every data interaction must be logged in an immutable audit trail. Anomaly detection must operate at machine speed to match the speed of agent operations. And outbound data flows must be governed to prevent exfiltration — whether initiated by a compromised agent, a manipulated prompt, or a misconfigured workflow.

Trend Micro’s research reinforces this approach, recommending robust access controls, advanced content filtering, and real-time monitoring to reduce data leakage and unauthorized actions. The arXiv researchers echo the same conclusion: built-in model safety features are not enough. Additional defensive layers are required.

What the Right Infrastructure Looks Like

Securing the agentic workspace requires infrastructure that operates at the data layer, not just the network layer. Network-level security — inspecting traffic as it passes through proxies — can detect that an employee visited an AI application. But it cannot govern what specific data an AI agent accesses within enterprise repositories, enforce granular policies about how that data is used, or provide the content-level audit trail that regulators increasingly demand.

The infrastructure needed to close the AI governance gap has several essential characteristics. It must provide a secure gateway between AI systems and enterprise data where zero-trust principles are enforced at every interaction. It must sandbox AI agent execution so that compromised plugins or skills cannot access resources beyond their authorized scope. It must extend existing governance frameworks — role-based and attribute-based access controls — to all AI interactions, including those initiated by autonomous agents. It must log every AI-data interaction in an immutable audit trail with user identity, timestamp, data accessed, and the AI system used. And it must detect anomalies at machine speed, flagging behaviors like an agent suddenly requesting large volumes of data it doesn’t normally access or attempting to transmit data to unusual destinations.

The Kiteworks Private Data Network is purpose-built for this challenge. Its AI Data Gateway creates a zero-trust bridge between AI systems and enterprise data repositories, ensuring data never leaves the protected environment. Its Secure MCP Server sandboxes AI agent execution with OAuth 2.0 authentication, anomaly detection, and governance framework enforcement. And its unified multi-channel governance covers file sharing, managed file transfer, email, web forms, APIs, and AI interactions under a single policy engine with a single immutable audit trail.

For organizations in regulated industries, deployment flexibility matters. Kiteworks supports on-premises, private cloud, hybrid, and FedRAMP High environments — with pre-mapped compliance controls for HIPAA, SOX, GDPR, CCPA, CMMC, NIST CSF, ISO 27001, and the EU AI Act.

The Cost of Waiting Is Measured in Breaches, Fines, and Permanent Damage

The financial case for AI data governance is unambiguous. The average data breach costs $4.88 million. In healthcare, that figure reaches $10.93 million (IBM Cost of a Data Breach Report, 2024). EU AI Act fines for high-risk violations can reach €35 million or 7% of global annual revenue. GDPR penalties hit €20 million or 4%. Software supply chain attack losses are projected at $60 billion industry-wide.

But the most damaging cost may be the hardest to quantify: intellectual property permanently embedded in public AI training sets. Once proprietary data enters a public model, it cannot be retrieved, deleted, or controlled. The competitive damage is irreversible.

Organizations that move now to implement data-layer AI governance don’t just reduce risk. They gain a competitive advantage — the ability to adopt AI faster, more confidently, and with the documented compliance evidence that regulators, auditors, and customers increasingly require.

Three Actions Every Organization Should Take Now

First, get visibility into what’s already happening. You cannot govern what you cannot see. Deploy monitoring that captures AI-data interactions across all channels — not just web traffic, but file sharing, email, APIs, and agent workflows. Identify where sensitive data is flowing to AI systems and whether those flows are authorized, logged, and compliant. If 44% of organizations admit they lack adequate oversight, assume yours might be among them until you can prove otherwise.

Second, extend Zero Trust to every AI agent. Treat every AI agent as a non-human identity that requires the same authentication, authorization, and access controls as a human user — with the additional safeguards that machine-speed operations demand. Implement least-privilege access. Sandbox agent execution. Monitor for anomalous behavior. Ensure that no agent can access data beyond what its specific task requires.

Third, enable AI with governance built in — don’t block it. The organizations that try to prohibit AI use entirely will lose that battle. Employees will find workarounds, and shadow AI will proliferate. The sustainable approach is infrastructure that lets employees use AI tools productively while ensuring sensitive data never leaves the protected environment. Governance that happens automatically, behind the scenes, without impeding productivity, is the only model that scales.

The enterprise AI revolution isn’t coming. It’s here. The question is whether your organization will govern it — or be governed by its consequences.

To learn how Kiteworks can help, schedule a custom demo today.

Frequently Asked Questions

Traditional AI chatbots respond to prompts within a confined interface and don’t take independent actions. AI agents are fundamentally different — they reason, plan, and act autonomously across enterprise systems, querying databases, calling APIs, executing multi-step processes, and transmitting data with minimal human oversight. That autonomy is what creates security risk. An agent isn’t just answering a question; it’s operating as a privileged non-human identity with access to sensitive data at speeds and scales that human-focused security controls were never designed to handle. When compromised or manipulated, an agent can exfiltrate thousands of records before any alert fires.

Indirect prompt injection embeds hidden instructions in content an AI agent processes — a document, image, or web page — that override the agent’s original programming. When the agent processes the content, it executes the attacker’s instructions instead. Trend Micro demonstrated this causes data exfiltration with zero user interaction. ArXiv researchers built a working exploit that caused a RAG-based agent to retrieve internal secrets and transmit them to an attacker’s server. Traditional security tools — DLP, firewalls, endpoint protection — cannot stop this because the exfiltration happens through legitimate, authorized channels the agent was designed to use.

Shadow AI refers to employees using unsanctioned AI tools — personal accounts, free-tier products, browser extensions — without organizational knowledge or controls. Research shows 98% of companies have employees using unsanctioned applications, and the average organization experiences 223 AI-related data policy violations per month. The exposure is permanent: when employees upload source code, medical records, financial data, or intellectual property to a public AI model, that data may be incorporated into the model’s training set. It cannot be retrieved, deleted, or controlled afterward. Blocking AI entirely doesn’t solve this — it drives usage underground, creating invisible exposure instead of managed exposure.

Zero Trust for AI agents follows the same principles applied to human users, with additional safeguards for machine-speed operations. Every agent is treated as a distinct identity requiring explicit authentication and authorization. Access is scoped to the minimum data required for each specific task — not broad repository access. Every data interaction is logged in an immutable audit trail. Anomaly detection runs at machine speed to flag unusual behavior like bulk data requests or transmissions to unexpected destinations. And outbound data flows are governed so compromised agents cannot exfiltrate data through authorized channels. Role-based and attribute-based access controls must extend explicitly to non-human identities — most IAM systems weren’t built with agents in mind.

Several major frameworks now intersect directly with AI agent governance. The EU AI Act imposes fines up to €35 million or 7% of global annual revenue for high-risk violations and requires documented human oversight and data traceability. GDPR requires lawful basis and appropriate safeguards for any personal data processing, including by automated agents. HIPAA requires access controls and audit trails for every system that touches protected health information. CMMC requires FedRAMP-authorized solutions for federal contractors handling controlled unclassified information. GLBA mandates safeguards over financial data access. With 59 new data privacy regulations enacted in the past year alone, AI governance has moved from a best practice to a compliance requirement across industries.

Additional Resources