Prompt injection is exploiting enterprise AI's biggest design flaws by targeting agents, RAG pipelines and model routers

Read full story on VentureBeat
Share
Prompt injection is exploiting enterprise AI's biggest design flaws by targeting agents, RAG pipelines and model routers
AI disclosure

Summary

<p>In the past two years, businesses have been trying to fit large language models (LLMs) into support, analytics, development, and internal automation like never before. </p><p>Along with the increasing adoption of <a href="https://venturebeat.com/technology/agentic-ai-solved-coding-and-exposed-every-other-problem-in-software-engineering">AI technology</a>, another trend is gaining momentum — cybercriminals are taking advantage of the disconnect between assumptions about LLMs and their actual characteristics.</p><p>In 2025 and 2026, several independent sources have highlighted the same trend: Prompt injection remains one of the most impactful and widely demonstrated attack vectors against LLM systems. The <a href="https://genai.owasp.org/llm-top-10/">OWASP LLM Top 10</a> (2025) lists prompt injection as LLM01, identifying it as the most critical category of LLM‑specific vulnerabilities, for the <a href="https://owasp.org/www-project-top-10-for-large-language-model-applications/assets/PDF/OWASP-Top-10-for-LLMs-v2025.pdf">second consecutive edition</a>. OWASP&#x27;s ranking reflects the fact that LLMs still struggle to reliably separate instructions from data, making them susceptible to manipulation through crafted inputs.</p><p>CrowdStrike&#x27;s 2026 <a href="https://www.crowdstrike.com/en-us/press-releases/2026-crowdstrike-global-threat-report/">Global Threat Report</a> — built on frontline intelligence across more than 280 tracked adversaries — documented that threat actors injected malicious prompts into legitimate generative AI tools at more than 90 organizations in 2025. They then used those injections to generate commands that stole credentials and cryptocurrency. The report stated it plainly: <i>&quot;Prompts are the new malware.&quot;</i> AI-enabled adversaries increased their overall attack volume by 89% year-over-year, with prompt injection working as both an entry point and a force multiplier.</p><p>Real‑world incidents illustrate the operational impact. In August 2024, <a href="https://promptarmor.substack.com/p/data-exfiltration-from-slack-ai-via">researchers at PromptArmor</a> disclosed a prompt injection vulnerability in Slack AI that allowed an attacker to exfiltrate data from private Slack channels they had no access to — including API keys shared in private developer channels — by placing a malicious instruction in a public channel or embedding it in an uploaded document. </p><p>In June 2025, <a href="https://www.aim.security/lp/aim-labs-echoleak-blogpost">researchers at Aim Security</a> disclosed EchoLeak (CVE-2025-32711, CVSS 9.3), the first documented zero-click prompt injection exploit against a production AI system, targeting Microsoft 365 Copilot. By sending a single crafted email, no user interaction required, an attacker could cause Copilot to access internal files and transmit their contents to an attacker-controlled server. </p><p>Both vulnerabilities <a href="https://arxiv.org/abs/2509.10540">were patched</a>. These incidents underscore the fact that prompt injection is not a theoretical weakness but a practical, repeatable threat organizations must address as they deploy AI systems at scale.</p><p>Prompt injection techniques have undergone major evolutions over recent years, now targeting multi-agent architecture, retrieval-augmented generation (RAG) pipelines, model routers, and long-term memory capabilities.</p><h2>The e<b>nterprise challenge: Too much trust </b></h2><p>Businesses <a href="https://venturebeat.com/orchestration/mcp-solved-tool-calling-a2a-solved-coordination-what-solves-transport">deploy LLMs</a> to process instructions, summarize information, and trigger automated workflows, but it is difficult for LLMs to tell:</p><ul><li><p>I<!-- -->nstructions from data</p></li><li><p>I<!-- -->nformation from context</p></li><li><p>C<!-- -->ontext from metadata</p></li><li><p>U<!-- -->ser intent from metadata</p></li></ul><p>This creates an opportunity for attackers to manipulate and influence the model&#x27;s behavior, either directly or indirectly.</p><h2><b>Modern prompt injection</b></h2><p><b>Cross-model prompt injection</b></p><p>LLM use is a common practice among enterprises. Attackers corrupt the output of a particular model, knowing well that other models would be processing the content. Hence, the corruption propagates through all AI systems.</p><p><b>RAG supply chain poisoning</b></p><p>A<!-- -->ttackers create malicious information — documentation, blog articles, GitHub READMEs. Then they wait until this malicious information is ingested in enterprises&#x27; RAG pipelines, then use it as an attack vector.</p><p><b>Agent hijacking</b></p><p><a href="https://venturebeat.com/security/claude-mythos-exposed-a-hard-truth-your-enterprise-patching-process-is-way-too-slow">AI agents</a> have evolved to the point where they can send emails, modify cloud infrastructure, execute code snippets, and interact with internal corporate systems. It takes just a single instruction to make agents act differently in a harmful manner.</p><p><b>Context overflow attacks</b></p><p>With the help of million-token context windows, attackers place malicious code within the document and hope that an LLM will stumble upon it and execute it, thus overriding all previous instructions.</p><p><b>Memory poisoning</b></p><p>Due to the implementation of long-term memory in LLMs, attackers can inject instructions that permanently reconfigure their state.</p><p><b>Model‑router manipulation</b></p><p>Enterprises increasingly use model routers to select between multiple LLMs. Attackers craft prompts that force routing to the weakest or least‑guarded model.</p><h2><b>Why this matters for business leaders</b></h2><p>Prompt injection is not a theoretical problem. It directly affects:</p><ul><li><p>C<!-- -->ustomer‑facing systems (chatbots, support agents)</p></li><li><p>I<!-- -->nternal copilots (developer tools, security assistants)</p></li><li><p>A<!-- -->utomation workflows (ticketing, cloud operations, HR processes)</p></li><li><p>D<!-- -->ata governance (RAG pipelines, knowledge bases)</p></li></ul><p>The risk is no longer limited to &quot;the model said something it shouldn&#x27;t.&quot;</p><p>In 2026, prompt injection can:</p><ul><li><p>T<!-- -->rigger unauthorized actions</p></li><li><p>L<!-- -->eak sensitive data</p></li><li><p>C<!-- -->orrupt internal workflows</p></li><li><p>M<!-- -->anipulate analytics</p></li><li><p>A<!-- -->lter business logic</p></li><li><p>C<!-- -->ompromise multi‑agent systems</p></li></ul><p>The attack surface has expanded dramatically.</p><h2><b>What enterprises should do now</b></h2><p><b>1. Constrain model permissions</b></p><p>Limit what the model can do, not just what it should do.</p><p><b>2. Segment untrusted content</b></p><p>Treat all external data — including RAG sources — as potentially hostile.</p><p><b>3. Monitor tool invocation</b></p><p>Require human approval for high‑impact actions.</p><p><b>4. Validate content provenance</b></p><p>Ensure RAG pipelines don&#x27;t ingest poisoned external content.</p><p><b>5. Harden model routers</b></p><p>Prevent attackers from forcing routing to weaker models.</p><p><b>6. Treat LLMs as untrusted components</b></p><p>This mindset shift is the foundation of modern AI security.</p><h2><b>The bottom line</b></h2><p>Prompt injection remains the most effective way to compromise enterprise AI systems because it exploits the fundamental way LLMs interpret text. Until organizations treat LLMs as untrusted interpreters — not autonomous decision‑makers — prompt injection will continue to dominate the AI threat landscape.</p><p><i>Julie Brunias is an AI Security Architect.</i></p>

Original reporting

Open original source

Related coverage

Read full article on VentureBeat

Get the AFBytes Brief

Major stories, AI-assisted analysis, and what to watch next. Free, monthly, unsubscribe anytime.