AI_SAFETYarxiv_cscr4 Jun 2026

arXiv: WebMCP Tool Surface Poisoning: Runtime Manipulation Attacks on LLM Agents

AI_SAFETY. Sourced from arxiv_cscr, summarised by Matproof.

AI Analysis

What changed and what to do.

A new research paper published on arXiv, titled "WebMCP Tool Surface Poisoning: Runtime Manipulation Attacks on LLM Agents," identifies a novel vulnerability in large language model (LLM) agents that use the Model Context Protocol (MCP) to interact with external tools. The study demonstrates how attackers can poison the tool surface—the metadata and descriptions that guide an LLM's tool selection—at runtime, causing the agent to misuse or bypass security controls. This is not a software patch but a disclosure of a new attack vector that exploits how LLMs interpret tool definitions, potentially leading to unauthorized data access or actions.

This finding directly affects any organization deploying LLM agents that rely on MCP or similar dynamic tool-calling frameworks, particularly in regulated sectors such as finance, healthcare, and legal services. Compliance teams in these industries must assess whether their AI systems use runtime tool descriptions that could be manipulated by external inputs or untrusted data sources. The risk is highest for agents that process user-provided content or interact with third-party APIs without strict validation.

Compliance teams should immediately review their AI system architectures to determine if tool surfaces are dynamically generated or modifiable at runtime. They should implement static, immutable tool definitions where possible, and add integrity checks to verify tool metadata before each LLM call. Additionally, teams should update their AI risk registers to include this attack vector and coordinate with security teams to test for tool surface poisoning in their current deployments. Monitoring for updates from the EU AI Office on this specific vulnerability is also recommended.

View original at arxiv_cscr →

This summary is AI-generated for orientation purposes. For regulatory action, always consult the original source linked above.

More AI_SAFETY updates

Latest in AI_SAFETY.

arxiv_cscr4 Jun 2026

arXiv: Will the Agent Recuse Itself? Measuring LLM-Agent Compliance with In-Band Access-Deny Signals

This paper, published on arXiv, presents a study on whether large language model (LLM) agents will comply with in-band access-deny signals—essentially, instructions embedded in a system’s output that…

arxiv_cscr4 Jun 2026

arXiv: Robust Ensemble of Selectively Strengthened and Augmented Predictors

This paper, published on arXiv, proposes a new technical framework called "Robust Ensemble of Selectively Strengthened and Augmented Predictors" (RESSAP) for improving the safety and reliability of…

arxiv_cscr4 Jun 2026

arXiv: SecRL-Prune: Structured Reinforcement Learning-Based Pruning of CodeLLMs for Preserving Adversarial Code Mutation

This paper, published on arXiv, introduces SecRL-Prune, a new technical framework for pruning large language models used in code generation. The method uses reinforcement learning to selectively…

arxiv_cscr4 Jun 2026

arXiv: Steering LLM Viewpoints through Fabricated Evidence Injection

A new preprint from arXiv, titled "Steering LLM Viewpoints through Fabricated Evidence Injection," demonstrates a novel attack vector against large language models. The research shows that by…

← Back to all updates

Live regulatory monitoring

Never miss a compliance update.

Get weekly digests of DORA, NIS2, GDPR, MaRisk, and ISO 27001 changes — straight to your inbox. Free.

No spam. Weekly digest only. Unsubscribe anytime.

DORANIS2GDPRMaRiskISO 27001

Map this to your controls

Connect regulatory changes to your compliance work.

Matproof maps every regulator update directly to your controls and surfaces the ones that affect your organisation — across 21 frameworks.

Book a Demo Browse all updates