AI_SAFETYarxiv_cscr14 May 2026

arXiv: Talk is (Not) Cheap: A Taxonomy and Benchmark Coverage Audit for LLM Attacks

AI_SAFETY. Sourced from arxiv_cscr, summarised by Matproof.

AI Analysis

What changed and what to do.

This publication, a pre-print from arXiv dated May 14, 2026, introduces a new taxonomy and benchmark coverage audit for attacks on large language models (LLMs). It systematically categorises the types of adversarial inputs that can cause LLMs to produce harmful, biased, or non-compliant outputs, and then evaluates how well existing safety benchmarks cover these attack vectors. The core finding is that current testing frameworks significantly under-represent many real-world attack categories, meaning organisations relying solely on standard safety benchmarks may have a false sense of security.

The primary affected groups are any organisations deploying or integrating LLMs into customer-facing or regulated processes, particularly in finance, healthcare, legal services, and public administration. Under the EU AI Act, these entities are classified as providers or deployers of high-risk AI systems and must demonstrate robust risk management and testing. The audit reveals gaps in current red-teaming and evaluation practices that could lead to non-compliance with requirements for accuracy, robustness, and transparency.

Compliance teams should immediately review their current LLM safety testing protocols against the taxonomy presented in this paper. They should identify which attack categories are not covered by their existing benchmarks and update their testing suites accordingly. It is also prudent to document this gap analysis as part of the technical documentation required under the AI Act, and to schedule a re-evaluation of third-party model providers to ensure their safety claims align with this more comprehensive attack coverage.

View original at arxiv_cscr →

This summary is AI-generated for orientation purposes. For regulatory action, always consult the original source linked above.

More AI_SAFETY updates

Latest in AI_SAFETY.

arxiv_cscr26 Jun 2026

arXiv: GTI-mSEMP Framework : A Proposed Framework to Stimulate Malware Propagation with Inclusion of Attacker-Defender Strategy

A new preprint published on arXiv proposes a framework called GTI-mSEMP, which models how malware could be deliberately stimulated to spread more effectively by incorporating attacker and defender…

arxiv_cscr26 Jun 2026

arXiv: ToolPrivacyBench: Benchmarking Purpose-Bound Privacy in Tool-Using LLM Agents

This paper, ToolPrivacyBench, introduces a new benchmarking framework designed to evaluate how well large language model agents protect user privacy when using external tools. It specifically tests…

arxiv_cscr26 Jun 2026

arXiv: Ghost Without Shell: Measuring Non-Interactive SSH Attacks on Honeypots

This paper, published on arXiv, presents a novel measurement study of non-interactive SSH attacks against honeypots, which are decoy systems used to detect cyber threats. The research reveals that a…

arxiv_cscr26 Jun 2026

arXiv: Quantum Multi-Party Threshold Private Set Intersection with Explicit Cardinality Testing

This publication introduces a novel cryptographic protocol for quantum multi-party threshold private set intersection with explicit cardinality testing. It enables multiple parties to compute the…

← Back to all updates

Live regulatory monitoring

Never miss a compliance update.

Get weekly digests of DORA, NIS2, GDPR, MaRisk, and ISO 27001 changes — straight to your inbox. Free.

No spam. Weekly digest only. Unsubscribe anytime.

DORANIS2GDPRMaRiskISO 27001

Map this to your controls

Connect regulatory changes to your compliance work.

Matproof maps every regulator update directly to your controls and surfaces the ones that affect your organisation — across 21 frameworks.

Book a Demo Browse all updates