TL;DR

Anthropic mapped a year of AI-linked malicious cyber activity across 832 banned accounts and found that counting attacker techniques is no longer a strong gauge of risk. The analysis says systems that let AI chain attack steps with little human input are a better warning sign, but standard taxonomies do not yet capture that behavior.

Anthropic has reported that a year of AI-enabled cyber misuse across 832 banned accounts shows a gap in how security teams judge attacker capability, saying traditional measures such as technique counts no longer reliably separate high-risk actors from less capable ones.

The analysis, described by Thorsten Meyer AI and attributed to Anthropic’s Frontier Red Team, mapped malicious activity from March 2025 through March 2026 onto the MITRE ATT&CK framework. The dataset is described as a detailed window into cases with enough information to evaluate techniques, not a full count of all AI-enabled cyber misuse.

According to the source material, 67.3% of the accounts, or 560, used AI to help write malware. Another 6.5%, or 54 accounts, used AI for lateral movement inside networks. The share of actors rated medium risk or higher rose from 33% in the first six months to 56% in the second half of the period.

The report’s central finding is that technique count lost much of its value as a risk signal. The source says the least-skilled actors used 16 techniques while the most-skilled used 20, a narrow gap that would have been more meaningful before AI tools could supply attack methods. The platform used, including Claude Code, API access or chat, did not correlate with risk, according to the account of the analysis.

ThorstenMeyerAI.com
AI & Security · Field Note
AI-enabled cyber threats · a year mapped

The frameworks can’t see the thing that matters

For decades, danger meant which techniques an attacker commands. A year of real AI-enabled attacks — 832 banned accounts mapped onto MITRE ATT&CK — shows that signal breaking, just as a new, harder-to-see one takes over.

Anthropic Frontier Red Team · Mar 2025–Mar 2026 · 832 accounts · via Verizon DBIR
01The dataset

A year of real misuse, mapped to the standard taxonomy

A window, not a census — these are the cases with enough detail to assess techniques thoroughly. Inside it, the risk level climbed fast.

WHAT WAS STUDIED

832 accounts
Banned for malicious cyber activity, Mar 2025–Mar 2026, mapped onto MITRE ATT&CK. The most common AI use was prep — 67.3% (560) used AI to help write malware; 6.5% (54) for lateral movement deep inside networks.

THE RISK CLIMB · MEDIUM-OR-HIGHER ACTORS

First 6 months33%
33%
Second 6 months56%
56%
≈ 1.7× increase in a single year
02The measurement breaks · press play
Amazon

AI malware detection tools

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Artificial Intelligence for Cybersecurity: How AI Detects Cyber Threats, Prevents Hacking, and Protects Your Data, Identity, and Smart Devices (AI Cybersecurity Mastery Series)

Artificial Intelligence for Cybersecurity: How AI Detects Cyber Threats, Prevents Hacking, and Protects Your Data, Identity, and Smart Devices (AI Cybersecurity Mastery Series)

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

“More techniques” stopped meaning “more dangerous”

The old heuristic: count the techniques, judge the tooling. AI dissolved it — because the model supplies the techniques either way. Watch the old signal fail, then watch what it misses.

Risk score vs. technique count

Two ways to read the same attacker. One is going blind. Press play.

the old signalSkill ≈ number of techniques?
Least-skilled
16
Most-skilled
20
16 vs. 20. A novice and an expert now look almost alike by technique-count — and the platform (Claude Code / API / chat) didn’t correlate with risk either.
what it missesThe Nov 2025 espionage operation
by technique count
30
techniques · 13 tactics
Looks like many medium-risk actors. Unremarkable.
by risk-scoring methodology
100
max risk score
The model ran as an autonomous agent — same case.
The most dangerous attribute of the year’s most dangerous attack is taxonomically invisible. ⌁ there is no MITRE ATT&CK ID for agentic orchestration
03Where the AI moved
Amazon

cyber threat intelligence software

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Practical Malware Analysis: The Hands-On Guide to Dissecting Malicious Software

Practical Malware Analysis: The Hands-On Guide to Dissecting Malicious Software

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Deeper into the attack — and into less-skilled hands

Across the year, AI use drifted from getting in toward acting once already inside — the operationally demanding stages that used to require an expert.

The attack lifecycle · where AI is now applied

The center of gravity moved right — toward post-compromise work.

Initial access
phishing, getting in
Account discovery
finding valid accounts
Lateral movement
navigating the network
Privilege escalation
deeper control
↓ 8.6%
AI-assisted phishing
A classic way to gain access — falling.
↑ 8.9%
AI for account discovery
Post-compromise work — rising.
The crack in the old model: post-compromise techniques used to be restricted to actors skilled enough to perform them. AI can now perform them on behalf of less sophisticated actors — the dangerous deep stages are no longer self-limiting.
04What actually predicts danger now
Amazon

network security monitoring devices

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Network Intrusion Detection

Network Intrusion Detection

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

From “what they know” to “what they’ve built”

The report sorts the signals into three tiers — one dead, one fading, one durable.

🔢

Technique count & tooling

16 vs. 20 between novice and expert; platform doesn’t correlate. The model supplies the techniques either way.

dead signal
📍

Where in the lifecycle AI is applied

Concentrating on operationally demanding, post-compromise stages is a better signal — but it’s eroding as the whole population heads there.

fading signal
🏗️

The scaffolding around the model

Architectures that let the model chain stages and run with minimal human input. Not what they know — whether they’ve built a system that lets AI run the attack.

durable signal
05What follows · read straight
Amazon

AI-powered intrusion detection system

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Operationalizing Threat Intelligence: A guide to developing and operationalizing cyber threat intelligence programs

Operationalizing Threat Intelligence: A guide to developing and operationalizing cyber threat intelligence programs

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Fixing the map before the territory moves again

A taxonomy that can’t name the most dangerous behavior on the field will quietly mislead the people relying on it. The response runs in two directions.

🛡️ defensively

Fed back into the models

The findings informed safeguards on the most capable models, built to detect & block some of what was observed:

  • Blocking malware development
  • Blocking mass data exfiltration
  • Putting tools in defenders’ hands first (Project Glasswing)
🧭 institutionally

Taking it to the source

Following the Verizon work, Anthropic says it’s in discussions with MITRE about how ATT&CK might evolve:

  • A vocabulary for agentic orchestration
  • Naming the scaffolding that makes a model an operator
  • An interactive technique visualization on the Red blog

Reading it in proportion

  • The 832 cases are a detailed subset, not the full population — the precise percentages are directional, not definitive.
  • “More autonomous” is not “fully autonomous” — even the standout case needed human input at key moments, which is itself a place for defenders to intervene.
  • This is one vendor’s window — the company with visibility into misuse of its own model, publishing what it found. The right thing to do with the data, and worth remembering as you read it.
ThorstenMeyerAI.com
Source: Anthropic, “What we learned mapping a year’s worth of AI-enabled cyber threats” (Jun 3, 2026) · Frontier Red Team · Verizon 2026 DBIR · figures per the report · independent commentary · findings only, no operational detail.

Why It Matters

The findings matter because many security teams use frameworks such as MITRE ATT&CK to classify behavior, compare threat actors and decide where to focus defenses. If AI allows lower-skilled actors to perform complex post-compromise tasks, defenders may misread risk when they rely mainly on how many techniques an actor appears to know.

The more durable warning sign, according to the analysis, is the scaffolding around the model: systems that let AI chain stages of an intrusion and operate with limited human direction. That behavior can make an actor more dangerous even when the visible technique list looks ordinary.

Background

MITRE ATT&CK is widely used to describe adversary tactics and techniques across the attack lifecycle. The Anthropic-linked analysis says AI use moved over the year from access-oriented activity, such as phishing, toward post-compromise work including account discovery, lateral movement and privilege escalation.

The source material says AI-assisted phishing fell by 8.6% while AI use for account discovery rose by 8.9%. It also cites a November 2025 espionage operation that used 30 techniques across 13 tactics but received a maximum risk score of 100 because the model operated as an autonomous agent.

What Remains Unclear

It is not clear how representative the 832 banned accounts are of the broader threat landscape because the dataset covers cases with enough detail to map, not all malicious AI use. The source material does not establish how often agentic orchestration appears outside Anthropic’s enforcement data, nor does it show whether other AI providers are seeing the same pattern at the same scale.

What’s Next

Anthropic says the findings have informed safeguards for its most capable models, including efforts aimed at blocking malware development and mass data exfiltration. The source material also says Anthropic is discussing with MITRE how ATT&CK might evolve to describe agentic orchestration and the surrounding systems that turn a model into an operator.

Key Questions

What was the actual news development?

Anthropic analyzed 832 banned accounts tied to malicious cyber activity over a one-year period and found that standard threat measures, especially technique counts, may miss the risk created by AI-driven attack orchestration.

Is this a full census of AI-enabled cyber threats?

No. The source describes the dataset as a window into cases with enough detail to assess techniques, not a complete count of all malicious AI-enabled activity.

What is confirmed versus claimed?

The reported dataset size, time period and mapped activity categories are presented as findings from Anthropic’s analysis. The broader conclusion that older risk signals are breaking is an interpretation drawn from those findings.

Why does agentic orchestration matter?

It refers to systems that let AI connect multiple attack stages with less human control. The analysis says that capability, rather than the raw number of techniques used, may better identify high-risk actors.

What remains unresolved?

It remains unclear whether MITRE ATT&CK will add new categories for this behavior, how defenders will measure it in daily operations and how common the pattern is across the wider market.

Source: Thorsten Meyer AI

You May Also Like

Dems replace ‘mother’ with ‘gestating parent’ in latest woke rewrite of NY law

A new bill in New York replaces ‘mother’ and ‘father’ with gender-neutral terms like ‘gestating parent,’ sparking controversy among lawmakers and critics.

The European Bet: How Mistral, Aleph Alpha, and Black Forest Labs Are Playing a Different Game

European AI firms Mistral, Aleph Alpha, and Black Forest Labs are positioning for the EU AI Act’s enforcement, emphasizing compliance and sovereign deployment over frontier capabilities.

Rebrandable client delivery dashboard for AI agencies

A new rebrandable client delivery dashboard for AI agencies is set for initial testing, aiming to improve client transparency and agency professionalism.

Every Benchmark Launched 2023-2024 Has Fallen — The METR / SWE-Bench / CORE-Bench / MLE-Bench / PostTrainBench Sequence

Every key AI research benchmark launched in 2023-2024 has either saturated or is nearing saturation, indicating rapid progress in AI capabilities.