TL;DR

Anthropic has published lessons from using hundreds of reusable Claude Code Skills across its engineering organization. The confirmed development is a June 3, 2026 Claude blog post by engineer Thariq Shihipar describing Skills as folders containing instructions, scripts, references, templates and hooks, not just saved prompts.

Anthropic has detailed how its engineering organization uses hundreds of Claude Code Skills, arguing that reusable skill folders can turn repeated AI-agent prompting into shared, versioned operating procedures for software teams.

The confirmed development is a June 3, 2026 Claude blog post, Lessons from building Claude Code: How we use skills, by Anthropic engineer Thariq Shihipar. The post describes a Skill as a discoverable folder that can include SKILL.md instructions, reference files, scripts, templates, configuration, hooks and memory, according to the source material.

Anthropic’s central distinction is that a Skill is not only a saved prompt. It is a package the agent can read and use as needed, including by running code stored in the folder. The company says that structure lets teams move repeated guidance, review procedures, runbooks and product-specific checks into reusable assets.

According to the write-up, Anthropic grouped its internal Skills into nine categories, including API references, product verification, data analysis, business-process automation, scaffolding, code review, deployment, runbooks and infrastructure operations. The source material says verification Skills, which check an agent’s work, had the largest measured effect on output quality.

At a glance

reportWhen: Anthropic published the source blog on…

The developmentAnthropic published a Claude Code engineering write-up explaining what it learned from running hundreds of reusable Skills across its own organization.

AI Dispatch · Insights · 1 July 2026

A Skill is a folder, not a prompt

Q: What did Anthropic announce?

Anthropic published lessons from using hundreds of Claude Code Skills across its engineering organization. The post explains how Skills package instructions, scripts, references and templates into reusable folders.

Q: How is a Skill different from a prompt?

A prompt is usually text entered for one task. A Skill, as described by Anthropic, is a folder the agent can discover, read and use, including by running scripts and consulting reference files.

Q: Which Skill category had the biggest impact?

According to the source material, product verification Skills had the strongest measured effect on output quality. These Skills check an agent’s work rather than only telling it how to produce the first draft.

Q: What remains unknown?

The provided material does not disclose the full measurement methodology, all internal examples or how the results vary across teams. It also remains unclear how broadly the same gains apply outside Anthropic.

Anthropic published what it learned running hundreds of Skills across its own engineering org. Read as a business memo, the point is bigger than a coding trick: this is how ad-hoc prompting becomes durable institutional capability — the SOPs your agents actually follow, versioned and shared.

✕ The misconception

“A Skill is just a clever markdown prompt you save in a file.”

✓ What it actually is

A folder the agent can discover, read & run — instructions, scripts, references, templates, config & on-demand hooks.

Anatomy of a Skill — the file system is context engineering

my-skill/the unit you share & version

├─ SKILL.mdroot instructions + a description written for the model (its trigger)

├─ references/deep detail pulled in only when needed — progressive disclosure

├─ scripts/real code, so the agent composes instead of rebuilding boilerplate

├─ assets/templates & files to copy into the output

├─ config.jsonsetup the agent asks for if it’s missing (e.g. which Slack channel)

└─ hooks + memoryon-demand guardrails + an append-only log so it remembers

Why it matters: the folder itself is the knowledge base. The agent reads the root, then reaches deeper only when the task demands it — the same way you’d hand a new hire a one-pager that points to the detailed docs.

The nine types — a gap-analysis map for your own library

1Library / API reference

2Product verification ★ top impact

3Data fetching & analysis

4Business-process automation

5Code scaffolding & templates

6Code quality & review

7CI/CD & deployment

8Runbooks

9Infrastructure operations

By Anthropic’s own measurement, verification Skills — the ones that check the work — moved output quality the most. If you build one category well, build that one.

The craft — what separates a good Skill from a useless one

Gotchas = highest-signal section Describe for the model, not humans (it’s the trigger) Don’t state the obvious Ship scripts, not just prose On-demand guardrail hooks (/careful, /freeze) Let it remember (log / SQLite) Don’t railroad — leave room to adapt

The take

The knowledge of how your organization actually operates can be captured, versioned, shared & executed — and the thing capturing it is a humble folder with a script and a gotchas list inside. For the builder, that’s context engineering with real tools attached. For whoever owns the budget, it’s the difference between AI that starts from zero every morning and an asset that compounds. Caveats: best practices are still evolving, checked-in Skills cost context, and curation beats accumulation. Start with one Skill, one gotcha, and the category that catches your mistakes.

Source: “Lessons from building Claude Code: How we use skills,” Thariq Shihipar (Anthropic), Claude blog, 3 June 2026. Categories, examples & measured claims are Anthropic’s; framing is the author’s. Docs: code.claude.com/docs/en/skills.

thorstenmeyerai.com

Reusable Procedures For AI Agents

The report matters because many organizations still rely on repeated human prompting to steer coding agents. Anthropic’s approach points to a different model: capture the work instructions once, keep them in a shareable folder, and let the agent pull in deeper material only when the task calls for it.

For engineering leaders, the practical implication is that agent performance may depend less on one-off prompting skill and more on maintained internal libraries. Anthropic’s framing suggests Skills can support consistency, onboarding and quality control, especially where teams need agents to follow company-specific procedures rather than generic coding advice.

Claude AI for Beginners Bible: [5 in 1] The Ultimate Guide to Automate Your Work, Save Hours Every Week, and Use AI for Real-World Results

As an affiliate, we earn on qualifying purchases.

From Prompting To Versioned Folders

Anthropic’s post fits into a wider shift in AI coding tools toward context engineering: giving agents structured access to the files, scripts and rules they need to complete a task. In the Skills model described here, the folder itself becomes the knowledge base.

The source material says an effective Skill starts with a concise SKILL.md file that describes when the model should use it. Deeper references, scripts and templates can live alongside it, allowing the agent to retrieve more detail only when needed. That design is meant to reduce clutter while still giving the agent access to specific institutional knowledge.

The July 1, 2026 Thorsten Meyer AI analysis frames the development as a business issue rather than only a developer technique: procedures that once lived in memory, chat threads or scattered documentation can become versioned, shared and executable.

AI Prompt Engineering: Foundations of Communication with LLMs – Building Generative AI and Agentic AI Prompt Systems Across Development, Testing, and Deployment (AI Engineering)

As an affiliate, we earn on qualifying purchases.

Limits Of The Evidence

Several details remain unclear from the provided source material. Anthropic is said to have used hundreds of Skills, but the material does not provide the full dataset, the measurement method for output quality, or the size and composition of the engineering teams involved.

It is also not yet clear how well the same approach transfers to smaller teams, non-Anthropic codebases or organizations with less mature internal documentation. The source material says best practices are still developing and that curation beats accumulation, meaning a large Skill library could become harder to manage if teams add folders without maintaining them.

Mastering Google Gemini for Apps Script: Zero-Code Automation: A Practical Guide to Reclaiming Your Time with Gemini and GAS (Google Gemini Mastery Series Book 7)

As an affiliate, we earn on qualifying purchases.

Teams Test Skill Libraries

The next step for readers using AI coding agents is likely experimentation with small, focused Skills, especially for review and verification. The source material recommends starting with one Skill, one hard-earned caveat and the category that most often catches mistakes.

Anthropic’s documentation at code.claude.com/docs/en/skills is the cited technical reference. For now, the main development is not a new model release but a clearer operating pattern: teams can treat agent instructions, scripts and checks as software assets that are versioned and improved over time.

Pydantic AI for Automation Workflows: Build Typed, Reliable, and Production-Ready AI Automations in Python

As an affiliate, we earn on qualifying purchases.

Key Questions

What did Anthropic announce?

Anthropic published lessons from using hundreds of Claude Code Skills across its engineering organization. The post explains how Skills package instructions, scripts, references and templates into reusable folders.

How is a Skill different from a prompt?

A prompt is usually text entered for one task. A Skill, as described by Anthropic, is a folder the agent can discover, read and use, including by running scripts and consulting reference files.

Which Skill category had the biggest impact?

According to the source material, product verification Skills had the strongest measured effect on output quality. These Skills check an agent’s work rather than only telling it how to produce the first draft.

What remains unknown?

The provided material does not disclose the full measurement methodology, all internal examples or how the results vary across teams. It also remains unclear how broadly the same gains apply outside Anthropic.

Why should engineering teams care?

The approach could help teams reduce repeated prompting and preserve institutional knowledge in a format agents can apply. That may improve consistency, onboarding and quality checks for AI-assisted software work.

Source: Thorsten Meyer AI

A Skill Is a Folder, Not a Prompt: What Anthropic Learned Running Hundreds of Them

Up next

Florida woman swimming in river killed in rare alligator attack at state forest

Author

The Genius Factory Team

Share article

A Skill is a folder, not a prompt

Reusable Procedures For AI Agents

Claude AI for Beginners Bible: [5 in 1] The Ultimate Guide to Automate Your Work, Save Hours Every Week, and Use AI for Real-World Results

From Prompting To Versioned Folders

AI Prompt Engineering: Foundations of Communication with LLMs – Building Generative AI and Agentic AI Prompt Systems Across Development, Testing, and Deployment (AI Engineering)

Limits Of The Evidence

Mastering Google Gemini for Apps Script: Zero-Code Automation: A Practical Guide to Reclaiming Your Time with Gemini and GAS (Google Gemini Mastery Series Book 7)

Teams Test Skill Libraries

Pydantic AI for Automation Workflows: Build Typed, Reliable, and Production-Ready AI Automations in Python