TL;DR

Anthropic published a Claude Code engineering write-up on June 3, 2026 describing how it uses hundreds of reusable Skills across its engineering organization. The confirmed development is the company’s account of Skills as folders containing instructions, scripts, references and hooks, rather than saved prompts; the broader business impact is still being tested outside Anthropic.

Anthropic has published a Claude Code engineering report explaining how it uses hundreds of reusable Skills across its own engineering organization, presenting them as versioned folders of instructions, scripts and references rather than one-off prompts. The development matters because it points to a more durable way for companies to make AI coding agents follow shared operating practices instead of relying on repeated manual prompting.

The post, titled “Lessons from building Claude Code: How we use skills” and attributed to Thariq Shihipar of Anthropic, was published on the Claude blog on June 3, 2026. According to the source material, Anthropic says a Skill is structured as a folder that can include a SKILL.md file, references, scripts, assets, configuration, hooks and memory.

The confirmed technical point is definitional: Anthropic describes Skills as units that an agent can discover, read and run. The root instructions tell the model when to use the Skill, while supporting files can be loaded only when the task requires them. The source material frames that pattern as progressive disclosure, meaning the agent starts with a short instruction layer and then reaches into deeper material when needed.

Anthropic’s reported internal taxonomy groups Skills into nine categories: library or API reference, product verification, data fetching and analysis, business-process automation, code scaffolding and templates, code quality and review, CI/CD and deployment, runbooks, and infrastructure operations. The source material says verification Skills, which check an agent’s work, had the largest measured impact on output quality in Anthropic’s own use.

At a glance

reportWhen: published June 3, 2026; discussed in a…

The developmentAnthropic published a Claude Code post detailing lessons from running hundreds of reusable Skills across its own engineering organization.

AI Dispatch · Insights · 1 July 2026

A Skill is a folder, not a prompt

Anthropic published what it learned running hundreds of Skills across its own engineering org. Read as a business memo, the point is bigger than a coding trick: this is how ad-hoc prompting becomes durable institutional capability — the SOPs your agents actually follow, versioned and shared.

✕ The misconception

“A Skill is just a clever markdown prompt you save in a file.”

✓ What it actually is

A folder the agent can discover, read & run — instructions, scripts, references, templates, config & on-demand hooks.

Anatomy of a Skill — the file system is context engineering

my-skill/the unit you share & version

├─ SKILL.mdroot instructions + a description written for the model (its trigger)

├─ references/deep detail pulled in only when needed — progressive disclosure

├─ scripts/real code, so the agent composes instead of rebuilding boilerplate

├─ assets/templates & files to copy into the output

├─ config.jsonsetup the agent asks for if it’s missing (e.g. which Slack channel)

└─ hooks + memoryon-demand guardrails + an append-only log so it remembers

Why it matters: the folder itself is the knowledge base. The agent reads the root, then reaches deeper only when the task demands it — the same way you’d hand a new hire a one-pager that points to the detailed docs.

The nine types — a gap-analysis map for your own library

1Library / API reference

2Product verification ★ top impact

3Data fetching & analysis

4Business-process automation

5Code scaffolding & templates

6Code quality & review

7CI/CD & deployment

8Runbooks

9Infrastructure operations

By Anthropic’s own measurement, verification Skills — the ones that check the work — moved output quality the most. If you build one category well, build that one.

The craft — what separates a good Skill from a useless one

Gotchas = highest-signal section Describe for the model, not humans (it’s the trigger) Don’t state the obvious Ship scripts, not just prose On-demand guardrail hooks (/careful, /freeze) Let it remember (log / SQLite) Don’t railroad — leave room to adapt

The take

The knowledge of how your organization actually operates can be captured, versioned, shared & executed — and the thing capturing it is a humble folder with a script and a gotchas list inside. For the builder, that’s context engineering with real tools attached. For whoever owns the budget, it’s the difference between AI that starts from zero every morning and an asset that compounds. Caveats: best practices are still evolving, checked-in Skills cost context, and curation beats accumulation. Start with one Skill, one gotcha, and the category that catches your mistakes.

Source: “Lessons from building Claude Code: How we use skills,” Thariq Shihipar (Anthropic), Claude blog, 3 June 2026. Categories, examples & measured claims are Anthropic’s; framing is the author’s. Docs: code.claude.com/docs/en/skills.

thorstenmeyerai.com

From Prompts to Shared Assets

The main consequence for readers is that Anthropic is presenting agent guidance as organizational infrastructure, not just clever prompt writing. If the model’s working instructions live in folders with scripts, templates and checks, teams can share, revise and version that knowledge the same way they manage other engineering assets.

That could matter for companies trying to make AI coding agents more consistent. A Skill can bundle tribal knowledge, project-specific rules and repeatable checks in one place, according to the source material. In practice, that means a new engineer’s agent and a senior engineer’s agent could be pointed at the same approved process for tasks such as release checks, code review or product verification.

The business claim is broader than the confirmed technical description. The Thorsten Meyer AI analysis argues that Skills can become an appreciating asset because teams can improve them as new edge cases appear. That is an interpretation of Anthropic’s write-up, not an independent measurement of return on investment across the wider market.

Designing Instruction with Generative AI: 24/7 Support for Optimizing Teaching and Learning

As an affiliate, we earn on qualifying purchases.

How Anthropic Structures Skills

The source material says a typical Skill begins with SKILL.md, which contains root instructions and a description written for the model. That description acts as the trigger for when the agent should use the Skill. Supporting folders can contain references for deeper detail, scripts for repeatable work, assets such as templates, configuration files and optional hooks.

The analysis also highlights several craft lessons attributed to Anthropic’s experience: write descriptions for the model rather than humans, avoid stating obvious information, include scripts instead of prose alone, use guardrail hooks for sensitive workflows and allow room for the agent to adapt. The source material says the strongest Skills often start small, then improve as teams record gotchas and edge cases.

The July 1, 2026 dispatch from Thorsten Meyer AI reframes the Anthropic post as a business memo. Its central reading is that Skills turn repeated prompting into durable institutional capability. That framing goes beyond Anthropic’s how-to material, but it is grounded in the company’s reported internal use of hundreds of Skills.

Amazon

versioned code repository for AI projects

As an affiliate, we earn on qualifying purchases.

Outside Results Still Unproven

Several points remain open. Anthropic’s reported results come from its own engineering organization, and the source material does not provide a full public dataset showing how hundreds of Skills performed across different companies, codebases or agent setups. It is also unclear how much maintenance burden a large Skill library creates over time.

The analysis warns that best practices are still evolving, that checked-in Skills can consume model context, and that curation may matter more than accumulation. It is not yet clear which teams can justify spending an engineer-week on a single Skill category, or which categories produce the best returns outside Anthropic’s environment.

From Scripting To Systems: A Practical Guide to Using AI Workflows That Save Time, Reduce Errors, and Make You the Go-To Tech Expert

As an affiliate, we earn on qualifying purchases.

Teams Test Verification First

The next practical step for teams following Anthropic’s lead is likely to be narrow adoption rather than building large libraries immediately. The source material recommends starting with one Skill, one recurring gotcha and the category most likely to catch mistakes, especially verification.

Readers should watch whether Anthropic, Claude Code users or other AI toolmakers publish more evidence on quality gains, maintenance costs and patterns for sharing Skills across teams. The key test is whether folder-based Skills can make agent work more repeatable in real production settings, not only inside Anthropic’s internal workflows.

The AI Agent Blueprint: A Practical Playbook for Building Agentic Artificial Intelligence: Launch Your First Agent in 30 Days

As an affiliate, we earn on qualifying purchases.

Key Questions

What did Anthropic publish?

Anthropic published a Claude Code engineering post on June 3, 2026 describing lessons from using hundreds of Skills across its engineering organization.

What is a Skill in this report?

A Skill is described as a folder that can include instructions, scripts, references, templates, configuration, hooks and memory. It is not presented as only a saved prompt.

Which Skill category had the biggest reported effect?

According to the source material, Anthropic found that verification Skills, which check an agent’s work, had the largest measured effect on output quality in its own use.

Why does this matter for companies using AI agents?

The approach could let teams package repeatable procedures, project rules and review checks into shared assets that agents can apply across tasks. That may reduce repeated prompting and make agent behavior more consistent.

What is still unknown?

It remains unclear how well Anthropic’s internal results apply to other companies, how much work large Skill libraries require to maintain, and which use cases produce the strongest measurable benefits.

Source: Thorsten Meyer AI

A Skill Is a Folder, Not a Prompt: What Anthropic Learned Running Hundreds of Them

Up next

How to Choose Portable Bluetooth Speakers

Author

The Sound of Music Guide Team

Share article

A Skill is a folder, not a prompt

From Prompts to Shared Assets

Designing Instruction with Generative AI: 24/7 Support for Optimizing Teaching and Learning

How Anthropic Structures Skills

versioned code repository for AI projects

Outside Results Still Unproven

From Scripting To Systems: A Practical Guide to Using AI Workflows That Save Time, Reduce Errors, and Make You the Go-To Tech Expert

Teams Test Verification First

The AI Agent Blueprint: A Practical Playbook for Building Agentic Artificial Intelligence: Launch Your First Agent in 30 Days