TL;DR
Anthropic published a Claude Code engineering write-up on June 3, 2026 describing how it uses hundreds of reusable Skills across its engineering organization. The confirmed development is the company’s account of Skills as folders containing instructions, scripts, references and hooks, rather than saved prompts; the broader business impact is still being tested outside Anthropic.
Anthropic has published a Claude Code engineering report explaining how it uses hundreds of reusable Skills across its own engineering organization, presenting them as versioned folders of instructions, scripts and references rather than one-off prompts. The development matters because it points to a more durable way for companies to make AI coding agents follow shared operating practices instead of relying on repeated manual prompting.
The post, titled “Lessons from building Claude Code: How we use skills” and attributed to Thariq Shihipar of Anthropic, was published on the Claude blog on June 3, 2026. According to the source material, Anthropic says a Skill is structured as a folder that can include a SKILL.md file, references, scripts, assets, configuration, hooks and memory.
The confirmed technical point is definitional: Anthropic describes Skills as units that an agent can discover, read and run. The root instructions tell the model when to use the Skill, while supporting files can be loaded only when the task requires them. The source material frames that pattern as progressive disclosure, meaning the agent starts with a short instruction layer and then reaches into deeper material when needed.
Anthropic’s reported internal taxonomy groups Skills into nine categories: library or API reference, product verification, data fetching and analysis, business-process automation, code scaffolding and templates, code quality and review, CI/CD and deployment, runbooks, and infrastructure operations. The source material says verification Skills, which check an agent’s work, had the largest measured impact on output quality in Anthropic’s own use.
A Skill is a folder, not a prompt
Anthropic published what it learned running hundreds of Skills across its own engineering org. Read as a business memo, the point is bigger than a coding trick: this is how ad-hoc prompting becomes durable institutional capability — the SOPs your agents actually follow, versioned and shared.
“A Skill is just a clever markdown prompt you save in a file.”
A folder the agent can discover, read & run — instructions, scripts, references, templates, config & on-demand hooks.
The knowledge of how your organization actually operates can be captured, versioned, shared & executed — and the thing capturing it is a humble folder with a script and a gotchas list inside. For the builder, that’s context engineering with real tools attached. For whoever owns the budget, it’s the difference between AI that starts from zero every morning and an asset that compounds. Caveats: best practices are still evolving, checked-in Skills cost context, and curation beats accumulation. Start with one Skill, one gotcha, and the category that catches your mistakes.
The main consequence for readers is that Anthropic is presenting agent guidance as organizational infrastructure, not just clever prompt writing. If the model’s working instructions live in folders with scripts, templates and checks, teams can share, revise and version that knowledge the same way they manage other engineering assets.
That could matter for companies trying to make AI coding agents more consistent. A Skill can bundle tribal knowledge, project-specific rules and repeatable checks in one place, according to the source material. In practice, that means a new engineer’s agent and a senior engineer’s agent could be pointed at the same approved process for tasks such as release checks, code review or product verification.
The business claim is broader than the confirmed technical description. The Thorsten Meyer AI analysis argues that Skills can become an appreciating asset because teams can improve them as new edge cases appear. That is an interpretation of Anthropic’s write-up, not an independent measurement of return on investment across the wider market.

Designing Instruction with Generative AI: 24/7 Support for Optimizing Teaching and Learning
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
How Anthropic Structures Skills
The source material says a typical Skill begins with SKILL.md, which contains root instructions and a description written for the model. That description acts as the trigger for when the agent should use the Skill. Supporting folders can contain references for deeper detail, scripts for repeatable work, assets such as templates, configuration files and optional hooks.
The analysis also highlights several craft lessons attributed to Anthropic’s experience: write descriptions for the model rather than humans, avoid stating obvious information, include scripts instead of prose alone, use guardrail hooks for sensitive workflows and allow room for the agent to adapt. The source material says the strongest Skills often start small, then improve as teams record gotchas and edge cases.
The July 1, 2026 dispatch from Thorsten Meyer AI reframes the Anthropic post as a business memo. Its central reading is that Skills turn repeated prompting into durable institutional capability. That framing goes beyond Anthropic’s how-to material, but it is grounded in the company’s reported internal use of hundreds of Skills.
versioned code repository for AI projects
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Outside Results Still Unproven
Several points remain open. Anthropic’s reported results come from its own engineering organization, and the source material does not provide a full public dataset showing how hundreds of Skills performed across different companies, codebases or agent setups. It is also unclear how much maintenance burden a large Skill library creates over time.
The analysis warns that best practices are still evolving, that checked-in Skills can consume model context, and that curation may matter more than accumulation. It is not yet clear which teams can justify spending an engineer-week on a single Skill category, or which categories produce the best returns outside Anthropic’s environment.

From Scripting To Systems: A Practical Guide to Using AI Workflows That Save Time, Reduce Errors, and Make You the Go-To Tech Expert
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Teams Test Verification First
The next practical step for teams following Anthropic’s lead is likely to be narrow adoption rather than building large libraries immediately. The source material recommends starting with one Skill, one recurring gotcha and the category most likely to catch mistakes, especially verification.
Readers should watch whether Anthropic, Claude Code users or other AI toolmakers publish more evidence on quality gains, maintenance costs and patterns for sharing Skills across teams. The key test is whether folder-based Skills can make agent work more repeatable in real production settings, not only inside Anthropic’s internal workflows.

The AI Agent Blueprint: A Practical Playbook for Building Agentic Artificial Intelligence: Launch Your First Agent in 30 Days
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Key Questions
What did Anthropic publish?
Anthropic published a Claude Code engineering post on June 3, 2026 describing lessons from using hundreds of Skills across its engineering organization.
What is a Skill in this report?
A Skill is described as a folder that can include instructions, scripts, references, templates, configuration, hooks and memory. It is not presented as only a saved prompt.
Which Skill category had the biggest reported effect?
According to the source material, Anthropic found that verification Skills, which check an agent’s work, had the largest measured effect on output quality in its own use.
Why does this matter for companies using AI agents?
The approach could let teams package repeatable procedures, project rules and review checks into shared assets that agents can apply across tasks. That may reduce repeated prompting and make agent behavior more consistent.
What is still unknown?
It remains unclear how well Anthropic’s internal results apply to other companies, how much work large Skill libraries require to maintain, and which use cases produce the strongest measurable benefits.
Source: Thorsten Meyer AI