35

The Skills Paradigm: From Agents to Skill Libraries

⏱️ 35 min

The Skills Paradigm: From Agents to Skill Libraries

Background: Agents Are Smart but Unreliable

Over the past year "Agent" became the hot buzzword, but most Agents are more like "interns with good IQs but zero experience." They can finish tasks, but they lack stable professional processes and accumulated know-how:

  • They don't remember what worked last time
  • They can't build up experience over time
  • They can't share best practices across a team

That makes it really hard to put Agents into serious production environments.

Anthropic's Answer: Skills

Anthropic introduced a key concept: package capabilities into reusable, composable skills.

The core idea: stop stacking more Agents — build a skill library instead.

And the way Skills are implemented is surprisingly simple: a Skill is just a folder.

Skills Folder Diagram

The "Folder" Wins

Folders are the most universal organizational unit. Everyone understands them, they're easy to share, and they plug right into existing workflows:

  • Version with Git
  • Share via cloud storage
  • Package as a zip for distribution

A Skill folder can contain:

  • instructions.md: steps and gotchas
  • fetch_data.py: pull internal data
  • analyze.py: metrics analysis logic
  • report_template.pptx: output template

When an Agent needs to complete a task, it reads the folder, runs the scripts, and generates the output — like receiving a "toolkit + operations manual."

Progressive Disclosure: The Key to Scalability

Agents don't load all Skills at once. Instead:

  1. They first see skill metadata (name + summary)
  2. They only read full contents when needed

This lets Agents equip a massive number of skills without being limited by the context window.

Progressive Disclosure Diagram

From "Calling Tools" to "Driving Code"

Traditional Agents depend on APIs/Tools. When a tool has a bug, the Agent just... waits.

The Skills paradigm changes that relationship: code is the universal interface.

  • Code is the most precise documentation
  • Code can be modified and optimized
  • Code can be saved as new skills

This evolves Agents from "tool callers" to "code drivers," and the capability library grows with them.

From Individual Intelligence to Collective Wisdom

Skills aren't just valuable for a single Agent — they matter at the organizational level:

  • Base skills: Anthropic's built-in capabilities
  • Third-party skills: Deep integrations with products like Notion, Stagehand, etc.
  • Internal enterprise skills: Company processes and standards packaged as skills

The result: team best practices get continuously accumulated, and new team members can directly reuse the "veteran's" experience.

The OS & Apps of the AI Era

You can think of a complete agent architecture this way:

  • LLM = CPU
  • Agent Runtime = Operating System (OS)
  • Skills = Applications (Apps)

Real productivity comes from the "app ecosystem," not from reinventing the OS.

OS And Apps Diagram

Continuous Learning and Self-Evolution

The ultimate goal is for Agents to create Skills on their own:

When a workflow keeps recurring, the Agent can save it as a new skill folder, turning memory into controllable, shareable "procedural memory."

Practical Guide: Turning Experience into Skills

Here's a concrete skill-packaging workflow that teams can use to quickly accumulate best practices.

1) Define a "Reusable Task"

Pick a task your team does repeatedly but keeps getting wrong:

  • Generating weekly/operations reports
  • Financial statement analysis and reporting
  • Customer service scripts and knowledge base responses
  • Code deployment and rollback procedures

2) Design the Skill Folder Structure

Recommended structure:

skill-name/
  instructions.md
  scripts/
    fetch_data.py
    analyze.py
  templates/
    report_template.pptx
  data/
    sample.json

3) Write Clear instructions.md

Give the Agent a "stable execution SOP." It should include:

  • Goals and inputs/outputs
  • Core steps (numbered)
  • Common errors and how to handle them
  • Acceptance criteria for success

4) Script the Critical Steps

Prioritize scripting the steps that are error-prone or repetitive:

  • Data fetching
  • Metric calculation
  • Result formatting
  • Report export

5) Define Metadata (for Discoverability)

Tag each Skill with a short description of its purpose and prerequisites:

  • "For quarterly financial analysis, requires API credentials to be configured"
  • "For community weekly summary, input is CSV"

6) Add Samples and Tests

Include a minimal runnable sample so the Agent can quickly verify:

  • data/sample.json
  • scripts/test_run.py

Template: Financial Report Analysis Skill

You can use this template directly:

  • instructions.md: document the financial analysis steps
  • fetch_data.py: fetch the financial report
  • analyze.py: calculate key metrics
  • report_template.pptx: output template

This way, the Agent follows the same process every time, reducing uncertainty.

Common Pitfalls to Avoid

  • Pitfall 1: Stuffing everything into the Prompt Move stable workflows into Skills instead. The Prompt should only handle "invocation."

  • Pitfall 2: Skills that are too complex If a single Skill gets too big, split it into smaller skills and compose them.

  • Pitfall 3: No verifiable output Define "acceptance criteria" for each Skill's output to prevent results from drifting.

Takeaway

Anthropic's Skills paradigm is about moving expertise from "inside the Prompt" to "inside a skill library."

Shifting from building individual Agents to building a reusable skill ecosystem — that's probably the real path for AI-era applications.

📚 相关资源

❓ 常见问题

关于本章主题最常被搜索的问题,点击展开答案

Skill 跟 Agent 到底差在哪?为什么要换范式?

Agent 像没经验的实习生:聪明但记不住上次的成功做法,经验无法沉淀。Skill 把可复用任务封装成一个文件夹(instructions.md + scripts/ + templates/),用 Git 版本化、团队共享,模型每次执行都按同一 SOP 来。Anthropic 的口号:不要继续堆 Agent,构建技能库。

为什么 Skill 用「文件夹」这么朴素的形式?

文件夹是最普及的组织单元 — 人人懂,能用 Git 版本化、网盘共享、zip 分发,不需要新工具。一个 Skill 文件夹典型结构:instructions.md(SOP)+ fetch_data.py(数据拉取)+ analyze.py(分析)+ report_template.pptx(输出模板)。Agent 拿到的就是"工具包 + 操作手册"。

Progressive disclosure 在 Skills 里是怎么工作的?

Agent 不会一次性把所有 Skills 全装入 context,而是先看元数据(名字 + 简介),用到时再读完整内容。这样 context window 只承载当下需要的那一两个 Skill,理论上技能库可以无限扩 — 200 个 Skill 也不挤占 token,跟 RAG 检索资料是同一思路。

我的 Skill 越写越大,什么时候该拆?

instructions.md 超过两屏、脚本数 > 5、单个 SOP 出现明显的"小阶段"时就拆。本章三个常见误区里第二条直接点名:Skill 太复杂时拆成多个小技能组合使用。例:"季度财报分析"拆成"拉数据 / 算指标 / 生成 PPT"三个独立 Skill,按需组合,每个都能单独验收。

Skill 跟传统 API tool 调用的最大不同在哪?

传统 Agent 依赖 API/Tools — tool 有 bug 时 Agent 只能干等。Skills 把关系翻转成「代码即通用接口」:代码本身就是最精确的文档,可被 Agent 修改、优化、沉淀为新技能。模型从"工具调用者"进化为"代码驾驭者",能力库随之可生长,而不是被外部工具厂商卡脖子。

Diagram Switcher

Click to switch and quickly compare three perspectives

Skill Folder Structure

One Skill = one folder, containing instructions, scripts, templates, and sample data.