软件层的退场与重构:AI-Native 时代,你的产品下一个用户是另一个 Agent

🍄 原文发布于 blog.mushroom.cv

结论先行（BLUF）：软件层不是在"消失"，而是在被重构为 agent 的调度内核。三个命题必须分开讨论：代码不会消失（神经网络权重本身也是代码，只是不再由人手写）；可见 GUI 与"以应用为单位"的交付正在解构（这正在发生）；SaaS 的座席计费模式正被"成果计费"替代（争议最大，但已有早期实证）。

Software 3.0：意图取代功能按钮

Andrej Karpathy 在 2025 年 6 月 16 日的 YC AI Startup School 演讲中系统提出了 Software 3.0 框架：LLM 是新一代操作系统，上下文窗口即 RAM，模型权重即 CPU，提示词即编程。

"LLMs are a new kind of computer, and you program them in English." —— Andrej Karpathy, YC AI Startup School 2025

最小交互单位从"功能按钮"升格为**"意图表达"**——小饭店老板拍张菜单照说一句话，直达精修海报成品；中间所有工具链（OCR、前端、API）被收敛进 LLM 调度内核，传统脚手架整体消失。

Karpathy 同时给出了对软件公司最重要的判断：

"LLMs are the new primary consumer/manipulator of digital information. Build for agents." —— Andrej Karpathy, AI Startup School 2025

交互范式：六级台阶，L4 是过渡，L5 是终局

级别	形态	代表产品	现状
L1	App 图标 + 触屏点选	iOS/Android	存量主流
L2	Chatbot + 单步工具调用	ChatGPT plugins (2023)	已普及
L3	对话 + 生成式 UI	Claude Artifacts, ChatGPT Canvas	正在普及
L4	Operator 代点鼠标	Anthropic Computer Use, OpenAI Operator	Demo→早期落地
L5	Agent ↔ 服务协议直连	MCP / A2A / NLWeb	事实标准形成期
L6	意图执行 + 极简验证层	—	愿景

关键判断：L4"截图+点击"只是过渡——一旦服务端普遍提供 MCP/A2A 接口，AI 就没必要再点鼠标。Stripe、Notion、Linear、Asana、Intercom 已发布官方 MCP server，绕过 GUI 直接对话。Anthropic 于 2024-11-25 发布 MCP，OpenAI、Google、Microsoft 在 2025 年 3–5 月相继原生支持，18 个月内成为事实标准。

本地 AI（LAAS）：三层架构，不是纯本地

LAAS 的三个真问题成立：隐私与数据主权、低延迟与离线能力、token 经济成本。但更精确的答案不是"纯本地"，而是：

端侧小模型 + 可验证云（Apple PCC 模式）+ 协议互通

Apple Private Cloud Compute 是行业第一个把"云端隐私可证明"工程化的尝试——硬件用 Apple Silicon 服务器，运行时不存储用户数据，外部可远程验证。端侧能力边界（2026 年中）：

**手机**（iPhone 17 Pro 等）：1B–4B 模型，摘要、改写、单步工具调用
**笔记本**（MacBook Air M4 / Copilot+ PC）：3B–8B，短链 agent、IDE 内补全
**工作站**（Mac Studio M3 Ultra）：30B–70B，接近 Claude Code 级体验

超长上下文（>1M tokens）、复杂 reasoning、多 agent 协作调度仍须云端。

商业模式转轨：成果计费正在发生

Foundation Capital 2024 年估算：全球服务市场约 2.4 万亿美元，软件市场约 4000 亿——即每 1 美元软件支出对应 6 美元服务支出。若 AI 能将服务相当部分"软件化"，市场重估将极其剧烈。

早期实证：

**Sierra**（CEO Bret Taylor）：按"每解决一次客服对话"计费，而非座席数。详见 [τ²-bench 方法论](https://sierra.ai/blog/benchmarking-ai-agents-with-tau2)
**Salesforce Agentforce**：每次对话 2 美元；Benioff 公开宣布 2025 年全年停止招聘软件工程师
**Cognition Labs Devin**：以 ACU（Agent Compute Unit）计费

三条值得长期下注的方向

① 协议层公民：把核心能力暴露为 MCP server——这是 agent 时代的"SEO"，决定你的服务是否对 agent 可见。今天没有 MCP server，等于被 agent 时代"看不见"。参见 Google A2A Protocol。

② 领域数据飞轮：通用模型同质化是必然，但"你拥有而别人没有的领域数据 + 持续生成的高质量行为 trace"，是 LAAS 时代真正的护城河。Anthropic Economic Index 显示计算机与数学类任务占 Claude 使用的 37.2%——领域专精仍有巨大空间。

③ 意图设计与品牌信任：技术执行力被 AI 抹平后，用户会把意图交给"他最信任的品牌"。品牌的角色将从"营销表层"重回"产品核心"——这是对"craftsmanship 不可替代"（DHH 的持续论点）的另一面诠释。

结语

软件不会消失，软件的"边界"会消失。当代码本身被工具化，"理解一个复杂系统、定义清楚要做什么、并对结果负责" 这件事比写代码本身更稀缺，也更有价值。

对软件从业者的实际建议：

立刻把核心能力暴露为 MCP server——你产品的下一个主用户是另一个 agent
把 PRD/Spec 当作一等代码工件管理，它将成为新研发流程的源头
新增 agent threat model：把 prompt injection、tool poisoning 加入安全审查清单（[Invariant Labs 2025-04 MCP 安全报告](https://invariantlabs.ai/blog/mcp-security-notification-tool-poisoning-attacks)）
认真对待"Operator 类是过渡形态"——战略押注在"服务对 agent 友好"而非"AI 替人点鼠标"

主要参考

Karpathy, "Software Is Changing (Again)", YC AI Startup School 2025-06-16 — [链接](https://www.ycombinator.com/library/MW-andrej-karpathy-software-is-changing-again)
Anthropic, Model Context Protocol 发布 2024-11-25 — [链接](https://www.anthropic.com/news/model-context-protocol)
Anthropic, "Building Effective Agents" 2024-12-19 — [链接](https://www.anthropic.com/research/building-effective-agents)
Apple, Private Cloud Compute 技术说明 2024-06 — [链接](https://security.apple.com/blog/private-cloud-compute/)
Google, A2A Protocol 发布 2025-04-09 — [链接](https://developers.googleblog.com/en/a2a-a-new-era-of-agent-interoperability/)
Foundation Capital, "AI is leading a service as software paradigm shift" 2024-04-19 — [链接](https://foundationcapital.com/ai-service-as-software/)
METR, Measuring AI Ability to Complete Long Tasks 2025-03 — [链接](https://metr.org/blog/2025-03-19-measuring-ai-ability-to-complete-long-tasks/)
Simon Willison, "Not all AI-assisted programming is vibe coding" 2025-03-19 — [链接](https://simonwillison.net/2025/Mar/19/vibe-coding/)
Anthropic Economic Index 2025-02 — [链接](https://www.anthropic.com/economic-index)
Geoffrey Litt, "Malleable Software in the Age of LLMs" 2023 — [链接](https://www.geoffreylitt.com/2023/03/25/llm-end-user-programming.html)
Maggie Appleton, "Home-Cooked Software and Barefoot Developers" 2024 — [链接](https://maggieappleton.com/home-cooked-software)
Sierra, τ²-bench — [链接](https://sierra.ai/blog/benchmarking-ai-agents-with-tau2)
Apollo Research, Scheming Reasoning Evaluations 2024-12 — [链接](https://www.apolloresearch.ai/research/scheming-reasoning-evaluations)
Invariant Labs, MCP Tool Poisoning Attacks 2025-04 — [链接](https://invariantlabs.ai/blog/mcp-security-notification-tool-poisoning-attacks)

© 2026 Author: Mycelium Protocol. 本文采用 CC BY 4.0 授权——欢迎转载和引用，须注明作者姓名及原文链接，不得去除署名后以原创发布。

BLUF: The software layer isn't disappearing — it's being restructured into an agent orchestration kernel. Three propositions must be separated: code won't disappear (neural network weights are also code, just no longer hand-written by humans); visible GUI and app-as-delivery-unit are being deconstructed (this is happening now); SaaS seat-based billing is being replaced by outcome-based billing (most contested, but with early empirical proof).

Software 3.0: Intent Replaces Feature Buttons

Andrej Karpathy's YC AI Startup School keynote (2025-06-16) systematically introduced the Software 3.0 framework: LLMs are the new OS — context window is RAM, model weights are CPU, prompting is programming.

The minimum interaction unit has upgraded from "feature button" to "intent expression" — a restaurant owner photographs a menu and says one sentence, directly producing a polished social media post; all intermediate tooling (OCR, frontend, APIs) collapses into the LLM orchestration kernel.

"LLMs are the new primary consumer/manipulator of digital information. Build for agents."

Interaction Paradigms: Six Levels, L4 Is Transitional, L5 Is the Endgame

Level	Form	Representative Products	Status
L1	App icon grid + touch	iOS/Android	Existing mainstream
L2	Chatbot + single tool calls	ChatGPT plugins (2023)	Widespread
L3	Chat + Generative UI	Claude Artifacts, ChatGPT Canvas	Spreading
L4	Operator "click-for-me" agents	Computer Use, OpenAI Operator	Demo → early deployment
L5	Agent ↔ Service protocol direct connect	MCP / A2A / NLWeb	De facto standard forming
L6	Intent execution + minimal verification	—	Vision

Key judgment: L4 "screenshot+click" is transitional — once services universally provide MCP/A2A interfaces, AI has no reason to click mice. Stripe, Notion, Linear, Asana, Intercom have already released official MCP servers, bypassing GUI for direct dialogue. Anthropic released MCP on 2024-11-25; OpenAI, Google, and Microsoft natively supported it within 18 months — a rare instance of an open standard achieving de facto monopoly so quickly.

Local AI (LAAS): Three-Layer Architecture, Not Pure Local

LAAS's three genuine problems are valid: privacy/data sovereignty, low-latency/offline capability, unsustainable token economics. But the accurate answer isn't "pure local" — it's:

On-device small models + verifiable cloud (Apple PCC model) + protocol interoperability

Apple's Private Cloud Compute is the industry's first engineering attempt at "provably private cloud computation." On-device capability ceilings (mid-2026):

**Phone** (iPhone 17 Pro): 1B–4B models, summarization, single-step tool calls
**Laptop** (MacBook Air M4): 3B–8B, short-chain agents, IDE completion
**Workstation** (Mac Studio M3 Ultra): 30B–70B, near Claude Code-level experience

Ultra-long context (>1M tokens), complex reasoning, and multi-agent orchestration still require cloud.

Business Model Transition: Outcome Billing Is Happening

Foundation Capital estimates: global services market ~$2.4T, software market ~$400B — every $1 software spend corresponds to $6 service spend. If AI can "software-ize" significant portions of services, market repricing will be extreme.

Early proof: Sierra (CEO Bret Taylor) bills per "resolved customer service conversation," not per seat. Salesforce Agentforce charges $2 per conversation; Benioff publicly announced halting all software engineer hiring in 2025.

Three Long-Term Investment Directions

① Protocol-layer citizenship: Expose core capabilities as MCP servers — this is the "SEO" of the agent era. No MCP server today = invisible to agents tomorrow.

② Domain data flywheel: General models will commoditize. "Domain data you own that others don't + high-quality behavior traces you continuously generate" is the moat of the LAAS era.

③ Intent design and brand trust: When technical execution is leveled by AI, users give their intent to the brand they trust most. Brand returns from "marketing surface" to "product core."

Key References

Karpathy, YC AI Startup School 2025 — [link](https://www.ycombinator.com/library/MW-andrej-karpathy-software-is-changing-again)
Anthropic, MCP release 2024-11 — [link](https://www.anthropic.com/news/model-context-protocol)
Anthropic, "Building Effective Agents" — [link](https://www.anthropic.com/research/building-effective-agents)
Apple, Private Cloud Compute — [link](https://security.apple.com/blog/private-cloud-compute/)
Google, A2A Protocol — [link](https://developers.googleblog.com/en/a2a-a-new-era-of-agent-interoperability/)
Foundation Capital, "Service as Software" — [link](https://foundationcapital.com/ai-service-as-software/)
METR, Long Task Measurement 2025-03 — [link](https://metr.org/blog/2025-03-19-measuring-ai-ability-to-complete-long-tasks/)
Invariant Labs, MCP Tool Poisoning — [link](https://invariantlabs.ai/blog/mcp-security-notification-tool-poisoning-attacks)

© 2026 Author: Mycelium Protocol. Licensed under CC BY 4.0 — free to share and adapt with attribution. You must credit the author and link to the original; removing attribution and republishing as original is not permitted.

🍄

Mycelium

🪵 Infras | 🦠 Protocols | 🕸️ Networks

📍 blog.mushroom.cv