🔥 最炸裂的判断
1
犀利
"Intelligence alone is not going to compound. Without memory, task 50 is no better than task 1."
这不是能力问题,是架构问题。模型变聪明了,但如果没有 Context Engineering,你的 Agent 永远不会从经验中成长。
2
犀利
"CLAUDE.md was unreasonably effective — a markdown file that just gives the agent a couple of instructions was so good at steering it."
一个 Markdown 文件的效果好到不合理。这是 Anthropic 整条记忆进化路线的起点——简单的东西先行,复杂的方案后补。Do the simple thing that works.
3
犀利
"Agents are actually just very good at using Bash and Grep. Just let them search over the file system."
Anthropic 当前最佳实践:别设计专门的记忆 API,把记忆放进 Markdown 文件,让 Agent 用标准工具自己搜。文件系统就是最好的记忆系统。
4
犀利
"All of this stuff has only happened in this past year. This is very much an open area of research."
从 CLAUDE.md 到 Dreaming,整条进化路线只花了一年。领域远未成熟,最佳实践还在剧烈演化中。
🏗️ 架构洞察
5
架构
"Skills: it's as if I had a bookshelf — I can scan the titles and pick one off the shelf when I need to."
渐进式披露(Progressive Disclosure)的精髓:Agent 只看书脊标题,不把每本书背下来。Skill 顶部的 front matter 就是书脊——几行字就能决定要不要展开。
6
对比
带内记忆 In-band
- 会话中读写
- 资源与任务共享
- 单 session 视野
- 下次立即生效
Dreaming Out-of-band
- 会话间异步运行
- 独立 token 预算
- 跨所有 Agent 视野
- 下一轮批处理后生效
两者不是替代关系,而是双轮驱动。带内负责即时反馈,Dreaming 负责全局模式识别。
7
工程
"When an agent wants to write, it takes a hash before and after. If they don't match, the write is rejected."
本质就是乐观锁。Agent 更新记忆时先拍快照,写完再校验。hash 不匹配就重试。多 Agent 并发写记忆的工程解法。
8
架构
"Dreaming does not change model weights — the system improves even if the model stays the same."
Dreaming 是外部记忆层的整理流程,不是微调或 RLHF。不需要 GPU,不需要训练基础设施,随时可以回滚。
🛡️ 生产实践
9
工程
"You wouldn't want one agent to just decide it should update the organization-wide context. That should be read-only."
记忆需要权限分层:组织级只读、项目级受限写入、Agent 个人级完全读写。一个 Agent 的错误判断不该扩散到整个组织。
10
工程
"We're not just looking at the back-and-forth — we're also really scrutinizing tool calls and all the metadata."
Dreaming 分析 transcript 时不只看对话内容——工具调用、技能使用、错误日志等元数据同等重要。工具配置出错只有通过元数据才能发现。
11
实用
"You have the ability to steer how dreaming agents go about the problem — tell them what's important and what's not."
Dreaming 不是黑盒。你可以定向引导:告诉整理 Agent 你的场景里哪类模式值得关注、哪类可以忽略。
📊 数据 & 案例
12
数据
"Harvey's task completion rates rose roughly 6x in internal testing once Dreaming was turned on."
法律 AI 公司 Harvey 的案例:Agent 之前在 session 间反复忘记文件格式坑和工具变通方法,同样的任务反复失败。Dreaming 持久化了这些经验,打破了失败循环。Anthropic 内部也看到 +10 百分点任务成功率、+8.4% docx 质量提升。
13
成本
"It sounds expensive — why chuck extra resources at this? But agents one-shot things more effectively, so costs go down."
Dreaming 看似额外开销,实际上是投资:Agent 变聪明后,执行任务的 token 消耗和重试次数减少,总成本反而下降。
💬 全场最佳
14
犀利
"At which point are we reinventing databases from first principles again?"
全场最佳提问。版本控制、并发锁、权限系统——这些不就是数据库几十年前解决的问题吗?Lamis 坦承"确实在回归这些实践",但强调路径是:先让 Agent 自由探索,观察什么有效,再把验证过的模式硬编码回 harness。
15
实用
"This is not coding specific — I use memory for presentations, writing style, slide preferences. It develops over time."
记忆系统的价值远不止代码。Lamis 自己就用它来记住写作风格、演示偏好。任何需要"下次做得更好"的场景都适用。