Meta-Meta-Prompting: The Secret to Making AI Agents Work
Garry Tan, CEO of Y Combinator, presents GBrain, his personal AI agent system built on 100,000 pages of structured knowledge and over 100 modular skills. The core architecture follows a “thin harness, fat skills, fat data” philosophy: a lightweight runtime like OpenClaw routes messages to self-contained skill files, which are themselves created and improved by a meta-skill called Skillify. Tan illustrates the compounding value through the “book-mirror” pipeline, which cross-references a book’s ideas with his actual life events, journal entries, and meeting notes. He details the evolution from an error-prone first version to a reliable workflow using multi-model cross-modal evaluation and deep brain retrieval. Other examples include automated meeting preparation that synthesizes months of accumulated context and entity propagation that updates every related person or company page after a conversation. The article provides a concrete architecture overview, evidence of iterative improvement, and a four-step starting guide for developers building personal compounding AI systems.
People keep asking me why I am spending my nights coding til 2AM. I have a job and a big one, as CEO of Y Combinator. We help thousands of builders a year to create their dreams of building real startups with real revenue that grow fast.
In the last 5 months, AI made me a builder again. Late last year, the tools got good enough that I went back to building. Not toy projects. Real systems that compound. I want to show you, with specific examples, what personal AI actually looks like when you stop treating it as a chat window and start treating it as an operating system. And I give it away as open source and in articles like this because I want you to speed up with me.
This is part of a series: Fat Skills, Fat Code, Thin Harness introduced the core architecture. Resolvers covered the routing table for intelligence. The LOC Controversy was about how every technical person just multiplied themselves by 100x to 1000x. Naked models are stupider argued that the model is the engine, not the car. And the skillify manifesto explained why LangChain raised $160M and gave you a squat rack and dumbell set without a workout plan, and then gave you that workout plan you needed.
人们总是问我,为什么我每晚都编码到凌晨两点。我有一份工作,而且是一份大工作——Y Combinator 的 CEO。每年我们帮助成千上万的构建者,实现他们打造真正初创公司的梦想,这些公司有真实的收入并快速增长。
在过去的五个月里,AI 让我重新成为一个构建者。去年年底,工具变得足够好了,我回到了构建的状态。不是玩具项目,而是真正能产生复利的系统。我想用具体的例子向你展示,当你不再把个人 AI 当成一个聊天窗口,而是当作一个操作系统时,它实际的样子。我把它开源,通过文章分享出来,因为我希望你能和我一起加速。
这是一个系列的一部分:Fat Skills, Fat Code, Thin Harness 介绍了核心架构;Resolvers 涵盖了智能的路由表;The LOC Controversy 讲的是每个技术人员如何将自己放大 100 到 1000 倍;Naked models are stupider 论述了模型是引擎而不是汽车;skillify 宣言则解释了为什么 LangChain 拿了 1.6 亿美元融资却只给了你一副深蹲架和哑铃而没有训练计划,然后提供了你真正需要的训练计划。
Last month I was reading Pema Chödrön's When Things Fall Apart. It's 162 pages, 22 chapters on Buddhist approaches to suffering, groundlessness, and letting go. A friend recommended it during a hard period.
I asked my AI to do a book mirror.
What that means concretely: The system extracted all 22 chapters of the book, and then, for each chapter, ran a sub-agent that did two things simultaneously: summarized the author's ideas, and then mapped every idea to my actual life. Not generic "this applies to leaders" pablum. Specific mapping. It knows my family history (immigrant parents, dad from Hong Kong and Singapore, mom from Burma). It knows my professional context (running YC, building open-source tools, mentoring thousands of founders). It knows what I've been reading, what I've been thinking about at 2am, what my therapists and I are working on.
The output was a 30,000-word brain page. Each chapter rendered as two columns: what Pema says, and how it maps to what I'm actually living through. The chapter on groundlessness connected to a specific founder conversation I'd had the week before. The chapter on fear mapped to patterns my therapist had identified. The chapter on letting go referenced a late-night session where I'd written about the creative freedom I'd found this year.
The whole thing took about 40 minutes. A $300/hour therapist reading this book and applying it to my life couldn't do this in 40 hours, because they don't have the full graph of my professional context, my reading history, my meeting notes, and my founder relationships all loaded and cross-referenceable.
I've done this with over 20 books now: Amplified (Dion Lim), Autobiography of Bertrand Russell, Designing Your Life, Drama of the Gifted Child, Finite and Infinite Games, Gift from the Sea (Lindbergh), Siddhartha (Hesse), Steppenwolf (Hesse), The Art of Doing Science and Engineering (Hamming), The Dream Machine, The Book on the Taboo Against Knowing Who You Are (Alan Watts), What Do You Care What Other People Think (Feynman), When Things Fall Apart (Pema Chodron), A Brief History of Everything (Ken Wilber), and more. Each one gets richer because the brain gets richer. The second mirror knew about the first. The twentieth knew about all nineteen.
上个月我读了 Pema Chödrön 的《当生命坠落时》。全书 162 页,22 章,讲的是佛教如何看待痛苦、无常和放手。一位朋友在我艰难时期推荐了这本书。
我让我的 AI 做一个“图书镜像”。
具体来说:系统提取了全书 22 章,然后对每一章运行一个子代理,同时做两件事:总结作者的观点,并将每个观点映射到我的实际生活。不是那种“这适用于领导者”的泛泛之谈,而是具体的映射。它了解我的家庭背景(父母是移民,父亲来自香港和新加坡,母亲来自缅甸),了解我的职业背景(运营 YC,构建开源工具,指导数千名创始人),知道我在读什么,凌晨两点在想什么,我的治疗师和我在做什么。
输出是一篇 3 万字的脑页面。每一章呈现为两栏:Pema 说了什么,以及它如何映射到我正在经历的生活。关于无常的那一章,与我上周与某位创始人的对话联系起来。关于恐惧的那一章,映射到我的治疗师已经识别的模式。关于放手的那一章,引用了我深夜写下的关于今年找到的创作自由的笔记。
整个过程花了大约 40 分钟。一个每小时收费 300 美元的治疗师,读了这本书再应用到我的生活,40 小时也做不到这一点。因为他们没有完整加载并交叉引用我的职业背景、阅读历史、会议笔记和创始人关系图谱。
我现在已经这样做了 20 多本书:《Amplified》(Dion Lim)、《Bertrand Russell 自传》、《设计你的人生》、《天才儿童的悲剧》、《有限与无限的游戏》、《海之礼》(Lindbergh)、《悉达多》(Hesse)、《荒原狼》(Hesse)、《科学与工程的艺术》(Hamming)、《梦想机器》、《认识你自己的禁忌之书》(Alan Watts)、《你在乎别人怎么想》(Feynman)、《当生命坠落时》(Pema Chodron)、《万法简史》(Ken Wilber) 等。每一本都更加丰富,因为大脑本身在变丰富。第二个镜像知道了第一个,第二十个知道了前十九个。
The first book mirror I did was terrible. Version 1 had three factual errors about my family. It said my parents were divorced when they weren't. Said I grew up in Hong Kong when I was born in Canada. Basic stuff that could have damaged trust if I'd shared it.
So I added a mandatory fact-check step. Every mirror now runs cross-modal evaluation against known facts in the brain before it ships. Opus 4.7 1M catches precision errors. GPT-5.5 catches missing context. DeepSeek V4-Pro catches when something reads as generic.
Then I upgraded to deep retrieval with GBrain tool use. The original version was good at synthesis but weak on specificity. Version 3 does per-section brain searches. Every right-column entry cites actual brain pages. When the book talks about dealing with difficult conversations, it doesn't just synthesize general principles. It pulls from my actual meeting notes with specific founders who were having tough conversations with co-founders. Or that idea I had on a Thursday hanging out with my brother James. Or the IM chat I had with my college roommate when I was 19. It's uncanny.
This is what skillification (using /skillify in GBrain) means in practice. I took the first manual attempt, extracted the repeatable pattern, wrote a tested skill file with triggers and edge cases, and every fix compounded across all future book mirrors.
我做的第一个图书镜像非常糟糕。V1 版本有三个关于我家庭的事实性错误。它说我父母离婚了,实际上没有;说我在香港长大,而我出生在加拿大。这些基本错误如果分享出去,会破坏信任。
所以我增加了一个强制的事实核查步骤。现在每个镜像在输出之前,都会针对大脑中的已知事实进行跨模型评估。Opus 4.7 1M 捕捉精度错误,GPT-5.5 捕捉缺失的上下文,DeepSeek V4-Pro 捕捉那些读起来太泛泛的内容。
然后我升级到使用 GBrain 工具的深度检索。原始版本擅长综合,但缺乏具体性。V3 版本对每个部分进行大脑搜索。每一个右栏条目都引用实际的脑页面。当书里讲到处理棘手对话时,它不只是综合一般原则,而是从我与特定创始人的实际会议笔记中抽取——那些创始人正与联合创始人进行艰难的对话。或者是我周四和哥哥 James 闲逛时冒出的想法,或者是我 19 岁时和大学室友的即时消息聊天。这简直不可思议。
这就是技能化(在 GBrain 中使用 /skillify)在实践中的含义。我拿第一次手动尝试,提取出可重复的模式,编写了一个经过测试的技能文件,包含触发条件和边缘情况,每个修复都会在所有未来的图书镜像中产生复利。
Here's where it gets recursive, and where I think the biggest insight is.
The system that runs my life didn't exist as a monolith. It was assembled from skills. And those skills were themselves created by a skill.
Skillify is a meta-skill that creates new skills. When I encounter a workflow I'm going to repeat, I say "skillify this" and it examines what just happened, extracts the repeatable pattern, writes a tested skill file with triggers and edge cases, and registers it in the resolver. The book-mirror pipeline was skillified from the first time I did it manually. The meeting-prep workflow was skillified after I noticed I was doing the same steps before every call.
Skills compose. Book-mirror calls brain-ops for storage, enrich for context, cross-modal-eval for quality, and pdf-generation for output. Each skill is focused on one thing. They chain together to create complex workflows. When I improve one skill, every workflow that uses it gets better automatically. No more "forgot to mention this edge case in my prompt." The skill remembers.
这就是递归之处,也是我认为最大的洞见所在。
管理我生活的系统并非一个整体。它由技能组装而成,而这些技能本身又是由一个技能创建的。
Skillify 是一个元技能,用于创建新的技能。当我遇到一个会重复的工作流时,我说“skillify this”,它会检查刚才发生了什么,提取可重复的模式,写一个经过测试的技能文件(包含触发条件和边缘情况),然后在解析器/registrar 中注册。图书镜像管道从我第一次手动操作时就技能化了。会议准备工作流则是在我注意到每次通话前都在做同样的步骤之后被技能化的。
技能可以组合。图书镜像调用 brain-ops 进行存储,enrich 获取上下文,cross-modal-eval 保证质量,pdf-generation 生成输出。每个技能专注于一件事。它们链接在一起来创建复杂的工作流。当我改进一个技能时,所有使用它的工作流都会自动变好。不再有“忘记在提示词中提到这个边缘情况”的情况。技能会记住这些。
Demis Hassabis came to YC for a fireside chat. Sebastian Mallaby's biography of him had just come out.
I asked the system to prep me.
In under two minutes it pulled: Demis's full brain page (which had been accumulating for months from articles, podcast transcripts, and my own notes). His published beliefs about AGI timelines ("50% scaling, 50% innovation," thinks AGI is 5-10 years away). The Mallaby biography highlights. His stated research priorities (continual learning, world models, long-term memory). Cross-references to things I've said publicly about AI. Three demo scripts for showing the brain's multi-hop reasoning capability during the conversation. And a set of conversation hooks based on where our worldviews overlap and diverge.
This wasn't just a better Google search. This was preparation that used my accumulated context about Demis, my own positions, and the strategic goals for the conversation. The system prepped not just facts, but angles.
Demis Hassabis 来 YC 做炉边谈话。Sebastian Mallaby 为他写的传记刚出版。
我让系统为我做准备。
不到两分钟,它就拉取到了:Demis 的完整脑页面(几个月来从文章、播客文字稿和我自己的笔记中积累的);他公开发表的关于 AGI 时间表的观点——“50% 规模扩展,50% 创新”,认为 AGI 在 5-10 年内实现;Mallaby 传记的亮点;他陈述的研究重点(持续学习、世界模型、长期记忆);与我公开谈论过 AI 的内容的交叉引用;三个演示脚本,用于在对话中展示大脑的多跳推理能力;以及一套基于我们世界观重合与分歧的对话钩子。
这不仅仅是更好的谷歌搜索。这是利用我关于 Demis 的积累背景、我自己的立场以及对话的战略目标所做的准备。系统准备的不仅仅是事实,还有角度。
I maintain a structured knowledge base with about 100,000 pages. Every person I meet gets a page with a timeline, a state section (what's currently true), open threads, and a score. Every meeting gets a transcript, a structured summary, and something I call entity propagation: after every meeting, the system walks through every person and company mentioned and updates their brain pages with what was discussed. Every book I read gets a chapter-by-chapter mirror. Every article, podcast, and video I engage with gets ingested, tagged, and cross-referenced.
The schema is simple. Each page has: compiled truth at the top (the current best understanding), an append-only timeline below (events in chronological order), and raw data sidecars for source material. Think of it as a personal Wikipedia where every page is continuously updated by an AI that was at the meeting, read the email, watched the talk, and ingested the PDF.
Here's an example of how this compounds. I meet a founder at office hours. The system creates or updates their person page, their company page, cross-references the meeting notes, checks if I've met them before (and surfaces what we discussed last time), checks their application data, pulls their latest metrics, and identifies if any of my portfolio companies or contacts are relevant to their problem. By the time I walk into the next meeting with them, the system has a full context pack ready.
This is the difference between having a filing cabinet and having a nervous system. The filing cabinet stores things. The nervous system connects them, flags what's changed, and surfaces what's relevant to right now.
我维护着一个结构化的知识库,大约有 10 万个页面。我遇到的每个人都会有一个页面,包含时间线、状态栏(当前真实情况)、未结主题和评分。每次会议都会有文字记录、结构化摘要,以及我称之为“实体传播”的内容:每次会议后,系统会遍历提到的每个人和每家公司,用讨论的内容更新他们的脑页面。我读的每本书都会有一个逐章镜像。我参与的每篇文章、播客和视频都会被摄取、标记和交叉引用。
模式很简单。每个页面都有:顶部是编译后的真相(当前最佳理解),下面是仅追加的时间线(按时间顺序排列的事件),以及用于源材料的原始数据侧栏。可以把它想象成一个个人维基百科,其中每个页面都由一个 AI 持续更新,这个 AI 参加了会议、读了邮件、看了演讲、摄取了 PDF。
这里有一个复利的例子。我在办公时间会见一位创始人。系统会创建或更新他们的个人页面、公司页面,交叉引用会议笔记,检查我是否以前见过他们(并显示上次讨论的内容),检查他们的申请数据,拉取他们最新的指标,并识别我的投资组合公司或联系人中是否有与他们的相关问题相关的。等到我下次与他们开会时,系统已经准备好了完整的上下文包。
这就是拥有文件柜和拥有神经系统之间的区别。文件柜存储东西。神经系统连接它们,标记出已变化的内容,并显示出与当下相关的内容。
Here's how it works. I think this is the right way to build personal AI, and I open-sourced the whole thing so you can build it yourself.
The harness is thin. OpenClaw is the runtime. It receives my messages, figures out which skill applies, and dispatches. A few thousand lines of routing logic. It doesn't know anything about books or meetings or founders. It just routes.
The skills are fat. Over 100 of them now, each a self-contained markdown file with detailed instructions for one specific task. You've already seen book-mirror and meeting-prep above. Here are a few more that ship with GBrain:
- meeting-ingestion: After every meeting, it pulls the transcript, creates a structured summary, and then walks through every person and company mentioned and updates their brain pages with what was discussed. The meeting page is not the end product. The entity propagation back to every person and company page is the real value.
- enrich: Give it a person's name. It pulls from five different sources, merges everything into a single brain page with career arc, contact info, meeting history, and relationship context. Cited sources on every claim.
- media-ingest: Handles video, audio, PDF, screenshots, GitHub repos. Transcribes, extracts entities, files to the right brain location. I use this constantly for YouTube videos, podcasts, and voice memos.
- perplexity-research: Brain-augmented web research. Searches the web via Perplexity, but before synthesizing, checks what the brain already knows so it can tell you what's actually new vs. what you've already captured. I have dozens more I've built for my own work that I'll probably open source: email-triage, investor-update-ingest that detects portfolio updates in my email and extracts metrics into company pages, calendar-check for conflict detection and travel impossibility, and a whole journalistic research stack I use for civic work. Each skill encodes operational knowledge that would take a new human assistant months to learn. When someone asks how I "prompt" my AI, the answer is: I don't. The skills are the prompts.
The data is fat. 100,000 pages of structured knowledge in the brain repo. Every person, company, meeting, book, article, and idea I've engaged with, all linked, all searchable, all growing every day.
The code is fat. The code that feeds it (scripts for transcription, OCR, social media archival, calendar sync, API integrations) matters too, but the data is where the compound value lives. I run more than 100 crons per day that check all the things: social media, Slack, email, whatever I pay attention to, my OpenClaw/Hermes Agents look at for me too.
The models are interchangeable. I run Opus 4.7 1M for precision. GPT-5.5 for recall and exhaustive extraction. DeepSeek V4-Pro for creative work and third perspectives. Groq with Llamma for speed. The skill decides which model to call for which task. The harness doesn't care. When someone asks "which AI model is best," the answer is: wrong question. The model is just the engine. Everything else is the car.
下面是它的运作方式。我认为这是构建个人 AI 的正确方式,而且我已经开源了全部内容,这样你也可以自己构建。
框架是薄的。OpenClaw 是运行时。它接收我的消息,判断哪个技能适用,然后分发。几千行的路由逻辑。它对书籍、会议或创始人一无所知,只管路由。
技能是厚的。现在有超过 100 个技能,每个都是一个独立的 markdown 文件,包含针对一个特定任务的详细指令。你已经看到了上面的 book-mirror 和 meeting-prep。以下是随 GBrain 一起提供的另外几个:
- meeting-ingestion:每次会议后,它会拉取文字记录,创建结构化摘要,然后遍历提到的每个人和每家公司,用讨论的内容更新他们的脑页面。会议页面不是最终产品,实体传播回每个个人和公司页面才是真正的价值。
- enrich:输入一个人的名字。它会从五个不同来源拉取数据,将所有信息合并到一个脑页面中,包含职业轨迹、联系方式、会议历史和关系背景。每个主张都有引用来源。
- media-ingest:处理视频、音频、PDF、截图、GitHub 仓库。转写、提取实体、归档到正确的大脑位置。我经常用它来处理 YouTube 视频、播客和语音备忘录。
- perplexity-research:大脑增强的网络研究。通过 Perplexity 搜索网络,但在综合之前,它会检查大脑已经知道什么,这样就能告诉你什么是真正新的,什么是你已经捕获的。 我为自己工作构建了更多的技能,将来可能会开源:email-triage、investor-update-ingest(检测邮件中的投资组合更新并将指标提取到公司页面)、calendar-check(用于冲突检测和旅行不可行性检测),以及一整套用于公民工作的新闻研究栈。每个技能都编码了操作知识,一个新人助理需要几个月才能学会。当有人问我如何给我的 AI“写提示词”时,答案是:我不写。技能本身就是提示词。
数据是厚的。大脑仓库中有 10 万个结构化的知识页面。我接触过的每个人、公司、会议、书籍、文章和想法,全部链接、全部可搜索、每天都在增长。
代码是厚的。喂养它的代码(用于转录、OCR、社交媒体归档、日历同步、API 集成的脚本)也很重要,但数据才是复利价值的所在。我每天运行超过 100 个 cron 作业来检查所有东西:社交媒体、Slack、邮件,以及我关注的任何东西,我的 OpenClaw/Hermes Agents 也在帮我看。
模型是可互换的。我为了精确性使用 Opus 4.7 1M,为了召回和穷举提取使用 GPT-5.5,为了创造性工作和第三方视角使用 DeepSeek V4-Pro,为了速度使用 Groq + Llama。技能决定为哪个任务调用哪个模型,框架不在乎。当有人问“哪个 AI 模型最好”时,答案是:问错了问题。模型只是引擎,其他一切都是汽车。
People ask me about productivity. I don't think about it that way. What I think about is compounding.
Every meeting I take adds to the brain. Every book I read enriches the context for the next book. Every skill I build makes the next workflow faster. Every person page I update makes the next meeting prep sharper. The system today is 10x what it was two months ago, and two months from now it'll be 10x again.
When I'm still up at 2am coding (and I am, regularly, because AI gave me back the joy of building), I'm not just writing software. I'm adding to a system that gets better every hour. 100 cronjobs 24/7. The meeting ingestion runs automatically. The email triage runs every 10 minutes. The knowledge graph enriches itself from every conversation. The system processes daily transcripts and extracts patterns I missed in real time.
This is not a writing tool. It's not a search engine. It's not a chatbot. It's a second brain that actually works, not as a metaphor, but as a running system with 100,000 pages, 100+ skills, 15 cron jobs, and the accumulated context of every professional relationship, meeting, book, and idea I've engaged with in the last year.
I open-sourced the whole stack. GStack is the coding skill framework (87,000+ stars) that I used to build it. I still use it as a skill inside OpenClaw/Hermes Agent when the agent needs to code. There's a great programmable browser (both headed and headless) in there. GBrain is the knowledge infrastructure. OpenClaw and Hermes Agent are the harnesses, you should choose but I usually do both. The data repos are on GitHub.
The thesis is simple: the future belongs to individuals who build compounding AI systems, not to individuals who use corporate-owned centralized AI tools. The difference is the difference between keeping a journal and having a nervous system.
人们问我关于生产力的问题。我不那样思考。我思考的是复利。
我参加的每一次会议都会为大脑增加内容。我读的每一本书都会丰富下一本书的上下文。我构建的每一个技能都会让下一个工作流更快。我更新的每个人物页面都会让下一次会议准备更精准。今天的系统是两个月前的十倍,而两个月后,它还会再提升十倍。
当我凌晨两点还在编码时(我经常这样,因为 AI 让我重新找回了构建的乐趣),我不仅仅是在写软件。我是在为一个每小时都在变好的系统添砖加瓦。100 个 cron 作业 24/7 运行。会议摄入自动运行,邮件分类每 10 分钟运行一次,知识图谱从每次对话中自我丰富。系统处理每日转录,并实时提取我错过的模式。
这不是一个写作工具,不是搜索引擎,不是聊天机器人。这是一个真正有效的第二大脑,不是比喻意义上的,而是一个运行中的系统——包含 10 万个页面、100 多个技能、15 个 cron 作业,以及过去一年我参与过的每一次职业关系、会议、书籍和想法的累积上下文。
我开源了整个技术栈。GStack 是我用来构建它的编码技能框架(87,000+ 星标)。当代理需要编码时,我仍然在 OpenClaw/Hermes Agent 内部将其作为一个技能使用。里面有一个很棒的可编程浏览器(支持有头和无头模式)。GBrain 是知识基础设施。OpenClaw 和 Hermes Agent 是框架,你可以选择,但我通常两者都用。数据仓库在 GitHub 上。
论点很简单:未来属于那些构建复利型 AI 系统的个人,而不是使用企业拥有的集中化 AI 工具的个人。区别在于写日记和拥有神经系统之间的区别。
If you want to build this:
- Pick a harness. OpenClaw, Hermes Agent, or build your own from scratch with Pi. Keep it thin. The harness is just the router. Host it on your spare computer at home with Tailscale, or use Render or Railway in the cloud.
- Start a brain with GBrain. I got inspired by Karpathy's LLM Wiki, implemented it in OpenClaw, and extended it into GBrain. It's the best retrieval system I've benchmarked (97.6% recall on LongMemEval, beating MemPalace with no LLM in the retrieval loop) and it ships 39 installable skills including everything described in this article. One command to install. A git repo where every person, meeting, article, and idea gets a page.
- Do something interesting. Don't start by planning your skill architecture. Start by doing a thing. Write a report. Research a person. Download a season of NBA scores and build a prediction model for your sports bets. Analyze your portfolio. Whatever you actually care about. Do it with your agent, iterate until it's good, and then run Skillify (the meta-skill from earlier) to extract the pattern into a reusable skill. Then run check_resolvable to verify the new skill is wired into the resolver. That loop turns one-off work into compounding infrastructure.
- Keep using it and look at the output. The skill will be mediocre at first. That's the point. Use it, read what it produces, and when something is off, run cross-modal eval: send the output through multiple models and have them score each other on the dimensions you care about. That's how I caught the factual errors in book-mirror. The fix got baked into the skill, and every mirror since has been clean. In six months you'll have something no chatbot can replicate, because the value isn't in the model. It's in what you've taught the system about your specific life, work, and judgment. The first thing I built with this system was terrible. The hundredth was something I'd trust with my calendar, my inbox, my meeting prep, and my reading list. The system learned. I learned. The compound curve is real.
Fat skills. Fat code. Thin harness. The LLM on its own is just an engine. You can build your own car.
Everything I described here, all the skills, the book mirror pipeline, the cross-modal eval framework, the skillify loop, the resolver architecture, plus 30+ installable skillpacks, is open source and free on GitHub: github.com/garrytan/gbrain. Go build.
如果你想构建这个系统:
- 选择一个框架。OpenClaw、Hermes Agent,或者用 Pi 从头开始构建你自己的。保持框架轻量。框架只是路由器。把它托管在你家里的备用计算机上(用 Tailscale),或者在云端使用 Render 或 Railway。
- 用 GBrain 启动一个大脑。我从 Karpathy 的 LLM Wiki 获得灵感,在 OpenClaw 中实现并扩展为 GBrain。这是我 benchmark 过最好的检索系统(在 LongMemEval 上达到 97.6% 的召回率,在检索循环中没有使用 LLM 就击败了 MemPalace),它提供了 39 个可安装技能,包括本文描述的所有内容。一条命令安装。一个 git 仓库,每个人员、会议、文章和想法都有一个页面。
- 做点有趣的事情。不要从规划你的技能架构开始。从做一件事开始。写一份报告。研究一个人。下载一个赛季的 NBA 比分并构建体育博彩预测模型。分析你的投资组合。任何你真正在意的事情。用你的代理做,迭代直到做好,然后运行 Skillify(之前的元技能)将模式提取为可复用的技能。然后运行 check_resolvable 验证新技能已接入解析器。这个循环将一次性工作转变为复利型基础设施。
- 持续使用并查看输出。技能一开始会很平庸。这正是目的所在。使用它,阅读它产生的内容,当有不对的地方时,运行跨模型评估:将输出通过多个模型,让它们在你关心的维度上相互评分。我就是这样发现图书镜像中的事实错误的。修复被烘焙进技能中,从此每个镜像都很干净。六个月后,你将拥有任何聊天机器人都无法复制的东西,因为价值不在于模型,而在于你教给系统关于你特定生活、工作和判断力的内容。 我用这个系统构建的第一个东西很糟糕,第一百个则是我愿意托付日历、收件箱、会议准备和阅读清单的东西。系统学习了,我也学习了。复利曲线是真实的。
厚技能。厚代码。薄框架。LLM 本身只是引擎。你可以自己造车。
我在这里描述的所有东西——所有技能、图书镜像管道、跨模型评估框架、skillify 循环、解析器架构,以及 30 多个可安装技能包——都是开源的,在 GitHub 上免费提供:github.com/garrytan/gbrain。去构建吧。