Glean 拾遗
Daily · timeline

A few picks a day.

Sat, Jun 6, 2026 3picks
← 06-05
Calendar ▾
2026 · 06
MoTuWeThFrSaSu 123456789101112131415161718192021222324252627282930
has picks today
06-07 →
06:00

What Really Differentiates LLMs Happens After Pretraining: A Full Post-Training Pipeline Breakdown

大模型真正拉开差距的地方在预训练之后:一条后训练链路的完整拆解

A comprehensive deep-dive into the full LLM training pipeline, arguing that the real capability gap in 2026 lies not in pretraining but in the post-training stack: instruction tuning, RL, reward design, Agent training, and distillation. The article breaks down the end-to-end process step-by-step — from data recipes and system architecture constraints, through the four-stage post-training pipeline (Cold Start SFT → GRPO-based Reasoning RL → Rejection Sampling FT → Alignment RL), Grader/Reward evaluation loops, Agent training with PARL and Meta-Harness, to distillation and deployment. Key engineering insights include DeepSeek-R1's public recipe, why GRPO simplifies PPO by removing the value network, PRM vs ORM trade-offs, and the shift from optimizing answers to optimizing harness programs. Targeted at engineers who want to trace concrete capability gains back to specific training stages.

tw93.fun · 27 min · Agents · LLM · Performance
06:00

AI Amplifies Output, Not Input: My /learn Workflow for Deep Technical Dives

AI 放大的是输出,不是输入:如何用 /learn 流程深入学习一个技术领域

The author shares a personal workflow for deep learning in the AI era: treat learning like coding, structured as collect → filter → outline → draft → AI-assisted tightening → self-review. The core argument is that AI's real value lies in amplifying your output, not in summarizing input. Using a recent deep dive into LLM training as an example, the post introduces the /learn skill in the open-source Waza toolkit to industrialize this process. Recommended for engineers wondering how to maintain depth while leveraging AI.

tw93.fun · 2 min · AI · Framework
06:00

Building an AI Second Brain with Claude and Obsidian: The Complete Tutorial

用 Claude 和 Obsidian 搭建 AI 第二大脑:从零到可用的完整教程

A hands-on tutorial on connecting Claude to an Obsidian vault, turning your notes into a queryable knowledge engine that reasons over your own context. Covers vault structure (PARA method), AI-first note design, three Claude integration methods (Projects upload, Claude Code direct access, MCP servers), and five ready-to-use workflows (weekly digest, research synthesis, idea connection, knowledge gap auditing, daily briefing). Best for developers, researchers, and knowledge workers building a persistent personal knowledge system.

x.com · 13 min · Agents · AI · Framework