The Bitter Lesson vs Agent Harnesses & World Model, Solving Poker and Diplomacy, Debating RL+Reasoning with Ilya, what's *wrong* with the System 1/2 analogy, and the challenges of Test-Time Scaling
Share this post
Scaling Test Time Compute to Multi-Agent…
Share this post
The Bitter Lesson vs Agent Harnesses & World Model, Solving Poker and Diplomacy, Debating RL+Reasoning with Ilya, what's *wrong* with the System 1/2 analogy, and the challenges of Test-Time Scaling