Sitemap - 2025 - Latent.Space

[State of Code Evals] After SWE-bench, Code Clash & SOTA Coding Benchmarks recap — John Yang

[State of Post-Training] From GPT-4.1 to 5.1: RLVR, Agent & Token Efficiency — Josh McGrath, OpenAI

[State of RL/Reasoning] IMO/IOI Gold, OpenAI o3/GPT-5, and Cursor Composer — Ashvin Nair, Cursor

[State of AI Startups] Memory/Learning, RL Envs & DBT-Fivetran — Sarah Catanzaro, Amplify

One Year of MCP — with David Soria Parra and AAIF leads from OpenAI, Goose, Linux Foundation

Steve Yegge's Vibe Coding Manifesto: Why Claude Code Isn't It & What Comes After the IDE

⚡️GPT5-Codex-Max: Training Agents with Personality, Tools & Trust — Brian Fioca + Bill Chen, OpenAI

SAM 3: The Eyes for AI — Nikhila & Pengchuan (Meta Superintelligence), ft. Joseph Nelson (Roboflow)

⚡️Jailbreaking AGI: Pliny the Liberator & John V on Red Teaming, BT6, and the Future of AI Security

AI to AE's: Grit, Glean, and Kleiner Perkins' next Enterprise AI hit — Joubin Mirzadegan, Roadrunner

The Future of Email: Superhuman CTO on Your Inbox As the Real AI Agent (Not ChatGPT) — Loïc Houssier

World Models & General Intuition: Khosla's largest bet since LLMs & OpenAI

[Subscribers only] Dev Writers Retreat 2025: WRITING FOR HUMANS — 10 Fellowship spots left!

After LLMs: Spatial Intelligence and World Models — Fei-Fei Li & Justin Johnson, World Labs

⚡️ 10x AI Engineers with $1m Salaries — Alex Lieberman & Arman Hezarkhani, Tenex

The Agent Labs Thesis

Anthropic, Glean & OpenRouter: How AI Moats Are Built with Deedy Das of Menlo Ventures

Biohub for Non-Biologists: Behind Priscilla Chan and Mark Zuckerberg's plan to cure all diseases

⚡ Inside GitHub’s AI Revolution: Jared Palmer Reveals Agent HQ & The Future of Coding Agents

⚡ [AIE CODE Preview] Inside Google Labs: Building The Gemini Coding Agent — Jed Borovik, Jules

⚡️ Ship AI recap: Agents, Workflows, and Python — w/ Vercel CTO Malte Ubl

Agentic Commerce Protocol and building the Economic Infrastructure for AI — with Emily Glassberg Sands, Head of Data & AI at Stripe

Why RL Won — Kyle Corbitt, OpenPipe (acq. CoreWeave)

Developers as the distribution layer of AGI (OpenAI Dev Day 2025, ft. Sherwin Wu and Christina Huang)

DevDay 2025: Apps SDK, Agent Kit, MCP, Codex and why Prompting is More Important than Ever

Taste is your moat — with Dylan Field, Figma

Taste is your Moat (Dylan Field of Figma)

Amp: The Emperor Has No Clothes

How GPT5 + Codex took over Agentic Coding — ft. Greg Brockman, OpenAI

Context Engineering for Agents - Lance Martin, LangChain

A Technical History of Generative Media — with Gorkem and Batuhan from Fal.ai

Better Data is All You Need — Ari Morcos, Datology

"RAG is Dead, Context Engineering is King" — with Jeff Huber of Chroma

Can coding agents self-improve?

GPT-5's Vision Checkup: a frontier VLM, but not a new SOTA

GPT-5's Router: how it works and why Frontier Labs are now targeting the Pareto Frontier

GPT-5 Hands-On: Welcome to the Stone Age

Cline: The Open Source Code Agent — with Saoud Rizwan and Nik Pash

The RLVR Revolution — with Nathan Lambert (AI2, Interconnects.ai)

AI is Eating Search

Cline: the open source coding agent that doesn't cut costs

The Tiny Teams Playbook

Personalized AI Language Education — with Andrew Hsu, Speak

The Hyperstitions of Moloch

AI Video Is Eating The World — Olivia and Justine Moore, a16z

Information Theory for Language Models: Jack Morris

Scaling Test Time Compute to Multi-Agent Civilizations: Noam Brown

Scaling Test Time Compute to Multi-Agent Civilizations — Noam Brown, OpenAI

Andrej Karpathy on Software 3.0: Software in the Age of AI (UPDATED with Full Transcript)

The Shape of Compute — with Chris Lattner for Modular

AI Engineering Goes Mainstream

God is hungry for Context: First thoughts on o3 pro

The Utility of Interpretability — Emmanuel Amiesen, Anthropic

The Utility of Interpretability — Emmanuel Amiesen

[AIEWF Preview] Containing Agent Chaos — Solomon Hykes

AIEWF 2025 Online! (and Attendee Guide)

[AIEWF Preview] Gemini in 2025 and Realtime Voice AI

[AIEWF Preview] CloudChef: Your Robot Chef - Michellin-Star food at $12/hr (w/ Kitchen tour!)

Factory.ai: The A-SWE Droid Army

The AI Coding Factory

SWE Agents Too Cheap To Meter, The Token Data War, and the rise of Tiny Teams

[AIEWF Preview] Multi-Turn RL for Multi-Hour Agents — with Will Brown, Prime Intellect

⚡️Multi-Turn RL for Multi-Hour Agents — with Will Brown, Prime Intellect

ChatGPT Codex: The Missing Manual

Claude Code: Anthropic's Agent in Your Terminal

⚡️The Rise and Fall of the Vector DB Category

Please stop forcing Clippy on those who want Anton

Why Every Agent needs Open Source Cloud Sandboxes

AI Agents, meet Test Driven Development

In the Matter of OpenAI vs LangGraph

AI Engineer Speaker Applications Close This Weekend (for AIE SF, Jun 3-5)

⚡️GPT 4.1: The New OpenAI Workhorse

⚡️GPT 4.1: The New OpenAI Workhorse

SF Compute: Commoditizing Compute to solve the GPU Bubble forever

The Creators of Model Context Protocol

Unsupervised Learning x Latent Space Crossover Special

The Agent Network — Dharmesh Shah

Agent Engineering

Building Snipd: The AI Podcast App for Learning

⚡️The new OpenAI Agents Platform

Why MCP Won

⚡️How Claude 3.7 Plays Pokémon

Open Operator, Serverless Browsers and the Future of Computer-Using Agents

AI Engineer Summit Online (UPDATED)

The Inventors of Deep Research

Bee AI: The Wearable Ambient Agent

The AI Architect — Bret Taylor

Agent Engineering with Pydantic + Graphs — with Samuel Colvin

LLM Gateway: The One Decision That Removes 100 AI Engineering Decisions

The Agent Reasoning Interface: o1/o3, Claude 3, ChatGPT Canvas, Tasks, and Operator — with Karina Nguyen of OpenAI

Outlasting Noam Shazeer, crowdsourcing Chai AI with >1.4m DAU, and becoming the "Western DeepSeek" — with William Beauchamp, Chai Research

Why o3-mini *had* to be free: the coming DeepSeek R1, 2.0 Flash, and Sky-T1 Price War

Everything you need to run Mission Critical Inference (ft. DeepSeek v3 + SGLang)

[Ride Home] Simon Willison: Things we learned about LLMs in 2024

o1 isn’t a chat model (and that’s the point)

Beating Google at Search with Neural PageRank and $5M of H200s — with Will Bryk of Exa.ai

AI Engineering for Art — with comfyanonymous, of ComfyUI

Announcing AI Engineer Summit NYC: All in on Agent Engineering + Leadership