3 Comments
User's avatar
Alex Lastovetskiy's avatar

The AI engineering focus here fills a real gap in the ecosystem.

Liberty's avatar

Great interview, I really enjoyed it! 💚 🥃

Pawel Jozefiak's avatar

Jeff's framing of energy cost in picojoules as the real optimization target reframes how I think about model selection entirely. I spent weeks obsessing over benchmark scores before realizing the actual cost driver in my AI agent was data movement, not compute. When I switched from Opus for everything to routing tasks by complexity — Haiku for reads/searches, Opus for multi-step reasoning - costs dropped 80%.

The hierarchy logic matches Jeff's Flash vs Pro model: distill capability downward, not just shrink the model. Documented the actual numbers here: https://thoughts.jock.pl/p/claude-model-optimization-opus-haiku-ai-agent-costs-2026