Discussion about this post

User's avatar
Paull Young's avatar

This is a superb read. I appreciate the ‘scientist’ vs ‘simulator’ breakdown.

One Q: where do you feel we stand when it comes to open, accessible data underpinning AI approaches? In the piece I note you say both “The accessibility of data in other domains is a largely solved problem.” (Outside of biology), but later you say “The big labs are focused on intelligence—reasoning, long context, tool use. Domain-specific simulation and data collection are massive undertakings that lie outside their core competencies and business models.”

Shashwat Goel's avatar

Training AI to help with science when simulators aren't available was exactly the motivation of https://arxiv.org/abs/2512.23707 :)

8 more comments...

No posts

Ready for more?