LLMs need to ask more questions. There is too much focus on taking on a task and making autonomous decisions when there are gaps; the priority should instead be to help the user refine the task first.
I’ve seen more questions from recent models, but it’s still only scratching the surface.
I also prompt Gemini in Google AI Studio to ask me the questions it needs in order to give me a good plan.
Rather than waiting for the model to ask you questions on its own, you can just change the default mode, since this is more a matter of ‘system prompt’ than model design.
In Cursor I have defaulted to Gemini 2.5 Pro as my planner model, which I prompt to ask me questions. It has a large context window, it reasons, and you can have iterative conversations with it, unlike the o-series models.
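Roughly, the setup looks like this (a minimal sketch using the google-generativeai Python SDK; the prompt wording and placeholders are illustrative, not an exact recipe):

```python
# Minimal sketch: a Gemini "planner" chat told to ask questions before
# planning. Prompt wording and placeholders are illustrative.
import google.generativeai as genai

genai.configure(api_key="YOUR_API_KEY")  # placeholder key

PLANNER_PROMPT = (
    "You are a planning assistant. Before proposing any plan, ask me the "
    "clarifying questions you need, in batches. Only write the plan once "
    "I confirm the gaps are filled."
)

model = genai.GenerativeModel(
    model_name="gemini-2.5-pro",
    system_instruction=PLANNER_PROMPT,
)

chat = model.start_chat()  # iterative back-and-forth, as described above
reply = chat.send_message("Help me plan a refactor of our auth module.")
print(reply.text)  # expect questions back first, not a plan
```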
Interesting, I’m using a similar setup, but usually refine outside the IDE.
agreed
Considering how different models require different types of prompting, context, task usage, etc. to function optimally, is there a tool that can help select the appropriate model, tune the prompt, and then run it? Like a meta-reasoning model sitting atop a library of models.
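Even a thin layer would go a long way. A toy sketch of the routing idea (all model names and templates here are illustrative, not a real product):

```python
# Toy sketch of the "meta-model router" idea: pick a backend model by
# task type and apply a per-model prompt template.
from dataclasses import dataclass

@dataclass
class Route:
    model: str
    template: str  # per-model prompt shaping

ROUTES = {
    "planning": Route("gemini-2.5-pro", "Ask clarifying questions, then plan:\n{task}"),
    "coding":   Route("claude-sonnet",  "Implement directly, minimal prose:\n{task}"),
    "research": Route("o3",             "State goal, constraints, sources first:\n{task}"),
}

def route(task: str, task_type: str) -> tuple[str, str]:
    """Return (model_name, tuned_prompt) for a task."""
    r = ROUTES.get(task_type, ROUTES["planning"])
    return r.model, r.template.format(task=task)

model, prompt = route("Summarize our incident postmortems", "research")
print(model, prompt, sep="\n")
```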
What have you seen re: hallucination? My biggest issue with using o3 vs Claude Sonnet/Opus is that even though the output from o3 is often great, I need to fact-check absolutely everything. The o-series seems much more willing to 'lie'/hallucinate than Claude or even 4o.
I don't have evals to prove this yet, but I think that without enough context, and without tools, it will definitely overthink and hallucinate more readily.
Excellent article. I've noticed that providing expansive context to 4o also improves its ability to provide useful, detailed plans and recommendations, so it seems reasonable that more powerful models like o1, o3, and o3-pro would benefit even more from expanded context. Sometimes I just dictate all the background info I can think of to it for five minutes. Works great. I've been using the approach you outlined in one of your other articles on how to structure the prompt for o1 et al., and it's really helped. Since I haven't had access to o3-pro yet, I really appreciate the information you've provided in this article. Super helpful. Thank you.
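For what it's worth, the structure I've settled on looks roughly like this (the section headers are my own convention, not the article's exact template; the dictated background goes in the Context slot):

```python
# Rough template for front-loading context for the o-series models.
PROMPT = """\
## Goal
{goal}

## Context
{dictated_background}

## Constraints
{constraints}

## Output
A step-by-step plan; flag anything you had to assume.
"""

print(PROMPT.format(
    goal="Plan the billing-jobs migration to a queue",
    dictated_background="(five minutes of dictated background here)",
    constraints="- No downtime during business hours",
))
```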
Fuck off... God has nothing to do with that.
Very, very cool. How did you share context while keeping it usable to the model and within the window? How much pre-selection and pre-cleaning did you do, and how?
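For comparison, the crude approach I've used is keyword-overlap ranking against a token budget; a rough sketch (len(text)//4 is a stand-in for a real tokenizer):

```python
# Crude pre-selection sketch: rank snippets by keyword overlap with the
# task and pack them until a token budget is hit.
def estimate_tokens(text: str) -> int:
    return len(text) // 4

def pack_context(snippets: list[str], task: str, budget: int = 100_000) -> str:
    task_words = set(task.lower().split())
    ranked = sorted(
        snippets,
        key=lambda s: len(task_words & set(s.lower().split())),
        reverse=True,
    )
    picked, used = [], 0
    for s in ranked:
        cost = estimate_tokens(s)
        if used + cost <= budget:
            picked.append(s)
            used += cost
    return "\n\n---\n\n".join(picked)

print(pack_context(["auth module notes", "billing docs"], "refactor auth"))
```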
Does it have that same used-car-salesman feel as the other OAI models?
Would be nice to see a "let her rip" prompt along with its output.
Any clear alignment concerns, if you tested for that? A big issue with o3 was that it's unaligned, and it'd be real bad if that carried over to here.