2 Comments
User's avatar
Neil Warren's avatar

You might want to be using o3 for this? Afaict "Deep Research" is just a wrapper around o3 which writes a giant chain of thought that's then summarized by another model into the fancy report you get.

(Here's a deep research report on deep research I made soon after DR came out: https://chatgpt.com/s/dr_685312b70aa48191964860333e0fee56), that's my source here, primary source is the spec card I think.

Also uh be careful not to get goodharted, now that the people submitting essentially know every one of your criteria and that an LLM is doing first pass. I'm really not sure this is a good idea vs. finding and paying high taste humans to do it.

Expand full comment
Neil Warren's avatar

Oh yeah more on o3, as the DR report points out and as OAI has shone, o3 was heavily RLed in the "do internet searches" direction which is why e.g. it's great at rainbolting (geoguessing).

This is definitely not the case for Perplexity (which only offers scaffolding or a fine-tuned R1) or Opus (which ime is terrible at conducting good search).

Expand full comment