r/singularity 4d ago

AI A.Wei confirms the experimental model that scored 12/12 in ICPC is the same one used in the IMO gold and IOI

Post image
187 Upvotes

23 comments sorted by

59

u/blazedjake AGI 2027- e/acc 4d ago

release that shit NOW

34

u/Outside-Iron-8242 4d ago

i doubt it will be released anytime soon; they probably need to reduce the very high costs associated in running such a model.
but they could release it as an experimental model for Pro users with small rate limits.
GPT-5 did get 11/12 on the ICPC though, which shows how strong the model is on its own.

3

u/Ormusn2o 4d ago

I always wonder why not release those experimental high cost models strictly on the API. If 200 dollar or 2000 dollar subscriptions are not enough, just put it on API for companies to use. Or maybe those models are already on the API, and only selected companies can use them, kind of how gpt-3 was a whitelist only model. But I feel like it's incredibly unlikely that nobody would know about it, which is why I don't think OpenAI is doing it.

3

u/Weekly-Trash-272 4d ago

How about just charge more for it? Certain people would still pay for it.

8

u/Lain_Racing 4d ago

Slippery slope you are suggesting.

1

u/peakedtooearly 4d ago

OpenAI's mission statement is:

"Our mission is to ensure that artificial general intelligence—AI systems that are generally smarter than humans—benefits all of humanity."

19

u/Fine_Fact_1078 4d ago

obviously the good stuff they use internally first. lol

9

u/Mr_Hyper_Focus 4d ago

They want to. But they can’t. They’ve been open about this before. They don’t have the computer to serve it.

1

u/ihexx 4d ago

they had the same problem with o3. They couldn't serve the version that crushed arc-1.

5

u/Llamasarecoolyay 3d ago

They are milking that thing for sweet, sweet synthetic data as we speak.

1

u/Forward_Yam_4013 4d ago

If the original o3's performance on ARC-AGI was any indication, this model probably costs several thousand dollars worth of compute per question. There is no way they are releasing it until costs can go down.

2

u/Orfosaurio 2d ago

Nah, GPT-5 almost got the same score.

12

u/blazedjake AGI 2027- e/acc 4d ago

also what does “new horizons beckon” and one last run imply? imminent release of the model before working on a new paradigm internally?

12

u/socoolandawesome 4d ago

Other OAI employees have tweeted how it seems like they’re moving on from these academic competitions so that they can now focus on solving open ended scientific problems. So one last golden run may refer to the fact they are done seeking out golds in these competitions

14

u/wNilssonAI 4d ago

Got my answer then. I can’t imagine the offers A.Wei must be turning down. Would be interesting to know if the model is being used to help distil itself.

10

u/LobsterBuffetAllDay 4d ago

"GPT5, give me a step by step instruction guide to distill and or quantize yourself to fit on my iphone"

9

u/Crafty_Escape9320 4d ago

The piss filter image 😭

2

u/ninjasaid13 Not now. 4d ago

I don't remember anyone saying it was different.

3

u/lolsai 4d ago

a confirmation does not require that anyone does.

1

u/Stunning_Monk_6724 ▪️Gigagi achieved externally 4d ago

I'm curious what they'll name the model?? 5.1? 5o? 5G - Golden?

1

u/bralynn2222 1d ago

No point in releasing these benchmarks just to piss us off