Hacker News
amelius
11 days ago
on:
OpenAI claims gold-medal performance at IMO 2025
This is not a benchmark, really. It's an official test.
PokemonNoGo
11 days ago
What is an _official_ test?
andrepd
11 days ago
And what were the methods? How was the evaluation? They could be making it all up for all we know!