Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

This is not a benchmark, really. It's an official test.




What is an _official_ test?

And what were the methods? How was the evaluation? They could be making it all up for all we know!



Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: