Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

>there is a bit of non-determinism in batched non-associative math that can vary by batch / hardware

Maybe a dumb question but does this mean model quality may vary based on which hardware your request gets routed to?





Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: