Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Haven't read the paper, but they are probably using something like sentencepiece with sub-word splitting and then charge by the number of resulting token.

https://github.com/google/sentencepiece



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: