Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Could this be a model with GPT-2 architecture (all dense decoder only transformer) trained on GPT-4 dataset with GPT-4 tokeniser?


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: