
Tons and tons of papers, and most of them have some disadvantage. You can't have your cake and eat it too:

https://arxiv.org/html/2404.08801v1 Meta Megalodon

https://arxiv.org/html/2404.07143v1 Google Infini-Attention

https://arxiv.org/html/2402.13753v1 LongRoPE

and a ton more


