Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Well, I don't have hardware to run a 141B parameters model, even if only 39B are active during inference.


It will be quantized in a matter of days and runnable on most laptops.


8 bit is 149G. 4 bit is 80G.

I wouldn’t call this runnable on most laptops.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: