Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
jerf
on April 5, 2023
|
parent
|
context
|
favorite
| on:
Using mmap to make LLaMA load faster
"Blatant lie" seems a bit strong. Running a large model for a second time in a row is a pretty common use case and that speedup strikes me as real in that common case. Attribution may have been wrong but the time saved is real.
gmork13
on April 5, 2023
[–]
So it can load twice as large models somehow?
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: