A lot of people are making the mistake of noticing that local models have been 1...

am17an · 2026-04-13T05:46:48 1776059208

Don’t underestimate the march of technology. Just look at your phone, it has more FLOPS than there were in the entire world 40 years ago.

kuboble · 2026-04-13T05:58:01 1776059881

And I think it's very likely that with improved methods you could get opus 4.6 level performance on a wrist watch in few years.

You needed supercomputer to win in chess until you didn't.

Currently local models performance in natural language is much better than any algorithm running on a super computer cluster just few years ago.

root_axis · 2026-04-13T08:09:27 1776067767

Yeah, but that's the current state of the art after decades of aggressive optimizations, there's no foreseeable future where we'll ever be able to cram several orders of magnitude more ram into a phone.

TeMPOraL · 2026-04-13T09:32:26 1776072746

We already cram several orders of magnitude more flash storage into phone than RAM (e.g. my phone has 16 GB RAM but 1 TB storage); even now, with some smart coding, if you don't need all that data at the same time for random access at sub millisecond speed, it's hard to tell the difference.

alwillis · 2026-04-13T17:31:34 1776101494

Agreed. Apple is sells an iPhone Pro Max with 2 TB of storage.

vrighter · 2026-04-13T09:38:59 1776073139

but it doesn't have that much more flops than it did a couple of years ago.

colechristensen · 2026-04-13T05:54:22 1776059662

There's been plenty of free lunch shrinking models thus far with regards to capability vs parameter count.

Contradicting that trend takes more than "It simply.. doesn't."

There's plenty of room for RAM sizes to double along with bus speed. It idled for a long time as a result of limited need for more.

slopinthebag · 2026-04-13T16:43:12 1776098592

The gap between SOTA models and open / local models continues to diminish as SOTA is seeing diminishing returns on scaling (and that seems to be the main way they are "improving"), whereas local models are making real jumps. I'm actually more optimistic local models will catch up completely than I am SOTA will be taking any great leaps forward.

grumbel · 2026-04-13T10:33:01 1776076381

Would the model even need that breath of knowledge? Humans just look things up in books or on Wikipedia, which you can store on a plain old HDD, not VRAM. All books ever written fit into about 60TB if you OCR them, and the useful information in them probably in a lot less, that's well within the range of consumer technology.

baq · 2026-04-13T07:59:51 1776067191

Pretty sure there’s at least a couple orders of magnitude in purely algorithmic areas of LLM inference; maybe training, too, though I’m less confident here. Rationale: meat computers run on 20W, though pretraining took a billion years or so.