We dropped Claude. It's pretty clear this is a race to the bottom, and we don't ...

ahartmetz · 2026-04-18T17:21:08 1776532868

>we don't want a hard dependency on another multi-billion dollar company just to write software

One of two main reasons why I'm wary of LLMs. The other is fear of skill atrophy. These two problems compound. Skill atrophy is less bad if the replacement for the previous skill does not depend on a potentially less-than-friendly party.

post-it · 2026-04-18T17:47:13 1776534433

I was worried about skill atrophy. I recently started a new job, and from day 1 I've been using Claude. 90+% of the code I've written has been with Claude. One of the earlier tickets I was given was to update the documentation for one of our pipelines. I used Claude entirely, starting with having it generate a very long and thorough document, then opening up new contexts and getting it to fact check until it stopped finding issues, and then having it cut out anything that was granular/one query away. And then I read what it had produced.

It was an experiment to see if I could enter a mature codebase I had zero knowledge of, look at it entirely through an AI, and come to understand it.

And it worked! Even though I've only worked on the codebase through Claude, whenever I pick up a ticket nowadays I know what file I'll be editing and how it relates to the rest of the code. If anything, I have a significantly better understanding of the codebase than I would without AI at this point in my onboarding.

estetlinus · 2026-04-18T18:08:15 1776535695

Yeah, +1. I will never be working on unsolved problems anyhow. Skill atrophy is not happening if you stay curious and responsible.

stringfood · 2026-04-18T18:28:55 1776536935

I have never learned so quickly in my entire life than to post a forum thread in its entirety into a extended think LLM and then be allowed to ask free form questions for 2 hours straight if I want to. Having my questions answered NOW is so important for me to learn. Back in the day by the time I found the answer online I forgot the question

lobf · 2026-04-18T19:36:47 1776541007

Same. I work in the film industry, but I’ve always been interested in computers and have enjoyed tinkering with them since I was about 5. However, coding has always been this insurmountably complicated thing- every time I make an effort to learn, I’m confronted with concepts that are difficult for me to understand and process.

I’ve been 90% vibe coding for a year or so now, and I’ve learned so much about networking just from spinning up a bunch of docker containers and helping GPT or Claude fix niggling issues.

I essentially have an expert (well, maybe not an expert but an entity far more capable than I am on my own) who’s shoulder I can look over and ask as many questions I want to, and who will explain every step of the process to me if I want.

I’m finally able to create things on my computer that I’ve been dreaming about for years.

estetlinus · 2026-04-19T17:06:01 1776618361

I pivoted from the film industry into AI 10 years ago. My end game is to replace movie magic.

idopmstuff · 2026-04-18T18:34:22 1776537262

Some people talk like skill atrophy is inevitable when you use LLMs, which strikes me as pretty absurd given that you are talking about a tool that will answer an infinite number of questions with infinite patience.

I usually learn way more by having Claude do a task and then quizzing it about what it did than by figuring out how to do it myself. When I have to figure out how to do the thing, it takes much more time, so when I'm done I have to move on immediately. When Claude does the task in ten minutes I now have several hours I can dedicate entirely to understanding.

onemoresoop · 2026-04-18T18:42:56 1776537776

You lose some, you win some. The win could be short-term much higher, however imagine that the new tool suddenly gets ragged pulled from under your feet. What do you do then? Do you still know how to handle it the old way or do you run into skill atrophy issues? I’m using Claude/Codex as well, but I’m a little worried that the environment we work in will become a lot more bumpy and shifty.

post-it · 2026-04-19T01:08:45 1776560925

> however imagine that the new tool suddenly gets ragged pulled from under your feet

When you have a headache, do you avoid taking ibuprofen because one day it may not be available anymore? Two hundred years ago, if you gave someone ibuprofen and told them it was the solution for 99% of the cases where they felt some kind of pain, they might be suspicious. Surely that's too good to be true.

But it's not. Ibuprofen really is a free lunch, and so is AI. It's weird to experience, but these kinds of technologies come around pretty often, they just become ubiquitous so quickly that we forget how we got by without them.

visarga · 2026-04-18T18:48:08 1776538088

> the new tool suddenly gets ragged pulled from under your feet

If that happened at this point, it would be after societal collapse.

onemoresoop · 2026-04-18T19:03:45 1776539025

I don’t even wanna think about that scenario, maybe he gets averted somehow.

hdjrudni · 2026-04-18T20:23:31 1776543811

The "infinite patience" thing I find particularly interesting.

Every now and then I pause before I ask an LLM to undo something it just did or answer something I know it answered already, somewhere. And then I remember oh yeah, it's an LLM, it's not going to get upset.

dlopes7 · 2026-04-18T20:01:05 1776542465

Asking infinite questions about something does not make you good at “doing” that thing, you get pretty good at asking questions

techpression · 2026-04-18T20:17:02 1776543422

Understanding is not learning. Zero effort gives zero rewards, I ask Claude plenty of things, I get answers but not learnings.

bdangubic · 2026-04-18T19:04:50 1776539090

I used to speak Russian like I was born in Russia. I stopped talking Russian … every day I am curious ans responsible but I can hardly say 10 words in Russian today. if you don’t use it (not just be curious and responsible) you will lose it - period.

thih9 · 2026-04-18T19:33:57 1776540837

Programming language is not just syntax, keywords and standard libraries, but also: processes, best practices and design principles. The latter group I guess is more difficult to learn and harder to forget.

bdangubic · 2026-04-18T20:37:30 1776544650

I respectfully completely disagree. not only will you just as easily lose thr processed, best practices and design principles but they will be changing over time (what was best practice when I got my first gig in 1997 is not a best practice today (even just 4-5 years ago not to go all the back to the 90’s)). all that is super easy to both forget and lose unless you live it daily

thih9 · 2026-04-19T05:02:19 1776574939

Forget, yes; lose, no. Like it would be much easier for you to relearn Russian - especially compared with someone who only knows English.

ashirviskas · 2026-04-19T00:51:36 1776559896

More fair comparison would be writing/talking about Russian language in English. That way you'd still focus on Russian. Same way with programming - it's not like you stop seeing any code. So why should you forget it?

SpicyLemonZest · 2026-04-18T18:14:10 1776536050

Are you sure you would know if it didn't work? I use Claude extensively myself, so I'm not saying this from a "hater" angle, but I had 2 people last week who believe themselves to be in your shoes send me pull requests which made absolutely no sense in the context of the codebase.

therealdrag0 · 2026-04-18T18:21:25 1776536485

That’s always been the case, AI or not.

Jweb_Guru · 2026-04-18T20:18:03 1776543483

No, it hasn't. I did not have a problem before AI with people sending in gigantic pull requests that made absolutely no sense, and justifying them with generated responses that they clearly did not understand. This is not a thing that used to happen. That's not to say people wouldn't have done it if it were possible, but there was a barrier to submitting a pull request that no longer exists.

viccis · 2026-04-18T21:01:55 1776546115

In my experience, the people sending me garbage PRs with Claude are the same ones who wrote garbage code beforehand. Now there's just 10x more of it.

windexh8er · 2026-04-18T19:52:24 1776541944

It just happens to be a lot worse now. Confidence through ignorance has come into the spotlight with the commoditization of LLMs.

post-it · 2026-04-19T01:02:35 1776560555

Yeah, I test everything myself.

root_axis · 2026-04-18T19:58:02 1776542282

I have also found LLMs are a great tool for understanding a new code base, but it's not clear to me what your comment has to do with skill atrophy.

post-it · 2026-04-19T02:13:20 1776564800

Well ultimately the skill I care about is understanding software, changing it, and making more of it. And clearly that isn't atrophying.

My syntax writing skills may well be atrophying, but I'll just do a leetcode by hand once in a while.

viccis · 2026-04-18T21:15:50 1776546950

It's good that it's working for you but I'm not sure what this has to do with skill atrophy. It sounds like you never had this skill (in this case, working with that particular system) to begin with.

>I have a significantly better understanding of the codebase than I would without AI at this point in my onboarding

One of the pitfalls of using AI to learn is the same as I'd see students doing pre-AI with tutoring services. They'd have tutors explain the homework to do them and even work through the problems with them. Thing is, any time you see a problem or concept solved, your brain is tricked into thinking you understand the topic enough to do it yourself. It's why people think their job interview questions are much easier than they really are; things just seem obvious when you've thought about the solution. Anyone who's read a tutorial, felt like they understood it well, and then struggled for a while to actually start using the tool to make something new knows the feeling very well. That Todo List app in the tutorial seemed so simple, but the author was making a bunch of decisions constantly that you didn't have to think about as you read it.

So I guess my question would be: If you were on a plane flight with no wifi, and you wanted to do some dev work locally on your laptop, how comfortable would you be vs if you had done all that work yourself rather than via Claude?

post-it · 2026-04-19T01:04:29 1776560669

> If you were on a plane flight with no wifi, and you wanted to do some dev work locally on your laptop, how comfortable would you be vs if you had done all that work yourself rather than via Claude?

Probably about as comfortable as I would be if I also didn't have my laptop and instead had to sketch out the codebase in a notebook. There's no sense preparing for a scenario where AI isn't available - local models are progressing so quickly that some kind of AI is always going to be available.

viccis · 2026-04-19T19:33:21 1776627201

So then the argument isn't so much that skill decay isn't an issue but rather that the skill is inherently worthless moving forward. I'm not sure I agree, but I also got a compsci education because I have loved doing it since childhood rather than because I just wanted to make money, and I can see how the latter group would vehemently disagree with me.

Ifkaluva · 2026-04-18T20:08:05 1776542885

What do you mean “cut out anything that was granular/one query away”? This was a very cool workflow to hear about—I will be applying it myself

post-it · 2026-04-19T01:01:54 1776560514

For example, Claude was very eager to include function names, implementation details, and the exact variables that are passed between services. But all the info I need for a particular process is the names of the services involved, the files involved, and a one-sentence summary of what happens. If I want to know more, I can tell Claude to read the doc and find out more with a single query (or I can just check for myself).

ljm · 2026-04-18T17:44:33 1776534273

Not so much atrophy as apathy.

I've worked with people who will look at code they don't understand, say "llm says this", and express zero intention of learning something. Might even push back. Be proud of their ignorance.

It's like, why even review that PR in the first place if you don't even know what you're working with?

psygn89 · 2026-04-18T18:01:24 1776535284

I cringed when I saw a dev literally copy and paste an AI's response to a concern. The concern was one that had layers and implications to it, but instead of getting an answer as to why it was done a certain way and to allay any potential issues, that dev got a two paragraph lecture on how something worked on the surface of it, wrapped in em dashes and joviality.

A good dev would've read deeper into the concern and maybe noticed potential flaws, and if he had his own doubts about what the concern was about, would have asked for more clarification. Not just feed a concern into AI and fling it back. Like please, in this day and age of AI, have the benefit of the doubt that someone with a concern would have checked with AI himself if he had any doubts of his own concern...

oremj · 2026-04-18T18:21:37 1776536497

Is this the same subset of people who copy/paste code directly from stack overflow without understanding ? I’m not sure this is a new problem.

foobarchu · 2026-04-18T18:51:50 1776538310

It's a new problem in the sense that now executive management at many (if not most) software companies is pushing for all employees to work this way as much as possible. Those same people probably don't know what stack overflow even is.

pizza234 · 2026-04-18T18:34:35 1776537275

In my experience, no - I think the ability to build more complete features with less/little/no effort, rather than isolated functions, is (more) appealing to (more) developers.

malnourish · 2026-04-18T18:49:46 1776538186

I don't think so. I'll spend a ton of time and effort thinking through, revising, and planning out the approach, but I let the agent take the wheel when it comes to transpiling that to code. I don't actually care about the code so long as it's secure and works.

I spent years cultivating expertise in C++ and .NET. And I found that time both valuable and enjoyable. But that's because it was a path to solve problems for my team, give guidance, and do so with both breadth and depth.

Now I focus on problems at a higher level of abstraction. I am certain there's still value in understanding ownership semantics and using reflection effectively, but they're broadly less relevant concerns.

dingaling · 2026-04-18T19:27:42 1776540462

It's difficult to copy & paste an entire app from Stack Overflow

sroussey · 2026-04-18T19:21:54 1776540114

Copied and pasted without noting the license that stack overflow has on code published there, no doubt

trinsic2 · 2026-04-18T18:31:06 1776537066

Hey. I resemble that remark sometimes!! quit being a hater (sarcasm) :P

kilroy123 · 2026-04-18T18:04:10 1776535450

We've had such developers around, long before LLMs.

ohazi · 2026-04-18T18:20:52 1776536452

They're so much louder now, though.

RexM · 2026-04-18T18:17:41 1776536261

It’s a lot like someone bragging that they’re bad at math tossing around equations.

monkpit · 2026-04-18T18:09:03 1776535743

If I wanted to know what the LLM says, I would have asked it myself, thanks…

redanddead · 2026-04-18T18:12:21 1776535941

What is it in the broader culture that's causing this?

groundzeros2015 · 2026-04-18T18:27:07 1776536827

People who got into the job who don’t really like programming

drivebyhooting · 2026-04-18T18:32:44 1776537164

I like programming, but I don’t like the job.

groundzeros2015 · 2026-04-18T19:55:55 1776542155

Then why are you letting Claude do the fun part?

root_axis · 2026-04-18T20:00:08 1776542408

Obviously, the fun part is delivering value for the shareholders.

mattgreenrocks · 2026-04-18T18:18:07 1776536287

These people have always existed. Hell, they are here, too. Now they have a new thing to delegate responsibility to.

And no, I don't understand them at all. Taking responsibility for something, improving it, and stewarding it into production is a fantastic feeling, and much better than reading the comment section. :)

tossandthrow · 2026-04-18T17:28:12 1776533292

You can argu that you will have skill atrophy by not using LLMs.

We have gone multi cloud disaster recovery on our infrastructure. Something I would not have done yet, had we not had LLMs.

I am learning at an incredible rate with LLMs.

mgambati · 2026-04-18T17:32:47 1776533567

I kind feel the same. I’m learning things and doing things in areas that would just skip due to lack of time or fear.

But I’m so much more detached of the code, I don’t feel that ‘deep neural connection’ from actual spending days in locked in a refactor or debugging a really complex issue.

I don’t know how a feel about it.

Fire-Dragon-DoL · 2026-04-18T17:38:45 1776533925

I strongly agree on the refactor, but for debugging I have another perspective: I think debugging is changing for the better, so it looks different.

Sure, you don't know the code by heart, but people debugging code translated to assembly already do that.

The big difference is being able to unleash scripts that invalidate enormous amount of hypothesis very fast and that can analyze the data.

Used to do that by hand it took hours, so it would be a last resort approach. Now that's very cheap, so validating many hypothesis is way cheaper!

I feel like my "debugging ability" in terms of value delivered has gone way up. For skill, it's changing. I cannot tell, but the value i am delivering for debugging sessions has gone way up

afzalive · 2026-04-18T17:37:52 1776533872

As someone who's switched from mobile to web dev professionally for the last 6 months now. If you care about code quality, you'll develop that neural connection after some time.

But if you don't and there's no PR process (side projects), the motivation to form that connection is quite low.

hombre_fatal · 2026-04-18T19:00:17 1776538817

> If you care about code quality, you'll develop that neural connection after some time.

No, because you can get LLMs to produce high quality code that has gone through an infinite number of refinement/polish cycles and is far more exhaustive than the code you would have written yourself.

Once you hit that point, you find yourself in a directional/steering position divorced from the code since no matter what direction you take, you'll get high quality code.

afzalive · 2026-04-20T20:45:23 1776717923

Only if never find opportunities to simplify the code it's writing and you don't review the code at all.

> no matter what direction you take, you'll get high quality code

This is not the case today. You get medium-quality, sometimes over-engineered code 10x faster.

ori_b · 2026-04-18T17:37:43 1776533863

Yes, you certainly can argue that, but you'd be wrong. The primary selling point of LLMs is that they solve the problem of needing skill to get things done.

tossandthrow · 2026-04-18T17:40:38 1776534038

That is not the entire selling point - so you are very wrong.

You very much decide how you employ LLMs.

Nobody are keeping a gun to your head to use them. In a certain way.

Sonif you use them in a way that increase you inherent risk, then you are incredibly wrong.

ori_b · 2026-04-18T17:47:19 1776534439

I suggest you read the sales pitches that these products have been making. Again, when I say that this is the selling point, I mean it: This is why management is buying them.

SpicyLemonZest · 2026-04-18T18:01:59 1776535319

I've read the sales pitches, and they're not about replacing the need for skill. The Claude Design announcement from yesterday (https://www.anthropic.com/news/claude-design-anthropic-labs) is pretty typical in my experience. The pitch is that this is good for designers, because it will allow them to explore a much broader range of ideas and collaborate on them with counterparties more easily. The tool will give you cool little sliders to set the city size and arc width, but it doesn't explain why you would want to adjust these parameters or how to determine the correct values; that's your job.

I understand why a designer might read this post and not be happy about it. If you don't think your management values or appreciates design skill, you'd worry they're going to glaze over the bullet points about design productivity, and jump straight to the one where PMs and marketers can build prototypes and ignore you. But that's not what the sales pitch is focused on.

ori_b · 2026-04-18T18:19:09 1776536349

The majority of examples in the document you linked describe 'person without<skill> can do thing needing <skill>'. It's very much selling 'more output, less skill'

trinsic2 · 2026-04-18T18:35:06 1776537306

Sales pitches dont mean jack, WTF are you talking about?

foobarchu · 2026-04-18T18:53:43 1776538423

Sales pitches are literally the same thing as "the selling point".

Neither of those is necessarily a synonym for why you personally use them

Forgeties79 · 2026-04-18T18:04:13 1776535453

They purportedly solve the problem of needing skill to get things done. IME, this is usually repeated by VC backed LLM companies or people who haven’t knowingly had to deal with other people’s bad results.

This all bumps up against the fact that most people default to “you use the tool wrong” and/or “you should only use it to do things where you already have firm grasp or at least foundational knowledge.”

It also bumps against the fact that the average person is using LLM’s as a replacement for standard google search.

andy_ppp · 2026-04-18T17:52:34 1776534754

I see it completely the opposite way, you use an LLM and correct all its mistakes and it allows you to deliver a rough solution very quickly and then refine it in combination with the AI but it still gets completely lost and stuck on basic things. It’s a very useful companion that you can’t trust, but it’s made me 4-5x more productive and certainly less frustrated by the legacy codebase I work on.

trinsic2 · 2026-04-18T18:34:09 1776537249

Yeah I whole hardheartedly disagree with this. Because I understand the basics of coding I can understand where the model gets stuck and prompt it in other directions.

If you don't know whats going on through the whole process, good luck with the end product.

weego · 2026-04-18T19:00:29 1776538829

You're learning at your standard rate of learning, you're just feeding yourself over-confidence on how much you're absorbing vs what the LLM is facilitating you rolling out.

tossandthrow · 2026-04-18T19:24:57 1776540297

This is such a weird statement in so many levels.

The latent assumption here is that learning is zero sum.

That you can take a 30 year old from 1856 bring them into present day and they will learn whatever subject as fast as a present day 20 year old.

That teachers doesn't matter.

That engagement doesn't matter.

Learning is not zero sum. Some cultural background makes learning easier, some mentoring makes is easier, and some techniques increases engagement in ways that increase learning speed.

bluefirebrand · 2026-04-18T17:33:23 1776533603

> I am learning at an incredible rate with LLMs

Could you do it again without the help of an LLM?

If no, then can you really claim to have learned anything?

_blk · 2026-04-18T17:57:30 1776535050

The challenge is not if you could do all of it without AI but any of it that you couldn't before.

Not everyone learns at the same pace and not everyone has the same fault tolerance threshold. In my experiencd some people are what I call "Japanese learners" perfecting by watching. They will learn with AI but would never do it themselves out of fear of getting something wrong while they understand most of it, others that I call "western learners" will start right away and "get their hands dirty" without much knowledge and also get it wrong right away. Both are valid learning strategies fitting different personalities.

tossandthrow · 2026-04-18T17:38:24 1776533904

I could definitely maintain the infrastructure without an llm. Albeit much slower.

And yes. If LLMs disappear, then we need to hire a lot of people to maintain the infrastructure.

Which naturally is a part of the risk modeling.

bluefirebrand · 2026-04-18T18:32:22 1776537142

> I could definitely maintain the infrastructure without an llm

Not what I asked, but thanks for playing.

tossandthrow · 2026-04-18T19:28:03 1776540483

You literally asked that question

> Could you do it again without the help of an LLM?

bluefirebrand · 2026-04-18T20:26:36 1776543996

And the question you answered was "could you maintain it without the help of an LLM"

Paradigma11 · 2026-04-18T19:41:49 1776541309

So, you havent really learned anything from any teacher if you could not do it again without them?

lelanthran · 2026-04-18T22:35:37 1776551737

> So, you havent really learned anything from any teacher if you could not do it again without them?

Well, yes?

What do you think "learning" means? If you cannot do something without the teacher, you haven't learned that thing.

techpression · 2026-04-18T20:20:36 1776543636

That would be the definition of learning something, yes.

falkensmaize · 2026-04-18T19:52:17 1776541937

I mean...yeah?

If your child says they've learned their multiplication tables but they can't actually multiply any numbers you give them do they actually know how to do multiplication? I would say no.

Jweb_Guru · 2026-04-18T20:20:56 1776543656

For some reason people are perfectly able to understand this in the context of, say, cursive, calculator use, etc., but when it comes to their own skillset somehow it's going to be really different.

UncleMeat · 2026-04-18T20:37:15 1776544635

Yes that's exactly right.

sho_hn · 2026-04-18T20:14:17 1776543257

danw1979 · 2026-04-18T17:44:18 1776534258

I think this is a bit dismissive.

It’s quite possible to be deep into solving a problem with an LLM guiding you where you’re reading and learning from what it says. This is not really that different from googling random blogs and learning from Stack Overflow.

Assuming everyone just sits there dribbling whilst Claude is in YOLO mode isn’t always correct.

subscribed · 2026-04-18T18:50:43 1776538243

>> I am learning a new skill with instructor at an incredible rate

> Could you do it again on your own?

Can you you see how nonsensical your stance is? You're straight up accusing GP of lying they are learning something at the increased rate OR suggesting if they couldn't learn that, presumably at the same rate, on they own, they're not learning anything.

That's not very wise to project your own experiences on others.

sroussey · 2026-04-18T19:34:30 1776540870

Actually, it’s much like taking a physics or engineering course, and after the class being fully able to explain the class that day, and yet realize later when you are doing the homework that you did not actually fully understand like you thought you did.

i_love_retros · 2026-04-18T17:37:56 1776533876

>I am learning at an incredible rate with LLMs.

I don't believe it. Having something else do the work for you is not learning, no matter how much you tell yourself it is.

margalabargala · 2026-04-18T17:54:05 1776534845

If you've seen further it's only because you've stood on the shoulders of giants.

Having other people do work for you is how people get to focus on things they actually care about.

Do you use a compiler you didn't write yourself? If so can you really say you've ever learned anything about computers?

butterisgood · 2026-04-18T18:24:31 1776536671

You have to build a computer to learn about computers!

viccis · 2026-04-18T21:21:09 1776547269

I would argue that if you've just watched videos about building computers and haven't sat down and done one yourself, then yeah I don't see any evidence that you've learned how to build a computer.

margalabargala · 2026-04-19T02:11:13 1776564673

And, so the anti-LLM argument goes, if you've not built the computer you can't learn anything about what computers could be used for.

viccis · 2026-04-19T19:31:37 1776627097

That's not the anti-LLM argument, that's a brand new argument you made up.

margalabargala · 2026-04-20T00:00:55 1776643255

Did you not read the comment thread you replied to? That's the exact argument that I_love_retros made above.

That is in fact the anti LLM argument you've ostensibly been discussing. If you want to talk to the person who made it up I'm not your guy.

tossandthrow · 2026-04-18T17:41:33 1776534093

It is easy to not believe if you only apply an incredibly narrow world view.

Open your eyes, and you might become a believer.

nothinkjustai · 2026-04-18T17:59:49 1776535189

What is this, some sort of cult?

subscribed · 2026-04-18T19:26:44 1776540404

You mean the cult of "I can't see the viruses therefore they dint exist"? As in "I can't imagine something so it means it's a lie"?

Indeed, quite weird and no imagination.

tossandthrow · 2026-04-18T18:16:27 1776536187

No, it is an as snarky response to a person being snarky about usefulness of AI agents.

It does seem like there is a cult of people who categorically see LLMs as being poor at anything without it being founded in anything experience other than their 2023 afternoon to play around with it.

nothinkjustai · 2026-04-18T19:05:02 1776539102

Who cares? Why are people so invested in trying to “convert” others to see the light?

Can’t you be satisfied with outcompeting “non believers”? What motivates you to argue on the internet about it? Deep down are you insecure about your reliance on these tools or something, and want everyone else to be as well?

tossandthrow · 2026-04-18T19:27:00 1776540420

Why do people invest themselves so hard in interjecting themselves into conversations about Ai telling people it doesn't work?

It feels so off rebuilding serious SaaS apps in days for production, only to be told it is not possible?

nothinkjustai · 2026-04-18T21:38:52 1776548332

Who here said ai “doesn’t work”?

Wowfunhappy · 2026-04-18T19:07:12 1776539232

> We have gone multi cloud disaster recovery on our infrastructure. Something I would not have done yet, had we not had LLMs.

That’s product atrophy, not skill atrophy.

deadbabe · 2026-04-18T17:32:25 1776533545

Using LLMs as a learning tool isn’t what causes skill atrophy. It’s using them to solve entire problems without understanding what they’ve done.

And not even just understanding, but verifying that they’ve implemented the optimal solution.

tehjoker · 2026-04-18T19:56:42 1776542202

It's partly that, but also reading and surface level understanding something vs generating yourself are different skills with different depths. If you're learning a language, you can get good at listening without getting good at speaking for example.

jjallen · 2026-04-18T17:31:39 1776533499

Also AI could help you pick those skills up again faster, although you wouldn’t need to ever pick those skills up again unless AI ceased to exist.

What an interesting paradox-like situation.

estetlinus · 2026-04-18T18:13:36 1776536016

I believe some professor warned us about being over reliant on Google/reddit etc: “how would you be productive if internet went down” dilemma.

Well, if internet is down, so is our revenue buddy. Engineering throughput would be the last of our concerns.

solarengineer · 2026-04-18T18:16:20 1776536180

https://hex.ooo/library/power.html

When future humans rediscover mathematics.

IgorPartola · 2026-04-18T20:00:19 1776542419

Yeah I am worried about skill atrophy too. Everyone uses a compiler these days instead of writing assembly. Like who the heck is going to do all the work when people forget how to use the low level tools and a compiler has a bug or something?

And don’t get me started on memory management. Nobody even knows how to use malloc(), let alone brk()/mmap(). Everything is relying on automatic memory management.

I mean when was the last time you actually used your magnetized needle? I know I am pretty rusty with mine.

otabdeveloper4 · 2026-04-18T20:03:00 1776542580

> an LLM is exactly like a compiler if a compiler was a black box hosted in a proprietary cloud and metered per symbol

Yeah, exactly.

techpression · 2026-04-18T20:27:33 1776544053

Snark aside, this is an actual problem for a lot of developers in varying degrees, not understand anything about the layers below make for terrible layers above in very many situations.

dgellow · 2026-04-18T18:00:31 1776535231

Another aspect I haven’t seen discussed too much is that if your competitor is 10x more productive with AI, and to stay relevant you also use AI and become 10x more productive. Does the business actually grow enough to justify the extra expense? Or are you pretty much in the same state as you were without AI, but you are both paying an AI tax to stay relevant?

xixixao · 2026-04-18T18:09:36 1776535776

This is the “ad tax” reasoning, but ultimately I think the answer is greater efficiency. So there is a real value, even if all competitors use the tools.

It’s like saying clothing manufacturers are paying the “loom tax” tax when they could have been weaving by hand…

SlinkyOnStairs · 2026-04-18T18:26:58 1776536818

Software development is not a production line, the relationship between code output and revenue is extremely non-linear.

Where producing 2x the t-shirts will get you ~2x the revenue, it's quite unlikely that 10x the code will get you even close to 2x revenue.

With how much of this industry operates on 'Vendor Lock-in' there's a very real chance the multiplier ends up 0x. AI doesn't add anything when you can already 10x the prices on the grounds of "Fuck you. What are you gonna do about it?"

groundzeros2015 · 2026-04-18T18:42:02 1776537722

Yep and in a vendor lock in scenario, fixing deep bugs or making additions in surgical ways is where the value is. And Claude helps you do that, by giving you more information, analyzing options, but it doesn’t let you make that decision 10x faster.

bigbadfeline · 2026-04-18T19:09:08 1776539348

We already know how to multiply the efficiency of human intelligence to produce better quality than LLMs and nearly match their productivity - open source - in fact coding LLMs wouldn't even exist without it.

Open source libraries and projects together with open source AI is the only way to avoid the existential risks of closed source AI.

dakiol · 2026-04-18T19:24:10 1776540250

Where's the evidence of competitors being 10x more productive? So far, everyone is simply bragging about how much code they have shipped last week, but that has zero relevance when it comes to productivity

davidron · 2026-04-18T22:34:14 1776551654

I work at a 20-year-old mid-sized SaaS company. As long as the company has been around, product managers have longed for more engineers and strategies for engineers to ship features faster. As of around February, those same product managers across the org are complaining that they can't keep up with the pace at which engineers are shipping their features. This isn't just lines of code. This is the entire company trying to figure out how to help the PMs because engineers suddenly stopped being the bottleneck.

I don't know about 10x, but this could only happen if PMs suddenly got really lazy or the engineers actually got at least 1.5x faster. My gut says it's way more because we're now also consistently up to date on our dependencies and completing massive refactors we were putting off for years.

There are lots of reasons this could be the case. Quality suddenly changed, the nature of the work changed, engineers leveled up... But for this to have happened consistently across a bunch of engineering teams is quite the coincidence if not this one thing we are all talking about.

dgellow · 2026-04-18T20:04:00 1776542640

Read it as just a given rate. The number doesn’t matter too much here, if company B does believe claims from company A they are N times more productive that’s enough to force B to adopt the same tooling.

Silhouette · 2026-04-18T20:14:22 1776543262

I feel like a lot of the AI advocacy today is like the Cloud advocacy of a few years ago or the Agile advocacy before that. It's this season's silver bullet to make us all 10x more effective according to metrics that somehow never translate into adding actually useful functionality and quality 10x as fast.

The evangelists told us 20 years ago that if we weren't doing TDD then we weren't really professional programmers at all. The evangelists told us 10 years ago that if we were still running stuff locally then we must be paying a fortune for IT admin or not spending our time on the work that mattered. The evangelists this week tell us that we need to be using agents to write all our code or we'll get left in the dust by our competitors who are.

I'm still waiting for my flying car. Would settle for some graphics software on Linux that matches the state of the art on Windows or even reliable high-quality video calls and online chat rooms that don't make continental drift look fast.

redanddead · 2026-04-18T18:15:58 1776536158

The alternative is probably also true. If your F500 competitor is also handicapped by AI somehow, then you're all stagnant, maybe at different levels. Meanwhile Anthropic is scooping up software engineers it supposedly made irrelevant with Mythos and moving into literally 2+ new categories per quarter

senordevnyc · 2026-04-18T18:08:50 1776535730

Either the business grows, or the market participants shed human headcount to find the optimal profit margin. Isn’t that the great unknown: what professions are going to see headcount reduction because demand can’t grow that fast (like we’ve seen in agriculture), and which will actually see headcount stay the same or even expand, because the market has enough demand to keep up with the productivity gains of AI? Increasingly I think software writ large is the latter, but individual segments in software probably are the former.

Lihh27 · 2026-04-18T18:27:51 1776536871

it's worse than a tie. 10x everyone just floods the market and tanks per-unit price. you pay the AI tax and your output is worth less.

otabdeveloper4 · 2026-04-18T20:04:41 1776542681

> your competitor is 10x more productive with AI

This doesn't happen. Literally zero evidence of this.

dgellow · 2026-04-18T20:05:43 1776542743

The actual rate isn’t relevant for the discussion

Miner49er · 2026-04-18T20:13:28 1776543208

Well it might.

If the actual rate is .9x then it matters a lot.

Or even if it's like 1.1x, is the cost worth the return?

xvector · 2026-04-19T03:37:17 1776569837

The cost is so small relative to the increase. The cost whining on HN is bizarre to me. Feels like everyone here is on an individual plan and has no understanding of what margins look like for actual business.

Meta pays $750k+ TC and makes far more profit/eng, do you think they care about $5k/eng/mo in inference? A 1.1x increase would be so significant that it would justify the cost easily, especially when you can just compress comps to make up for it

AlexeyBelov · 2026-04-20T11:20:24 1776684024

Nobody is whining here.

otabdeveloper4 · 2026-04-19T09:33:31 1776591211

What? You don't think businesses do financial planning and calculations for profit margins?

Do you really think they go on vibes - "welp, this AI thing seems to improve developer performance, I guess. Heck, what's an extra 5k per developer anyways, amirite".

Well, maybe they really do in your neck of the woods. Explains a lot, I guess.

xvector · 2026-04-19T13:21:56 1776604916

Yes most companies do in fact operate like this. There are tens of thousands of companies that will pay more for the best thing and call it at that, because the cost is dwarfed by what even marginal gains in quality unlock for the business.

otabdeveloper4 · 2026-04-20T04:20:44 1776658844

> the cost is dwarfed by what even marginal gains in quality

That is just, like, your opinion, man.

Also, I doubt these kinds of companies have "quality" of anything, never mind "gains in quality".

surgical_fire · 2026-04-18T20:38:40 1776544720

What if the rate is negative?

Would it matter?

dgellow · 2026-04-20T11:28:08 1776684488

It would matter but would be a different discussion than the one I was going for

JambalayaJimbo · 2026-04-18T18:31:18 1776537078

If the business doesn’t grow then you shed costs like employees

michaelje · 2026-04-18T18:28:47 1776536927

Open models keep closing the eval gap for many tasks, and local inference continues to be increasingly viable. What's missing isn't technical capability, but productized convenience that makes the API path feel like the only realistic option.

Frontier labs are incentivized to keep it that way, and they're investing billions to make AI = API the default. But that's a business model, not a technical inevitability.

trueno · 2026-04-18T20:11:57 1776543117

im hoping and praying that local inference finds it's way to some sort of baseline that we're all depending on claude for here. that would help shape hardware designs on personal devices probably something in the direction of what apple has been doing.

ive had to like tune out of the LLM scene because it's just a huge mess. It feels impossible to actually get benchmarks, it's insanely hard to get a grasp on what everyone is talking about, bots galore championing whatever model, it's just way too much craze and hype and misinformation. what I do know is we can't keep draining lakes with datacenters here and letting companies that are willing to heel turn on a whim basically control the output of all companies. that's not going to work, we collectively have to find a way to make local inference the path forward.

everyone's foot is on the gas. all orgs, all execs, all peoples working jobs. there's no putting this stuff down, and it's exhausting but we have to be using claude like _right now_. pretty much every company is already completely locked in to openai/gemini/claude and for some unfortunate ones copilot. this was a utility vendor lock in capture that happened faster than anything ive ever seen in my life & I already am desperate for a way to get my org out of this.

hakfoo · 2026-04-18T21:04:27 1776546267

I'm frustrated that there's not "solid" instructional tooling. I either see people just saying "keep trying different prompts and switching models until you get lucky" or building huge cantilevered toolchains that seems incredibly brittle, and even then, how well do they really work?

I get choice paralysis when you show me a prompt box-- I don't know what I can reasonably ask for and how to best phrase it, so I just panic. It doesn't help when we see articles saying people are getting better outcomes by adding things like "and no bugs plz owo"

I'm sure this is by design-- anything with clear boundaries and best practices would discourage gacha style experimentation. Can you trust anyone who sells you a metered service to give you good guidance on how to use it efficiently?

trueno · 2026-04-18T21:18:51 1776547131

yea that is probably the worst part of these techs becoming mainstream services and local-LLM'ing taking off in general: working with them at many points in any architecture no longer feels... deterministic i guess. way too fucking much "heres what i use" but no real best practices yet, just a lot of vague gray area and everyones still in discovery-mode on how to best find some level of determinism or workflow and ways we are benchmarking is seriously a moving target. everyone has their own branded take on what the technology is and their own branded approach on how to use it, and it's probably the murkiest and foggiest time to be in technology fields that i've ever seen :\ seems like weekly/monthly something is outdated, not just the models but the tooling people are parroting as the current best tooling to use. incredibly frustrating. there's simply too much ground to cover for any one person to have any absolute takes on any of it, and because a handful of entities are currently leading the charge draining lakes and trying to compete for every person and every businesses money, there's zero organized frameworks at the top to make some sense of this. they all are banking on their secret sauce, and i _really_ want us all to get away from this. local inference has to succeed imo but goddamn there needs to be some collective working together to rally behind some common strats/frameworks here. im sure there's already countless committees that have been established to try and get in front of this but even that's messy.

i don't know how else to phrase it: this feels like such an unstable landscape, "beta" software/services are running rampant in every industry/company/org/etc and there's absolutely no single resource we can turn to to help stay ahead of & plan for the rapidly-evolving landscape. every, and i mean every company, is incredibly irresponsible for using this stuff. including my own. once again though, cat's already out of the bag. now we fight for our lives trying to contain it and ensure things are well understood and implemented properly...which seems to be the steepest uphill battle of my life

dewarrn1 · 2026-04-18T17:59:42 1776535182

I'm hopeful that new efficiencies in training (Deepseek et al.), the impressive performance of smaller models enhanced through distillation, and a glut of past-their-prime-but-functioning GPUs all converge make good-enough open/libre models cheap, ubiquitous, and less resource-intensive to train and run.

i_love_retros · 2026-04-18T17:36:49 1776533809

> we don't want a hard dependency on another multi-billion dollar company just to write software

My manager doesn't even want us to use copilot locally. Now we are supposed to only use the GitHub copilot cloud agent. One shot from prompt to PR. With people like that selling vendor lock in for them these companies like GitHub, OpenAI, Anthropic etc don't even need sales and marketing departments!

tossandthrow · 2026-04-18T17:43:24 1776534204

You are aware that using eg. Github copilot is not one shot? It will start an agentic loop.

dgellow · 2026-04-18T18:01:49 1776535309

Unnecessary nitpicking

tossandthrow · 2026-04-18T18:52:19 1776538339

Why?

One shoting has a very specific meaning, and agentic workflows are not it?

What is the implied meaning I should understand from them using one shot?

They might refer to the lack of humans in the loop.

dgellow · 2026-04-18T20:01:47 1776542507

You give a prompt, you get a PR. If it is ready to merge with the first attempt, that’s a one shot. The agentic loop is a detail in their context

tossandthrow · 2026-04-18T17:26:48 1776533208

The lock in is so incredibly poor. I could switch to whatever provider in minuets.

But it requires that one does not do something stupid.

Eg. For recurring tasks: keep the task specification in the source code and just ask Claude to execute it.

The same with all documentation, etc.

aliljet · 2026-04-18T17:43:51 1776534231

What open models are truly competing with both Claude Code and Opus 4.7 (xhigh) at this stage?

xvector · 2026-04-19T03:39:59 1776569999

Spent a lot of time with "open models." None of them come close. They are benchmaxxed. But you won't hear many of the open model fans on HN admit this.

The open model mentality is also just so bizarre to me. You're going to use an inferior model to save, what, a couple hundred bucks a month? Is your time really worth that little?

No one working on a serious project at a serious company is downgrading their agent's intelligence for a marginal cost saving. Downgrading your model is like downgrading the toilet paper on your yacht.

tredre3 · 2026-04-19T04:26:58 1776572818

> The open model mentality is also just so bizarre to me. You're going to use an inferior model to save, what, a couple hundred bucks a month? Is your time really worth that little?

I agree that people who claim that open models are as good as claude/openai/z are lying, delusional, or not doing very much. I've tried them all, included GLM 5.1.

GLM is not bad but the hardware needed will never recoup the ROI vs just using a commercial provider through its API.

That being said, you're being reductive here. For many use cases local models offer advantages that can't obtained through a commercial API : Privacy, ownership of the entire stack, predictability. They can't be rugpulled, they can't snitch on you. They will not give you 503.

Those advantages are very valuable for things like a local assistant, as an agent, for data extraction, for translations, for games (role playing and whatnot), etc.

That being said I know that many people are like you, they don't give a second thought about privacy. They'd plug Anthropic to their brain if they could. So I understand the sentiment. I just think that you should in turn try to understand why someone would use an open model.

WarmWash · 2026-04-19T04:08:00 1776571680

Glm 5.1 getting 5% on ARC-AGI 2 private is all anyone needs to know.

parinporecha · 2026-04-18T18:10:57 1776535857

I've had a good experience with GLM-5.1. Sure it doesn't match xhigh but comes close to 4.6 at 1/3rd the cost

slopinthebag · 2026-04-19T08:58:43 1776589123

1/3? Try 2/13 :P

5.1 is like $4 / 1m output, Opus 4.6 is $25. GPT 5.4 pro is $270 with large contexts :O

esafak · 2026-04-18T18:09:16 1776535756

GLM 5.1 competes with Sonnet. I'm not confident about Opus, though they claim it matches that too.

ojosilva · 2026-04-18T18:55:46 1776538546

I have it as failover to Opus 4.6 in a Claude proxy internally. People don't notice a thing when it triggers, maybe a failed tool call here and there (harness remains CC not OC) or a context window that has gone over 200k tokens or an image attachment that GLM does not handle, otherwise hunky-dory all the way. I would also use it as permanent replacement for haiku at this proxy to lower Claude costs but have not tried it yet. Opus 4.7 has shaken our setup badly and we might look into moving to Codex 100% (GLM could remain useful there too).

Someone1234 · 2026-04-18T18:01:36 1776535296

That's a lame attitude. There are local models that are last year's SOTA, but that's not good enough because this year's SOTA is even better yet still...

I've said it before and I'll say it again, local models are "there" in terms of true productive usage for complex coding tasks. Like, for real, there.

The issue right now is that buying the compute to run the top end local models is absurdly unaffordable. Both in general but also because you're outbidding LLM companies for limited hardware resources.

You have a $10K budget, you can legit run last year's SOTA agentic models locally and do hard things well. But most people don't or won't, nor does it make cost effective sense Vs. currently subsidized API costs.

gbro3n · 2026-04-18T18:10:55 1776535855

I completely see your point, but when my / developer time is worth what it is compared to the cost of a frontier model subscription, I'm wary of choosing anything but the best model I can. I would love to be able to say I have X technique for compensating for the model shortfall, but my experience so far has been that bigger, later models out perform older, smaller ones. I genuinely hope this changes through. I understand the investment that it has taken to get us to this point, but intelligence doesn't seem like it's something that should be gated.

Someone1234 · 2026-04-18T18:19:52 1776536392

Right; but every major generation has had diminishing returns on the last. Two years ago the difference was HUGE between major releases, and now we're discussing Opus 4.6 Vs. 4.7 and people cannot seem to agree if it is an improvement or regression (and even their data in the card shows regressions).

So my point is: If you have the attitude that unless it is the bleeding edge, it may have well not exist, then local models are never going to be good enough. But truth is they're now well exceeding what they need to be to be huge productivity tools, and would have been bleeding edge fairly recently.

gbro3n · 2026-04-18T18:48:03 1776538083

I feel like I'm going to have to try the next model. For a few cycles yet. My opinion is that Opus 4.7 is performing worse for my current work flow, but 4.6 was a significant step up, and I'd be getting worse results and shipping slower if I'd stuck with 4.5. The providers are always going to swear that the latest is the greatest. Demis Hassabis recently said in an interview that he thinks the better funded projects will continue to find significant gains through advanced techniques, but that open source models figure out what was changed after about 6 months or so. We'll see I guess. Don't get me wrong, I'd love to settle down with one model and I'd love it to be something I could self host for free.

dakiol · 2026-04-18T19:26:40 1776540400

> I completely see your point, but when my / developer time is worth what it is compared to the cost of a frontier model subscription, I'm wary of choosing anything but the best model I can.

Don't you understand that by choosing the best model we can, we are, collectively, step by step devaluating what our time is worth? Do you really think we all can keep our fancy paychecks while keep using AI?

gbro3n · 2026-04-18T20:04:52 1776542692

Do you think if you or me stopped using AI that everyone else will too? We're still what we always were - problem solvers who have gained the ability to learn and understand systems better that the general population, communicate clearly (to humans and now AIs). Unfortunately our knowledge of language APIs and syntax has diminished in value, but we have so many more skills that will be just as valuable as ever. As the amount of software grows, so will the need for people who know how to manage the complexity that comes with it.

lelanthran · 2026-04-18T22:53:45 1776552825

> Unfortunately our knowledge of language APIs and syntax has diminished in value, but we have so many more skills that will be just as valuable as ever.

There were always jobs that required those "many more skills" but didn't require any programming skills.

We call those people Business Analysts and you could have been doing it for decades now. You didn't, because those jobs paid half what a decent/average programmer made.

Now you are willingly jumping into that position without realising that the lag between your value (i.e. half your salary, or less) would eventually disappear.

gbro3n · 2026-04-19T07:15:12 1776582912

I guess we will need to wait and see if AI can remove ALL of the complexity that requires a software engineer over a business analyst. I can't currently believe that it will. BA's I've worked with vary in technical capability from 'having coded before and understanding DB schema basics and network architecture' to 'I know how the business works but nothing about computers'. If we got to the point in the future where every computer system ran on the same frameworks in the same way, and AI understood it perfectly, then maybe. But while AI is a probabilistic technology manipulating deterministic systems, we will always need people to understand whats going on, and whether they write a lot of code or not, they will be engineers, not analysts. Whether it's more or less of those people, we will see.

lelanthran · 2026-04-19T07:57:06 1776585426

> If we got to the point in the future where every computer system ran on the same frameworks in the same way, and AI understood it perfectly, then maybe.

They don't need to all run on the same frameworks, they just need to run on documented frameworks.

What possible value can you bring to a BA?

The system topology (say, if the backend was microservices vs Lambda vs something-else)? The LLM can explain to the BA what their options are, and the impact of those options.

The framework being used (Vue, or React, or something else)? The AI can directly twiddle that for the BA.

Solving a problem? If the observability is setup, the LLM can pinpoint almost all the problems too,and with a separate UAT or failover-type replica, can repro, edit, build, deploy and test faster than you can.

Like I already said, if[1] you're now able to build or enhance a system without actually needing programming skills, why are you excited about that? You could always do that. It's just that it pays half what programming skills gets you.

You (and many others who boast about not writing code since $DATE) appear to be willingly moving to a role that already pays less, and will pay even less once the candidates for that role double (because now all you programmers are shifting towards it).

It's supply and demand, that's all.

--------------

[1] That's a very big "If", I think. However, the programmers who are so glad to not program appear to believe that it's a very small "If", because they're the ones explaining just how far the capabilities have come in just a year, and expect the trend to continue. Of course, if the SOTA models never get better than what we have now, then, sure - your argument holds - you'll still provide value.

aliljet · 2026-04-18T18:46:27 1776537987

First, making sure to offer an upvote here. I happen to be VERY enthusiastic about local models, but I've found them to be incredibly hard to host, incredibly hard to harness, and, despite everything, remarkably powerful if you are willing to suffer really poor token/second performance...

wellthisisgreat · 2026-04-18T19:01:01 1776538861

> that are last year's SOTA

Early last year or late last year?

opus 4.5 was quite a leap

HWR_14 · 2026-04-18T19:34:31 1776540871

$10k is a lot of tokens.

sscaryterry · 2026-04-18T19:52:58 1776541978

At the rate its consuming now, I'd probably blow $10k in a month easy.

leonidasv · 2026-04-18T19:16:28 1776539788

>perhaps we can come up with something like the "linux/postgres/git/http/etc" of the LLMs

I fear that this may not be feasible in the long term. The open-model free ride is not guaranteed to continue forever; some labs offer them for free for publicity after receiving millions in VC grants now, but that's not a sustainable business model. Models cost millions/billions in infrastructure to train. It's not like open-source software where people can just volunteer their time for free; here we are talking about spending real money upfront, for something that will get obsolete in months.

Current AI model "production" is more akin to an industrial endeavor than open-source arrangements we saw in the past. Until we see some breakthrough, I'm bearish on "open models will eventually save us from reliance on big companies".

falkensmaize · 2026-04-18T19:59:04 1776542344

"get obsolete in months"

If you mean obsolete in the sense of "no longer fit for purpose" I don't think that's true. They may become obsolete in terms of "can't do hottest new thing" but that's true of pretty much any technology. A capable local model that can do X will always be able to do X, it just may not be able to do Y. But if X is good enough to solve your problem, why is a newer better model needed?

I think if we were able to achieve ~Opus 4.6 level quality in a local model that would probably be "good enough" for a vast number of tasks. I think it's debatable whether newer models are always better - 4.7 seems to be somewhat of a regression for example.

sergiotapia · 2026-04-18T18:49:35 1776538175

I can recommend this stack. It works well with the existing Claude skills I had in my code repos:

1. Opencode

2. Fireworks AI: GLM 5.1

And it is SIGNIFICANTLY cheaper than Claude. I'm waiting eagerly for something new from Deepseek. They are going to really show us magic.

dirasieb · 2026-04-18T18:51:15 1776538275

it is also significantly less capable than claude

dakiol · 2026-04-18T19:28:54 1776540534

That's fine. When the "best of the best" is offered only by a couple of companies that are not looking into our best interests, then we can discard them

ben8bit · 2026-04-18T17:18:29 1776532709

Any recommendations on good open ones? What are you using primarily?

culi · 2026-04-18T18:06:41 1776535601

LMArena actually has a nice Pareto distribution of ELO vs price for this

  model                        elo   $/M
  ---------------------------------------
  glm-5.1                      1538  2.60
  glm-4.7                      1440  1.41
  minimax-m2.7                 1422  0.97
  minimax-m2.1-preview         1392  0.78
  minimax-m2.5                 1386  0.77
  deepseek-v3.2-thinking       1369  0.38
  mimo-v2-flash (non-thinking) 1337  0.24

https://arena.ai/leaderboard/code?viewBy=plot&license=open-s...

logicprog · 2026-04-18T19:25:05 1776540305

LMArena isn't very useful as a benchmark, however I can vouch for the fact that GLM 5.1 is astonishingly good. Several people I know who have a $100/mo Claude Code subscription are considering cancelling it and going all in on GLM, because it's finally gotten (for them) comparable to Opus 4.5/6. I don't use Opus myself, but I can definitely say that the jump from the (imvho) previous best open weight model Kimi K2.5 to this is otherworldly — and K2.5 was already a huge jump itself!

blahblaher · 2026-04-18T17:21:50 1776532910

qwen3.5/3.6 (30B) works well,locally, with opencode

zozbot234 · 2026-04-18T17:31:08 1776533468

Mind you, a 30B model (3B active) is not going to be comparable to Opus. There are open models that are near-SOTA but they are ~750B-1T total params. That's going to require substantial infrastructure if you want to use them agentically, scaled up even further if you expect quick real-time response for at least some fraction of that work. (Your only hope of getting reasonable utilization out of local hardware in single-user or few-users scenarios is to always have something useful cranking in the background during downtime.)

pitched · 2026-04-18T17:49:05 1776534545

For a business with ten or more engineers/people-using-ai, it might still make sense to set this up. For an individual though, I can’t imagine you’d make it through to positive ROI before the hardware ages out.

zozbot234 · 2026-04-18T18:01:56 1776535316

It's hard to tell for sure because the local inference engines/frameworks we have today are not really that capable. We have barely started exploring the implications of SSD offload, saving KV-caches to storage for reuse, setting up distributed inference in multi-GPU setups or over the network, making use of specialty hardware such as NPUs etc. All of these can reuse fairly ordinary, run-of-the-mill hardware.

DeathArrow · 2026-04-18T18:42:10 1776537730

Since you need at least a few of H100 class hardware, I guess you need at least few tens of coders to justify the costs.

pitched · 2026-04-19T02:35:39 1776566139

I see the 512GB Mac Studios aren’t for sale anymore but that was a much cheaper path

cyberax · 2026-04-18T18:39:05 1776537545

I'm backing up a big dataset onto tapes, so I wanted to automate it. I have an idle 64Gb VRAM setup in my basement, so I decided to experiment and tasked it with writing an LTFS implementation. LTFS is an open standard for filesystems for tapes, and there's an implementation in C that can be used as the baseline.

So far, Qwen 3.6 created a functionally equivalent Golang implementation that works against the flat file backend within the last 2 days. I'm extremely impressed.

Gareth321 · 2026-04-18T21:10:35 1776546635

It is surprisingly competent. It's not Opus 4.6 but it works well for well structured tasks.

wuschel · 2026-04-18T18:21:00 1776536460

What near SOTA open models are you referring to?

pitched · 2026-04-18T17:35:41 1776533741

I want to bump this more than just a +1 by recommending everyone try out OpenCode. It can still run on a Codex subscription so you aren’t in fully unfamiliar territory but unlocks a lot of options.

zozbot234 · 2026-04-18T17:39:41 1776533981

The Codex TUI harness is also open source and you can use open models with it, so you can stay in even more familiar territory.

pwython · 2026-04-18T17:55:36 1776534936

pi-coding-agent (pi.dev) is also great. I've been using it with Gemma 4 and Qwen 3.6.

equasar · 2026-04-18T22:16:11 1776550571

The thing I dislike about OpenCode is the lack of capabilities of their editor, also, resource intensive, for some reason on a VM it chuckles each 30 mins, that I need to discard all sessions, commits, etc.

I don't know if it is bun related, but in task manager, is the thing that is almost at the top always on CPU usage, turns out for me, bun is not production ready at all.

Wish Zed editor had something like BigPickle which is free to use without limits.

Jarred · 2026-04-19T06:38:33 1776580713

> turns out for me, bun is not production ready

What issue did you run into?

jherdman · 2026-04-18T17:39:23 1776533963

Is this sort of setup tenable on a consumer MBP or similar?

danw1979 · 2026-04-18T17:55:16 1776534916

Qwen’s 30B models run great on my MBP (M4, 48GB) but the issue I have is cooling - the fan exhaust is straight onto the screen, which I can’t help thinking will eventually degrade it, given the thermal cycling it would go through. A Mac Studio makes far more sense for local inference just for this reason alone.

pitched · 2026-04-18T17:44:22 1776534262

For a 30B model, you want at least 20GB of VRAM and a 24GB MBP can’t quite allocate that much of it to VRAM. So you’d want at least a 32GB MBP.

richardfey · 2026-04-18T18:30:58 1776537058

I have 24GB VRAM available and haven't yet found a decent model or combination. Last one I tried is Qwen with continue, I guess I need to spend more time on this.

_blk · 2026-04-18T18:10:17 1776535817

Is there any model that practically compares to Sonnet 4.6 in code and vision and runs on home-grade (12G-24G) cards?

macwhisperer · 2026-04-18T20:34:15 1776544455

im currently running a custom Gemma4 26b MoE model on my 24gb m2... super fast and it beat deepseek, chatgpt, and gemini in 3 different puzzles/code challenges I tested it on. the issue now is the low context... I can only do 2048 tokens with my vram... the gap is slowly closing on the frontier models

zozbot234 · 2026-04-18T17:49:52 1776534592

It's a MoE model so I'd assume a cheaper MBP would simply result in some experts staying on CPU? And those would still have a sizeable fraction of the unified memory bandwidth available.

pitched · 2026-04-18T18:04:10 1776535450

I haven’t tried this myself yet but you would still need enough non-vram ram available to the cpu to offload to cpu, right? This is a fully novice question, I have not ever tried it.

tredre3 · 2026-04-19T04:37:28 1776573448

You're correct. If you don't have enough RAM for the model, it can still run but most of it will run on the CPU and be continuously reloaded from the SSD (through mmap).

A medium MoE like 35B can still achieve usable speeds in that setup, mind you, depending on what you're doing.

Gareth321 · 2026-04-18T21:11:33 1776546693

The Mac Minis (probably 64GB RAM) are the most cost effective.

cpursley · 2026-04-18T17:36:03 1776533763

How are you running it with opencode, any tips/pointers on the setup?

cmrdporcupine · 2026-04-18T17:32:47 1776533567

GLM 5.1 via an infra provider. Running a competent coding capable model yourself isn't viable unless your standards are quite low.

myaccountonhn · 2026-04-18T18:15:06 1776536106

What infra providers are there?

elbear · 2026-04-18T18:31:00 1776537060

There's DeepInfra. There's also OpenRouter where you can find several providers.

DeathArrow · 2026-04-18T18:38:50 1776537530

I am using GLM 5.1 and MiniMax 2.7.