More

steveharing1 · 2026-05-02T13:39:15 1777729155

nice work!

steveharing1 · 2026-05-02T13:34:34 1777728874

I really appreciate open source community for moving this fast

steveharing1 · 2026-04-30T15:55:00 1777564500

yes. I've used it a lot. its very simple and good

steveharing1 · 2026-04-30T15:51:25 1777564285

You can try Open WebUI. Its genuinely useful when it comes to running open models locally with a clean interface

RationPhantoms · 2026-04-30T16:22:35 1777566155

Yep, couple Open WebUI for general chats and OpenCode for software-specific tasks and it feels close to Claude Desktop and Claude Code.

steveharing1 · 2026-04-30T15:33:32 1777563212

even calling it roll of the dice is an assumption. Can you point anything you find as mistake?

lelanthran · 2026-04-30T16:06:27 1777565187

You expect people to read every single excretion, which can be generated faster than I can read,just to find the rare gem that might exist?

The problem is that in the past it took multiple times more effort and hours to write something than it took to read. That served two purposes:

1. Lazy people just looking for an audience were effectively gatekept from drowning the world with their every vapid thought.

2. Because supply was many times slower than consumption it was viable to give most articles a chance: the author could not drown me in a deluge even if they wanted to.

Having the criteria now that the author should spend at least as much effort creating the piece as they expect the reader expend reading it is a damn useful bar: instead of reading 1000 AI articles just to find the one good one, I can simply read 10 human authored articles and be certain that 9 of them have something worthwhile.

simonw · 2026-04-30T15:35:45 1777563345

No, because I'm not going to spend a bunch of my time fact-checking obvious AI slop.

joquarky · 2026-04-30T19:41:32 1777578092

Then don't complain.

simonw · 2026-04-30T19:56:11 1777578971

steveharing1 · 2026-04-30T13:30:09 1777555809

Thanks for letting me know

steveharing1 · 2026-04-30T10:57:03 1777546623

Yea, No doubt Qwen 3.6 open weights are far more strong

rnadomvirlabe · 2026-04-30T11:11:51 1777547511

Why no doubt?

captainbland · 2026-04-30T11:47:19 1777549639

No comparison with competitor models other than the previous granite version strongly implies that it does not compete well with other comparable models. At least this is the most reasonable assumption until data comes out to the contrary

2ndorderthought · 2026-04-30T12:02:29 1777550549

Qwen 36 is effectively a pocket sized frontier model. It's really surprising for me anyway

steveharing1 · 2026-04-30T11:30:31 1777548631

Because Qwen 3.6 pushes way above its weight. Granite 8B is impressive, but Qwen still wins on raw capability, especially for coding.

rnadomvirlabe · 2026-04-30T11:43:22 1777549402

You just asserted the same thing again. Why do you say this is the case?

noodletheworld · 2026-04-30T12:00:04 1777550404

Having tried it.

Qwen is really good.

Also, generally, it makes sense. 8B models are generally not very good^.

That this 8B model is decent is impressive, but that it could perform on par with a good model 4 times as large is a daydream.

^ - To be polite. The small models + tool use for coding agents are almost universally ass. Proof: my personal experience. Ive tried many of them.

meatmanek · 2026-04-30T17:10:19 1777569019

It's not that surprising that an 8B dense model would compete with a 35B-A3B MoE model.

The geometric mean rule of thumb for MoE models is that the intelligence level of an MoE model with T total parameters and A active parameters is roughly equivalent to that of a dense model with sqrt(A*T) parameters. For qwen3.6-35B-A3B, that equivalent size is 10.24B, spitting distance of an 8B model. Good training can make up the 28% difference in size.

irishcoffee · 2026-04-30T12:10:18 1777551018

So it’s just like, your opinion, man?

edit: It was a play on The Big Lebowski, folks.

Terretta · 2026-04-30T12:24:52 1777551892

College SAT scores do not tell you how the dev applying for your open back end systems engineering job is going to do once they're in your workplace harness.

Nor do class standings, nor hackerrank and the like.

What will tell you is asking them to fix a thing in your codebase. Once you ask an LLM to do that, a dozen times, I'd argue it's no longer "just your opinion man", it's a context-engineered performance x applicability assessment.

And it is very predictive.

But it's also why someone doing well at job A isn't necessarily going to be great at B, or bad at A doesn't mean will necessarily be bad at B.

I've often felt we should normalize a sort of mutual try-buy period where job-change seeker and company can spend a series of days without harming one's existing employment, to derisk the mutual learning. ESPECIALLY to derisk the career change for the applicant who only gets one timeline to manage, opposed to company that considers the applicant fungible.

But back to the LLM, yeah, the only valid opinion on whether it works for you is not benchmark, it's an informed opinion from 'using it in anger'.

noodletheworld · 2026-04-30T14:49:39 1777560579

> So it’s just like, your opinion, man?

Yes.

That is how you empirically evaluate tools; not by reading stupid benchmarks. By actually using the tools, for hours and hours. Doing real work.

Did you try using it? For hours? Do you use qwen?

How about you tell us about your experience with your great 8B models that you use daily. What coding agent harness do you have then hooked up to? What context size can you get before they lose track of whats happening? Do you swap between models for different coding tasks?

Or, have you not, actually, even actually tried any of this stuff, yourself?

irishcoffee · 2026-04-30T21:17:20 1777583840

Work pays for copilot, so I use copilot. I will never spend a penny of my own money on this stuff. If it is free, I'll use it.

I'll never use any free opensource anything from china ever, so fuck no I haven't used qwen.

robotmaxtron · 2026-04-30T12:39:37 1777552777

the (dead) internet is full of opinions exactly like this

brazukadev · 2026-04-30T12:44:20 1777553060

you tried qwen3.6 and you think it is not good?

robotmaxtron · 2026-04-30T12:53:48 1777553628

I do not have high opinions of any ai model.

2ndorderthought · 2026-04-30T12:08:36 1777550916

Qwen scores above sonnet in coding benchmarks. Runs locally. In personal use it's really good. Anecdotally others have used it to vibe code or agentic code successfully. Not toy problems. Not a toy model.

Qwen3.6 raises the bar for models of its size. There really isn't a comparison in my opinion.

albedoa · 2026-04-30T19:23:46 1777577026

Maybe you could tell him what you want instead of making him guess.

actionfromafar · 2026-04-30T11:40:43 1777549243

Way above its weights.

drittich · 2026-04-30T11:50:13 1777549813

Nanobanana for scale.

steveharing1 · 2026-04-29T09:27:05 1777454825

I think github is at a point that its too hard to ignore just like google is even though we might not like what they are doing now but we were the one made them this big.

steveharing1 · 2026-04-28T16:42:36 1777394556

Lets see when the community come up with more quantized versions. Waiting for Unsloth's version

steveharing1 · 2026-04-28T14:08:50 1777385330

It was released around a week ago but Xiaomi Open Sourced it Now & its MIT Licensed and available om HuggingFace