mort96 · 2026-04-13T18:50:42 1776106242

I can't even read it because you either have to accept all tracking or pay a subscription fee. Pretty sure that's against the GDPR? Anyway, not a good look.

masfuerte · 2026-04-13T21:42:26 1776116546

It works fine with js disabled.

soco · 2026-04-13T19:13:23 1776107603

Isn't GDPR an EU thing?

mort96 · 2026-04-13T19:19:32 1776107972

Well an EU/EEA thing. And I'm in the EEA, so it applies when I visit The Guardian.

mort96 · 2026-04-13T15:26:48 1776094008

We do not need vibe-coded critical infrastructure.

rl3 · 2026-04-13T22:31:42 1776119502

>> ...give him unlimited model access

>We do not need vibe-coded critical infrastructure.

I think when you have virtually unlimited compute, it affords the ability to really lock down test writing and code review to a degree that isn't possible with normal vibe code setups and budgets.

That said for truly critical things, I could see a final human review step for a given piece of generated code, followed by a hard lock. That workflow is going to be popular if it already isn't.

mort96 · 2026-04-13T22:45:58 1776120358

The availability or lack thereof of compute has absolutely nothing to do with my opinion. More vibe coded tests don't fix the problem.

rl3 · 2026-04-13T22:52:08 1776120728

It might when an individual function has 50 different models reviewing it, potentially multiple times each.

Perhaps part of a complex review chain for said function that's a few hundred LLM invocations total.

So long as there's a human reviewing it at the end and it gets locked, I'd argue it ultimately doesn't matter how the code was initially created.

There's a lot of reasons it would matter before it gets to that point, just more to do with system design concerns. Of course, you could also argue safety is an ongoing process that partially derives from system design and you wouldn't be wrong.

It occurred to me there's some recent prior art here:

https://news.ycombinator.com/item?id=47721953

It's probably fair to say the Linux kernel is critical infra, or at least a component piece in a lot of it.

mort96 · 2026-04-13T23:07:28 1776121648

I do not care how strong your vibes are and how many claudes you have producing slop and reviewing each others' slop. I do not think vibe coding is appropriate for critical infrastructure. I don't understand why you think telling me you'd have more slop would make me appreciate it more.

falcor84 · 2026-04-13T15:31:10 1776094270

As I see it, the focus should not be about the coding, but about the testing, and particularly the security evaluation. Particularly for critical infrastructure, I would want us to have a testing approach that is so reliable that it wouldn't matter who/what wrote the code.

jbvlkt · 2026-04-13T21:09:16 1776114556

I have been thinking about that lately: aren't testing and security evaluation a way harder problem than designing and carefully implementing new features? I think that vibecoding automates the easiest step in SW development while making the more challenging/expensive steps harder. How are we supposed to debug complex problems in critical infrastructure if no one understands the code? It is possible that in the future agents will be able to do that, but it feels to me that we are not there yet.

bawolff · 2026-04-13T15:49:01 1776095341

I don't think that will ever be possible.

At some point security becomes - the program does the thing the human wanted it to do but didn't realize they didn't actually want.

No amount of testing can fix logic bugs due to bad specification.

skrtskrt · 2026-04-13T17:25:42 1776101142

AI as advanced fuzz-testing is ridiculously helpful though - hardly any bug you can find in this sort of advanced system is a specification logic bug. It's low-level security-based stuff, finding ways to DDOS a local process, or work around OS-level security restrictions, etc.

bawolff · 2026-04-13T19:25:31 1776108331

I'm kind of doubtful that AI is all that great at fuzz testing. Putting that aside though, we are talking about web browsers here. Security issues from bad specification or misunderstanding the specification are relatively common.

thephyber · 2026-04-13T19:03:19 1776106999

Re-read the thread you are replying to.

Each of the last 4 comments in your thread (including yours) is conflating what it means by AI.

falcor84 · 2026-04-13T16:20:55 1776097255

Well, yes, agreed - that is the essential domain complexity.

But my argument is that we can work to minimize the time we spend on verifying the code-level accidental complexity.

bawolff · 2026-04-13T17:21:04 1776100864

Sure, but that is what we've been doing since the early 2000s (e.g. ASLR, read-only stacks, static analysis, etc).

And we've had some successes, but I wouldn't expect any game-changing breakthroughs any time soon.

mort96 · 2026-04-13T15:33:58 1776094438

I disagree. Thorough testing provides some level of confidence that the code is correct, but there's immense value in having infrastructure which some people understand because they wrote it. No amount of process around your vibe slop can provide that.

px43 · 2026-04-13T15:56:40 1776095800

That's just status quo, which isn't really holding up in the modern era IMO.

I'm sure we'll have vibed infrastructure and slow infrastructure, and one of them will burn down more frequently. Only time will tell who survives the onslaught and who gets dropped, but I personally won't be making any bets on slow infrastructure.

falcor84 · 2026-04-13T16:18:34 1776097114

I somewhat agree, but even then would argue that the proper level at which this understanding should reside is at the architecture and data flow invariants levels, rather than the code itself. And these can actually be enforced quite well as tests against human-authored diagrammatical specs.

t43562 · 2026-04-13T16:38:10 1776098290

If you don't fully understand the code how do you know it implements your architecture exactly and without doing it in a way that has implications you hadn't thought of?

As a trivial example I just found a piece of irrelevant crap in some code I generated a couple of weeks ago. It worked in the simple cases which is why I never spotted it but would have had some weird effects in more complicated ones. It was my prompting that didn't explain well enough perhaps but how was I to know I failed without reading the code?

jbvlkt · 2026-04-13T21:46:32 1776116792

Exactly. We do not have any artifact other than code which can be deterministically converted to a program. That is the reason we still have to read the code. The prompt is not the final product of the development process.

mort96 · 2026-04-13T16:35:22 1776098122

I disagree. The code itself matters too.

rafaelmn · 2026-04-13T15:46:43 1776095203

If you're trusting core contributors without AI I don't see why you wouldn't trust them with it.

Hiring a few core devs to work on it should be a rounding error to Anthropic and a huge flex if they are actually able to deliver.

mort96 · 2026-04-13T16:36:27 1776098187

I trust people to understand the code they write. I don't trust them to understand code they didn't write.

t43562 · 2026-04-13T16:42:46 1776098566

It's extremely tempting to write stuff and not bother to understand it similar to the way most of us don't decompile our binaries and look at the assembler when we write C/C++.

So, should I trust an LLM as much as a C compiler?

jddj · 2026-04-13T18:41:37 1776105697

What if it impairs judgement?

andai · 2026-04-13T18:04:35 1776103475

They're getting really good at proofs and theorems, right?

IshKebab · 2026-04-13T20:07:22 1776110842

Proofs/theorems and memory safety vulnerabilities are a special case because there's an easy way to verify whether the model is bullshitting or not.

That's not true for coding in general. The best you can do is having unreasonably good test coverage, but the vast majority of code doesn't have that.

scrame · 2026-04-13T15:58:24 1776095904

Unfortunately we're going to get it whether or not we need it.

teaearlgraycold · 2026-04-13T17:54:38 1776102878

Well if the big players want to tell me their models are nearly AGI they need to put up or shut up. I don't want a stochastically downloaded C compiler. I want tech that improves something.

mort96 · 2026-04-13T15:04:59 1776092699

Yeah, the closest thing you get today is arguably WebKitGTK, which is known for being not exactly great.

mort96 · 2026-04-13T15:02:47 1776092567

What do you mean by "production ready" here exactly? In a web browser context, the JS engine is expected to have a high performance optimising JIT compiler. Do the existing Rust JS engines have that?

8NNTt8z3QvLT8tp · 2026-04-13T15:26:02 1776093962

There's something to be said for the security benefits of not having a JIT though. Especially if you've used Rust for the engine you should have pretty solid security.

px43 · 2026-04-13T16:09:25 1776096565

Yeah, having a code section that is writable and executable is a huge no-no from a security standpoint. JIT is a fundamentally insecure concept, just in general. By definition it's trading security for speed.

epcoa · 2026-04-13T18:59:49 1776106789

swiftcoder · 2026-04-13T15:05:27 1776092727

I honestly don't know, but they do say "production ready" on their marketing pages, so...

For an example of what I mean, see JetCrab: https://jetcrab.com

CryZe · 2026-04-13T15:18:11 1776093491

This doesn't implement a JS engine, it's just a wrapper around boa.

mort96 · 2026-04-13T15:14:43 1776093283

That page says:

> Complete JavaScript execution pipeline from source code parsing to bytecode execution.

So it's a bytecode interpreter, not a JIT.

It might still be production ready for a bunch of use cases. I may use it as a scripting layer for some pluggable piece of software or a game. I wouldn't consider it appropriate for a "production ready web browser" which intends to compete with Firefox and Chrome.

EDIT: Also for some reason all its components are called v8_something? That's pretty off-putting, you can't just take another project's name like that. And from the author's Reddit comments it seems to be mostly AI slop anyway. I'm guessing Claude wrote the "production ready" part on the website, I wouldn't trust it.

mort96 · 2026-04-13T14:55:11 1776092111

The fundamental problem with Rust versioning is that 0.3.5 is compatible with 0.3.6, but not 0.4.0 or 1.0.0; when major version is 0, the minor takes the role of major and patch takes the role of minor. So packages iterate through 0.x versions, and eventually, they reach a version that's "stable".

If version 0.7 turned out to hit the right API and not require backward incompatible changes, releasing a version 1.0 would be as disruptive as a major version change to your users and communicate through version semantics that it is a breaking change.

Semver declares that version 0.x is for initial development where there is no stability guarantee at all. This is the right semantics for a versioning system, but Cargo doesn't follow this part of semver. Providing stability guarantees throughout the 0.x cycle inevitably results in projects getting stuck in 0.x.

This is one of my biggest gripes with Cargo. But Rust people seem to universally consider it a non-issue so I don't think it'll ever be fixed.

sheepscreek · 2026-04-13T15:05:39 1776092739

> The fundamental problem with Rust versioning is that 0.3.5 is compatible with 0.3.6, but not 0.4.0 or 1.0.0

That’s a feature of semver, not a bug :)

Long answer: You are right to notice that minor versions within a major release can introduce new APIs and changes but generally, should not break existing APIs until the next major release.

However, this rule only applies to libraries after they reach 1.0.0. Before 1.0.0, one shouldn’t expect any APIs to be frozen really.

mort96 · 2026-04-13T15:10:52 1776093052

No, it's explicitly not. Semver says:

> Major version zero (0.y.z) is for initial development. Anything MAY change at any time. The public API SHOULD NOT be considered stable.

Cargo is explicitly breaking with Semver by considering 0.3.5 compatible with 0.3.6.

demurgos · 2026-04-13T15:45:53 1776095153

To go further, semver provides semantics and an ordering but it says nothing about version requirement syntax. The caret operator to describe a range of versions is not part of the spec. It was introduced by initial semver-aware package managers such as npm or gem. Cargo decided to default to the caret operator, but it's still the caret operator.

In practice, there's no real issue with using the first non-zero component to define the group of API-compatible releases and most package managers agree on the semantics.
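
The "first non-zero component" rule discussed in this subthread can be sketched in a few lines. This is a simplified model of a caret requirement, not Cargo's actual resolver: it ignores pre-release and build metadata, and the names `Version`, `parse_version` and `caret_compatible` are made up for illustration.

```rust
// Simplified model of caret ("^") compatibility. Assumption: plain
// MAJOR.MINOR.PATCH versions only, no pre-release or build metadata.
// Field order matters: the derived ordering compares major, then minor, then patch.
#[derive(Debug, Clone, Copy, PartialEq, Eq, PartialOrd, Ord)]
struct Version {
    major: u64,
    minor: u64,
    patch: u64,
}

// Hypothetical helper: parses "1.2.3" into a Version, rejecting anything else.
fn parse_version(s: &str) -> Option<Version> {
    let mut parts = s.split('.').map(|p| p.parse::<u64>().ok());
    let version = Version {
        major: parts.next()??,
        minor: parts.next()??,
        patch: parts.next()??,
    };
    // Reject trailing components such as "1.2.3.4".
    if parts.next().is_some() {
        return None;
    }
    Some(version)
}

// Returns true if `candidate` satisfies the caret requirement `^required`:
// it must not be older, and it must share the first non-zero component.
fn caret_compatible(required: Version, candidate: Version) -> bool {
    if candidate < required {
        return false;
    }
    if required.major > 0 {
        // ^1.2.3 accepts any 1.x.y that is at least 1.2.3.
        candidate.major == required.major
    } else if required.minor > 0 {
        // ^0.3.5 accepts any 0.3.z that is at least 0.3.5.
        candidate.major == 0 && candidate.minor == required.minor
    } else {
        // ^0.0.z accepts only that exact version.
        candidate == required
    }
}

fn main() {
    let required = parse_version("0.3.5").expect("valid version");
    for candidate in ["0.3.6", "0.4.0", "1.0.0"] {
        let version = parse_version(candidate).expect("valid version");
        println!("^0.3.5 accepts {}: {}", candidate, caret_compatible(required, version));
    }
}
```

Under this model 0.3.6 satisfies `^0.3.5` while 0.4.0 and 1.0.0 do not, which is the behavior being debated above.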

steveklabnik · 2026-04-13T16:25:27 1776097527

Thank you.

Eventually this will get cleared up. I’m closer than I’ve ever been to actually handling this, but it’s been 9 years already, so what’s another few months…

kibwen · 2026-04-13T19:11:33 1776107493

> If version 0.7 turned out to hit the right API and not require backward incompatible changes, releasing a version 1.0 would be as disruptive as a major version change

Nope, this is what the semver trick is for: https://github.com/dtolnay/semver-trick

TL;DR: You take the 0.7 library, release it as 1.0, then make a 0.7.1 release that does nothing other than depend on 1.0 and re-export all its items. Tada, a compatible 1.0 release that 0.7 users will get automatically when they upgrade.

Even more interesting is that you can use this to coordinate only partially-breaking changes, e.g. if you have 100 APIs in your library but only make a breaking change to one, you can re-export the 99 unbroken APIs and only end up making breaking changes in practice for users who actually use the one API with breaking changes.
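
A rough single-file sketch of the re-export idea, assuming a made-up library: the two modules stand in for two published releases of a hypothetical crate `mylib`. In a real setup they would be separate crate versions, with 0.7.1 declaring a dependency on 1.0 in its manifest. `Token`, `parse` and `consume` are illustrative names only.

```rust
// Stand-in for the hypothetical crate `mylib` at version 1.0.0:
// this is where the real definitions live.
mod mylib_v1 {
    #[derive(Debug, PartialEq)]
    pub struct Token(pub u32);

    pub fn parse(input: &str) -> Option<Token> {
        input.trim().parse::<u32>().ok().map(Token)
    }
}

// Stand-in for `mylib` 0.7.1: it defines nothing itself and only
// re-exports the items from 1.0.
mod mylib_v0_7_1 {
    pub use super::mylib_v1::{parse, Token};
}

// A downstream function written against the 1.0 names.
fn consume(token: mylib_v1::Token) -> u32 {
    token.0
}

fn main() {
    // A value produced through the 0.7.1 path...
    let token = mylib_v0_7_1::parse("42").expect("valid number");
    // ...is accepted where the 1.0 type is expected, because a re-export
    // names the same type rather than a copy of it.
    println!("{}", consume(token));
}
```

The point of the sketch is that 0.7.1 users and 1.0 users end up sharing one set of types, so values can cross between them without conversion.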

Starlevel004 · 2026-04-13T15:19:05 1776093545

The standard library has a whole bunch of tools to let them test and evolve APIs with a required-opt in, but every single ecosystem package has to get it right first try because Cargo will silently forcibly update packages and those evolution tools aren't available to third party packages.

Such a stupid state of affairs.

moron4hire · 2026-04-13T15:05:16 1776092716

Personally, I think the 0 major version is a bad idea. I hear the desire to not want to have to make guarantees about stability in the early stages of development and you don't want people depending on it. But hiding that behind "v0.x" doesn't change the fact that you are releasing versions and people are depending on it.

If you didn't want people to depend on your package (hence the word "dependency") then why release it? If your public interface changes, bump that major version number. What are you afraid of? People taking your project seriously?

jaapz · 2026-04-13T15:19:17 1776093557

0.x is not that you don't want people depending on it, you just don't want them to come and complain when you quickly introduce some breaking changes. The project is still in development, it might be stable enough for use in "real projects(tm)", but it might also still significantly change. It is up to the user to decide whether they are OK with this.

1.x communicates (to me at least) you are pretty happy with the current state of the package and don't see any considerable breaking changes in the future. When 2.x comes around, this is often after 1.x has been in use for a long time and people have raised some pain points that can only be addressed by breaking the API.

OtomotO · 2026-04-13T15:46:25 1776095185

But people will complain, so ex falso quodlibet

moron4hire · 2026-04-13T15:23:59 1776093839

If you are at the point that other people can use your software, then you should use v1. If you are not ready for v1, then you shouldn't be releasing to other people.

Because this comment, "The project is still in development, it might be stable enough for use in "real projects(tm)", but it might also still significantly change." That describes every project. Every project is always in development. Every project is stable until it isn't. And when it isn't, you bump the major number.

the__alchemist · 2026-04-13T15:54:43 1776095683

I think we can come up with a reason why bumping the version number each breaking change isn't an elegant solution either: You would end up with version numbers in the hundreds or thousands.

zokier · 2026-04-13T18:47:22 1776106042

Browser version numbers are in the hundreds and it doesn't seem to be a problem.

the__alchemist · 2026-04-13T18:55:03 1776106503

Indeed! I think both 0-based versioning and this (maybe?) downside I bring up address the tension between wanting to limit the damage caused by breaking changes and retaining the ability to make them.

mort96 · 2026-04-13T15:13:01 1776093181

Versioning is communication. I find it useful to communicate, through using version 0.x, "this is not a production ready library and it may change at any time, I provide no stability guarantees". Why might I release it in that state? Because it might still be useful to people, and people who find it useful may become contributors.

moron4hire · 2026-04-13T15:21:34 1776093694

Any project may change at any time. That's why they bump from v1 to v2. But by not using the full precision of the version number, you're not able to communicate as clearly about releases. A minor release may not be 100% compatible with the previous version, but people still expect some degree of similarity such that migrating is not a difficult task. But going from v0.n to v0.(n+1) uses that field to communicate "hell, anything could happen, YOLO."

Nobody cares that Chrome's major version is 147.

mort96 · 2026-04-13T15:25:32 1776093932

By releasing a library with version 1.0, I communicate: "I consider this project to be in a state where it is reasonable to depend on it".

By releasing a library with version 0.x, I communicate: "I consider this project to be under initial development and would advise people not to depend on it unless you want to participate in its initial development".

I don't understand why people find this difficult or controversial.

steveklabnik · 2026-04-13T18:34:38 1776105278

There is additional subtlety here.

For example, sometimes projects that have a 0.y version get depended on a lot, and so moving to 1.0.0 can be super painful. This is the case with the libc crate in Rust, for which the 0.1.0 -> 0.2.0 transition was super painful for the ecosystem. Even though it should be a 1.0.0 crate, it is not, because the pain of causing an ecosystem split isn't considered to be worth the version number change.

mort96 · 2026-04-13T22:22:05 1776118925

Oh hey I recently saw a comment which discussed this exact issue: https://news.ycombinator.com/item?id=47752915

steveklabnik · 2026-04-13T22:44:51 1776120291

99% of the time this situation is okay, because Cargo allows you to have both 0.1 and 0.2 in the same project as dependencies. It's just packages that call out to external dependencies, like libc, where it enforces the single version rule.

mort96 · 2026-04-13T23:13:48 1776122028

You can have both 0.1 and 0.2 in the same project, but you really don't want to.

mort96 · 2026-04-12T12:29:08 1775996948

This looks like a motte-and-bailey. Your original comment, the bailey: "he posted much religious stuff from official Intel accounts". This comment, the motte: "he posted much religious stuff".

You could've just said, "I thought it was extremely strange that he posted so much religious stuff while CEO". That would've been a very defensible position. You didn't need the false part of posting from official Intel accounts.

Or, if it was an honest mistake, you should've written something along the lines of: "Sorry, I misremembered, it was his personal accounts. But when you're a CEO, I don't think the distinction matters much; anything you post will be read as 'Intel's CEO says'".

mort96 · 2026-04-12T12:25:32 1775996732

I don't understand what you mean. Stock price has tripled since last August based on Intel finally having a competitive architecture and a competitive process again, no? At least that in combination with various geopolitical circumstances. Sounds like Pat's decision resulted in Intel's stock price rising?

melling · 2026-04-12T13:08:27 1775999307

“The stock price has tripled since last August”

That’s just a statement of fact. I offered no other analysis.

mort96 · 2026-04-12T13:12:53 1775999573

This seems like analysis? "Pat killed Intel’s share price."

melling · 2026-04-12T15:36:09 1776008169

I suppose if you just look at a chart…

mort96 · 2026-04-12T16:14:29 1776010469

Then what?

mort96 · 2026-04-10T11:48:37 1775821717

Oh yes, I've also suddenly had my workspace switcher turn black! Never happened before but it has started happening lately.

macOS has never been bug free, but it feels like they've really been working hard to introduce new bugs lately.

mort96 · 2026-04-10T11:47:22 1775821642

Complaining about the distinction between apps and windows isn't a "stupid reason" to complain though.

Say I use Slack, Teams and Outlook. If I use their Electron versions, I switch between them with cmd+tab. If I use them in separate browser windows, I switch between them by using cmd+tab to switch to Firefox, then cmd+` to cycle through windows until I find the one I want. That's weird; how you switch between these three apps depends on the technical details of how you opened them? Why?

Say I have neovim, the mutt email client, and a shell open. These are three separate apps, but because they happen to run in a terminal emulator, I still have to cmd+tab to the terminal emulator, then cmd+` to cycle between them. They're semantically different applications in dedicated windows, but technical implementation details mean they belong to what macOS considers "the same app", just like the "apps in Firefox windows" example above.

It wouldn't be so bad if the cmd+` "cycle between windows in the app" feature worked well. But it doesn't. Unlike cmd+tab, it doesn't show a bar which you get to select from, it just instantly re-orders your windows; and it's impossible to select a window in another workspace. That means, if I have Slack open in Firefox in workspace 1 and Outlook open in Chrome in workspace 2, I can switch between Slack and Outlook with cmd+tab, but if I have Slack open in Firefox in workspace 1 and Outlook open in Firefox in workspace 2, there is no way to switch between Slack and Outlook. That's pretty bad.

igregoryca · 2026-04-10T12:50:22 1775825422

The (shift+)cmd+` order also resets to match the window z-order whenever you switch apps. So if the order is windows A, B, C, then you select window B, cmd+tab away, then cmd+tab back, the order will now be B, A, C.

I've developed an intuitive understanding of this, but I had to experiment just now to describe the behavior precisely. And my intuition is still wrong sometimes (like if the app has windows on multiple monitors, it's hard to predict the z-order).
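
The A, B, C example above can be modeled as a move-to-front list. This is only a sketch of the behavior as described in this comment, not of how macOS implements it; `bring_to_front` is a made-up helper and windows are represented as plain characters.

```rust
// Model of the reordering described above: selecting a window moves it to
// the front of the z-order and keeps the relative order of the others.
// Assumption: this mirrors the observed behavior, not macOS internals.
fn bring_to_front(order: &mut Vec<char>, selected: char) {
    if let Some(pos) = order.iter().position(|&w| w == selected) {
        let window = order.remove(pos);
        order.insert(0, window);
    }
}

fn main() {
    // Cycling order before selecting anything.
    let mut z_order = vec!['A', 'B', 'C'];
    // Select window B, switch to another app, then switch back.
    bring_to_front(&mut z_order, 'B');
    println!("{:?}", z_order);
}
```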

> if I have Slack open in Firefox in workspace 1 and Outlook open in Firefox in workspace 2, there is no way to switch between Slack and Outlook

My local maximum is to never use workspaces – just cmd+tab, cmd+`, and sometimes cmd+h to reduce screen clutter.

veber-alex · 2026-04-10T12:36:41 1775824601

I would also add to this that in order to open two instances of an app the app explicitly needs to support this. For example, you can't open 2 instances of Calculator.app side by side.

This is really annoying.

coldtea · 2026-04-10T14:38:06 1775831886

Yeah, I always want 2 calculator apps when I'm speed calculating... what?

mort96 · 2026-04-10T15:29:23 1775834963

You may want to see the result of one calculation while doing another calculation?

coldtea · 2026-04-10T23:04:05 1775862245

If only I could write it down in a text note and refer to it and many more, as opposed to keeping a calculator window open per prior calculation I want to refer to...

Or if only there was a tool like Soulver or Calca...

mort96 · 2026-04-10T23:28:02 1775863682

Yes you can write down the result of calculations and all relevant state and then clear the state of the calculator window before doing a new calculation. But why?

mort96 · 2026-04-10T11:36:14 1775820974

The closest thing you can do on macOS is to turn on "reduced motion". This doesn't remove any animations, it just replaces them all with fade animations which take the same amount of time.

fragmede · 2026-04-10T18:25:00 1775845500

and set it to 60hz instead of 120. And install the tool that the article links to.