I've always said this but AI will win a fields medal before being able to manage...

vatsachak · 2026-03-28T21:23:24 1774733004

Tricks are nothing but patterns in the logical formulae we reduce.

Ergo these are latent vectors in our brain. We use analogies like geometry in order to use Algebraic Geometry to solve problems in Number Theory.

An AI trained on Lean Syntax trees might develop it's own weird versions of intuition that might actually properly contain ours.

If this sounds far fetched, look at Chess. I wonder if anyone has dug into StockFish using mechanistic interpretability

myffical · 2026-03-28T22:38:22 1774737502

Some DeepMind researchers used mechanistic interpretability techniques to find concepts in AlphaZero and teach them to human chess Grandmasters: https://www.pnas.org/doi/10.1073/pnas.2406675122

hodgehog11 · 2026-03-28T22:58:02 1774738682

This argument, that LLMs can develop new crazy strategies using RLVR on math problems (like what happened with Chess), turns out to be false without a serious paradigm shift. Essentially, the search space is far too large, and the model will need help to explore better, probably with human feedback.

https://arxiv.org/abs/2504.13837

narrator · 2026-03-28T23:44:50 1774741490

The search space for the game of Go was also thought to be too large for computers to manage.

thesz · 2026-03-29T07:18:51 1774768731

It still is [1].

[1] https://www.vice.com/en/article/a-human-amateur-beat-a-top-g...

stalfie · 2026-03-29T11:19:32 1774783172

The blind spot exploiting strategy you link to was found by an adverserial ML model...

sealeck · 2026-03-29T01:19:27 1774747167

Yes and making a horse drawn cart drive itself was thought to be impossible so why don't we have faster than light travel yet...

Finbel · 2026-03-29T06:43:48 1774766628

Yes but "the search space is too large" is something that has been said about innumerable AI-problems that were then solved. So it's not unreasonable that one doubts the merit of the statement when it's said for the umpteenth time.

hodgehog11 · 2026-03-29T07:27:19 1774769239

I should have been more specific then. The problem isn't that the search space is too large to explore. The problem is that the search space is so large that the training procedure actively prefers to restrict the search space to maximise short term rewards, regardless of hyperparameter selection. There is a tradeoff here that could be ignored in the case of chess, but not for general math problems.

This is far from unsolvable. It just means that the "apply RL like AlphaGo" attitude is laughably naive. We need at least one more trick.

vatsachak · 2026-03-30T00:12:54 1774829574

The other trick could be bootstrapping through mathlib.

As you said brute forcing the search space as the starting procedure would take way too long for the AI to build intuition.

But if we could give it a million or so lemmas of human math, that would be a great starting point.

throwaway27448 · 2026-03-29T08:20:25 1774772425

I agree that LLMs are a bad fit for mathematical reasoning, but it's very hard for me to buy that humans are a better fit than a computational approach. Search will always beat our intuition.

hodgehog11 · 2026-03-29T11:47:14 1774784834

Yes and no. I think we have vastly underestimated the extent of the search space for math problems. I also think we underestimate the degree to which our worldview influences the directions with which we attempt proofs. Problems are derived from constructions that we can relate to, often physically. Consequently, the technique in the solution often involves a construction that is similarly physical in its form. I think measure theory is a prime example of this, and it effectively unlocked solutions to a lot of long-standing statistical problems.

ineedasername · 2026-03-29T18:04:57 1774807497

That linked article says its about RLVR but then goes on to conflate other RL with it, and doesn't address much in the way of the core thinking that was in the paper they were partially responding to that had been published a month earlier[0] which laid out findings and theory reasonably well, including work that runs counter to the main criticism in the article you cited, ie, performance at or above base models only being observed with low K examples.

That said, reachability and novel strategies are somewhat overlapping areas of consideration, and I don't see many ways in which RL in general, as mainly practiced, improves upon models' reachability. And even when it isn't clipping weights it's just too much of a black box approach.

But none of this takes away from the question of raw model capability on novel strategies, only such with respect to RL.

[0] https://arxiv.org/pdf/2506.14245

slopinthebag · 2026-03-28T21:33:02 1774733582

Stockfish's power comes from mostly search, and the ML techniques it uses are mainly about better search, i.e. pruning branches more efficiently.

vatsachak · 2026-03-28T21:37:16 1774733836

The weights must still have some understanding of the chess board. Though there is always the chance that it makes no sense to us

emp17344 · 2026-03-28T21:53:22 1774734802

Why must it involve understanding? I feel like you’re operating under the assumption that functionalism is the “correct” philosophical framework without considering alternative views.

slopinthebag · 2026-03-28T21:41:18 1774734078

Even that is probably too much. It has no understanding of what "chess" is, or what a chess board is, or even what a game is. And yet it crushes every human with ease. It's pretty nuts haha.

anematode · 2026-03-28T21:51:35 1774734695

Actually, the neural net itself is fairly imprecise. Search is required for it to achieve good play. Here's an example of me beating Stockfish 18 at depth 1: https://lichess.org/XmITiqmi

Sopel · 2026-03-28T21:54:20 1774734860

chess is just a simple mathematical construct so that's not surprising

PowerElectronix · 2026-03-29T10:29:56 1774780196

There is no understanding, the weights are selected based on better fit. Our cells have no understanding of optics just because they have the eyes coded in their DNA.

hollerith · 2026-03-28T21:44:14 1774734254

Does Stockfish have weights or use a neural net? I know older versions did not.

Sopel · 2026-03-28T21:50:35 1774734635

Sopel · 2026-03-28T21:51:06 1774734666

The ML techniques it uses are only about evaluation, but you were close

hodgehog11 · 2026-03-28T23:00:43 1774738843

As a professional mathematician, I would say that a good proof requires a very good representation of the problem, and then pulling out the tricks. The latter part is easy to get operating using LLMs, they can do it already. It's the former part that still needs humans, and I'm perfectly fine with that.

vatsachak · 2026-03-30T00:03:27 1774829007

I guess I'm using Rota's vocabulary where he implicitly uses the word trick to mean representation

threethirtytwo · 2026-03-29T00:00:12 1774742412

But are you ok with the trendline of ai improvement? The speed of improvement indicates humans will only get further and further removed from the loop.

I see posts like your all the time comforting themselves that humans still matter, and every-time people like you are describing a human owning an ever shrinking section of the problem space.

hodgehog11 · 2026-03-29T07:38:52 1774769932

I used to be worried, but not so much anymore.

It used to be the case that the labs were prioritising replacing human creativity, e.g. generative art, video, writing. However, they are coming to realise that just isn't a profitable approach. The most profitable goal is actually the most human-oriented one: the AI becomes an extraordinarily powerful tool that may be able to one-shot particular tasks. But the design of the task itself is still very human, and there is no incentive to replace that part. Researchers talk a bit less about AGI now because it's a pointless goal. Alignment is more lucrative.

Basically, executives want to replace workers, not themselves.

latentsea · 2026-03-29T12:41:46 1774788106

On the contrary the depth and breadth we're becoming able to handle agentically now in software is growing very rapidly, to the point where in the last 3 months the industry has undergone a big transformation and our job functions are fundamentally starting to change. As a software engineer I feel increasingly like AGI will be a real thing within the next few years, and it's going to affect everyone.

threethirtytwo · 2026-03-29T19:55:23 1774814123

I don’t write code anymore. I don’t use ide’s anymore. The agent writes code. My job is to manage ai now.

The paradigm shift has already happened to me and there will be more shifts to come.

k33d · 2026-03-30T00:20:12 1774830012

"to the point where in the last 3 months the industry has undergone a big transformation "

Oh... this again.

latentsea · 2026-04-01T13:21:18 1775049678

If you look at those operating at the bleeding edge, it doesn't look anything like yesteryear. It's a real step change. Fully autonomous agentic software engineering is becoming a reality. While still in its infancy, some results are starting to be made public, and it's mind boggling. We're transitioning to a full agent-only workflow in my team at work. The engineering task has shifted from writing code to harness engineering, and essentially building a system that can safely build itself to a high quality given business requirements.

Up until recently I kinda feel like the scepticism was warranted, but after building my own harness that can autonomously produce decent quality software (at least for toy problem scale, granted), and getting hands on with autoresearch via writing a set of skills for it https://github.com/james-s-tayler/lazy-developer, I feel fundamentally different about software engineering than I did until relatively recently.

If you look at the step change from Sonnet 4.5 to Opus 4.5 and what that unlocked, and consider the rumoured Mythos model is apparently not just an incremental improvement, but another step change. Then pair it with infrastructure for operating agents at scale like https://github.com/paperclipai/paperclip and SOTA harnesses like the ones being written about on the blogs of the frontier labs... I mean... you tell me what you think is coming down the pipe?

tartoran · 2026-03-29T00:22:39 1774743759

Humans needing to ask new question due to curiosity push the boundaries further, find new directions, ways or motivations to explore, maybe invent new spaces to explore. LLMs are just tools that people use. When people are no longer needed AI serves no purpose at all.

threethirtytwo · 2026-03-29T00:33:44 1774744424

Who said LLMs can’t push boundaries either?

People can use other people as tools. An LLM being a tool does not preclude it from replacing people.

Ultimately it’s a volume problem. You need at least one person to initialize the LLM. But after that, in theory, a future LLM can replace all people with the exception of the person who initializes the LLM.

tossandthrow · 2026-03-29T03:44:55 1774755895

The initialization problem is solved - maybe the next Nobel price will be given to a Mac mini.

madrox · 2026-03-28T21:52:47 1774734767

> I've always said this but AI will win a fields medal before being able to manage a McDonald's.

I love this and have a corollary saying: the last job to be automated will be QA.

This wave of technology has triggered more discussion about the types of knowledge work that exist than any other, and I think we will be sharper for it.

bitwize · 2026-03-28T22:04:25 1774735465

The ownership class will be sharper. They will know how to exploit capital and turn it into more capital with vastly increased efficiency. Everybody else will be hosed.

madrox · 2026-03-29T01:05:43 1774746343

I'm not sure if people will be more hosed than before. Historically, what makes people with capital able to turn things into more capital is its ability to buy someone's time and labor. Knowledge labor is becoming cheaper, easier, and more accessible. That changes the calculus for what is valuable, but not the mechanisms.

tmoertel · 2026-03-29T11:34:30 1774784070

> Historically, what makes people with capital able to turn things into more capital is its ability to buy someone's time and labor.

You forgot to include resources:

What makes people with capital able to turn things into more capital is their ability to buy labor and resources. If people with more capital can generate capital faster than people with less capital, then (unless they are constrained, for example, by law or conscious) the people with the most capital will eventually own effectively all scarce resources, such as land. And that's likely to be a problem for everyone else.

madrox · 2026-03-29T17:50:33 1774806633

Fair, though I don’t see how AI is really changing the equation here

tmoertel · 2026-03-29T19:22:56 1774812176

AI doesn't change the equation; it makes the equation more brutal for people who don't have capital.

If you don't have capital, the only way to get it is by trading resources or labor for it. Most poor people don't have resources, but they do have the ability to do labor that's valued. But AI is a substitute for labor. And as AI gets better, the value of many kinds of labor will go towards zero.

If it was hard for poor people to escape poverty in the past, it's going to be even harder with AI. Unless we change something about the structure of society to ensure that the benefits of AI are shared with poor people.

madrox · 2026-03-29T22:14:08 1774822448

Ok, I'm following you. You're saying because labor gets cheaper it will be harder to make a living providing labor. Not disagreeing, but I wonder how much weight to give this argument. History shows a precedent of productivity revolutions changing the workforce, but not eliminating it, and lifting the quality of life of the population overall (though it does also create problems). Mixed bag with the arc bending towards betterment for all. You could argue that this moment is unprecedented in history, but unless the human spirit changes, for better or worse, we will adapt as we always have, rich and poor alike.

If the value of many kinds of labor go towards zero, those benefits also go to the poor. ChatGPT has a free tier. The method of escaping poverty will still be the same. Grow yourself. Provide value to your community.

bitwize · 2026-03-30T15:27:01 1774884421

Entire classes of workers have been put in the poorhouse on a near permanent basis due to technological changes, many tines during the past two centuries of industrial civilization. Without systemic structural changes to support the workforce this will happen/is already happening with AI.

zer00eyz · 2026-03-29T03:42:56 1774755776

There is a fundamental problem with this thinking, you are making an assumption about scale. There is the apocryphal quote "I think there is a world market for maybe five computers".

You have to believe that LLM scaling (down) is impossible or will never happen. I assure you that this is not the case.

DoctorOetker · 2026-03-29T00:56:23 1774745783

but what if we succeed in gamifying the latent knowledge in LLM's to upload it to our human brains, by some kind of speed / reaction game?

pfdietz · 2026-03-29T13:17:13 1774790233

> Any professional mathematician will tell you that their arsenal is ~ 10 tricks. If we can codify those tricks as latent vectors it's GG

And if we can train the systems to discover new tricks, whoa Nelly.

ryanar · 2026-03-28T22:36:32 1774737392

Are they actually producing new math? In the most recent ACM issue there was an article about testing AI against a math bench that was privately built by mathematicians, and what they found is that even though AI can solve some problems, it never truly has come up with something novel and new in mathematics, it is just good at drawing connections between existing research and putting a spin on it.

in-silico · 2026-03-29T02:06:26 1774749986

I'm not accusing you in particular, but I feel like there's a lot of circular reasoning around this point. Something like: AI can't discover "new math" -> AI discovers something -> since it was discovered by AI it must not be "new math" -> AI can't discover "new math"

For example, there was a recent post here about GPT-5.4 (and later some other models) solving a FrontierMath open problem: https://news.ycombinator.com/item?id=47497757

That would definitely be considered "new math" if a human did it, but since it was AI people aren't so sure.

parineum · 2026-03-29T04:14:14 1774757654

There is a kind of rubrik I use on stuff like this. If LLMs are discovering new math, why have I only read one or two articles where it's happening? Wouldn't it be happening with regularity?

The most obvious example of this thinking is, if LLMs are replacing developers, why us open ai still hiring?

specvsimpl · 2026-03-29T04:44:38 1774759478

I can only say that at family meetings, I hear people talk about contracting with a shop that used to have 4 web designers, but now it's 1 guy, delivering 4x faster than before.

So devs are being replaced.

ori_b · 2026-03-29T11:59:48 1774785588

Why aren't they delivering 4x more work? Does the world no longer need software?

Bombthecat · 2026-03-29T07:53:02 1774770782

Nah AI is not replacing people! /s

And other stories people tell themselves to sleep better at night

hodgehog11 · 2026-03-28T22:41:01 1774737661

It's finding constructions and counterexamples. That's different from finding new proof techniques, but still extremely useful, and still gives way to novel findings.

Yoric · 2026-03-29T09:49:59 1774777799

> I predict that in the future people will ditch LLMs in favor of AlphaGo style RL done on Lean syntax trees. These should be able to think on much larger timescales.

This is certainly my hope.

In my spare time, I'm slowly, very slowly, inching towards a prototype of something that could work like that.

slopinthebag · 2026-03-28T21:34:32 1774733672

> AI will win a fields medal before being able to manage a McDonald's

Of course, because it takes multi-modal intelligence to manage a McDonalds. I.e. it requires human intelligence.

> I predict that in the future people will ditch LLMs in favor of AlphaGo style RL

Same for coding as well. LLM's might be the interface we use with other forms of AI though.

vatsachak · 2026-03-28T21:40:32 1774734032

Something like building Linux is more akin to managing a McDonald's than it is to a 10 page technical proof in Algebraic Groups.

Programming is more multimodal than math.

Something like performance engineering might be free lunch though

hodgehog11 · 2026-03-28T22:52:28 1774738348

> Programming is more multimodal than math

I have no idea how you come to this conclusion, when the evidence on the ground for those training models suggests it is precisely the opposite.

We are much further along the path of writing code than writing new maths, since the latter often requires some degree of representational fluency of the world we live in to be relevant. For example, proving something about braid groups can require representation by grid diagrams, and we know from ARC-AGI that LLMs don't do great with this.

Programming does not have this issue to the same extent; arguably, it involves the subset of maths that is exclusively problem solving using standard representations. The issues with programming are primarily on the difficulty with handling large volumes of text reliably.

vatsachak · 2026-03-29T23:57:33 1774828653

Grid Diagrams can be specified (hopefully) through algebraic equations.

The way that most math is currently done is that someone provides an extremely specified problem and then one has to answer that extremely specified problem.

The way that programming is currently done is through constructing abstractions and trying to create a specification of the problem.

Of course I'm not saying we're close to creating a silicon Grothendieck (I think that Bourbaki actually reads like a codebase) but I'm saying that we're much closer to constructing algorithms that can solve specified programs as opposed to specifying underspecified problems

Think about the difference in specificity of

Prove Fermat's last theorem vs Build a web browser

zeroonetwothree · 2026-03-29T03:10:19 1774753819

I guess the comment you are replying to really meant to say “software engineering” not “programming”.

slopinthebag · 2026-03-29T01:09:04 1774746544

Nah, LLM's are solving unique problems in maths, whereas they're basically just overfitting to the vast amounts of training data with writing code. Every single piece of code AI writes is essentially just a distillation of the vast amounts of code it's seen in it's training - it's not producing anything unique, and it's utility quickly decays as soon as you even move towards the edge of the distribution of it's training data. Even doing stuff as simple as building native desktop UI's causes it massive issues.

slopinthebag · 2026-03-28T21:48:00 1774734480

Yeah, it's hard to compare management and programming but they're both multimodal in very different ways. But there's gonna be entire domains in which AI dominates much like stockfish, but stockfish isn't managing franchises and there is no reason to expect that anytime soon.

I feel like something people miss when they talk about intelligence is that humans have incredible breadth. This is really what differentiates us from artificial forms of intelligence as well as other animals. Plus we have agency, the ability to learn, the ability to critically think, from first principles, etc.

vatsachak · 2026-03-28T21:50:32 1774734632

Exactly. It's what the execs are missing.

Also animals thrive in underspecified environments, while AIs like very specific environments. Math is the most specified field there is lol

slopinthebag · 2026-03-29T00:39:58 1774744798

Oooh yeah that's really good framing. Humans have been building machines that outperform humans for hundreds of years at this point, but all in problems which are extremely well specified. It's not surprising LLM's are also great in these well specified domains.

One difference between intelligence and artificial intelligence is that humans can thrive with extremely limited training data, whereas AI requires a massive amount of it. I think if anybody is worried about being replaced by AI, they should look at maximising their economic utility in areas which are not well specified.

vatsachak · 2026-03-30T00:08:23 1774829303

Exactly. I would not want to have a pure math career or a performance engineering career in 10 years.

gottheUIblues · 2026-03-29T12:54:52 1774788892

So specified .. that it can actually prove it can't be completely specified by any single specification

vatsachak · 2026-03-30T00:07:17 1774829237

All mathematical statements we care about fall out of the purview of incompleteness

gottheUIblues · 2026-04-02T12:38:33 1775133513

To the contrary (as summarised by Gemini):

Gödel showed that arithmetic cannot prove everything about itself.

Turing showed that computers cannot predict everything about themselves.

Rice showed that we cannot automatically verify what programs will do.

Chaitin showed that mathematics is full of random, unprovable facts.

Lawvere showed that they are all failing for the exact same structural reason!

These are not fringe issues. They define the absolute boundaries of human and machine intelligence.

bitwize · 2026-03-28T22:12:40 1774735960

But LLMs have proven themselves better at programming than most professional programmers.

Don't argue. If you think Hackernews is a representative sample of the field then you haven't been in the field long enough.

What LLMs have actually done is put the dream of software engineering within reach. Creativity is inimical to software engineering; the goal has long been to provide a universal set of reusable components which can then be adapted and integrated into any system. The hard part was always providing libraries of such components, and then integrating them. LLMs have largely solved these problems. Their training data contains vast amounts of solved programming problems, and they are able to adapt these in vector space to whatever the situation calls for.

We are already there. Software engineering as it was long envisioned is now possible. And if you're not doing it with LLMs, you're going to be left behind. Multimodal human-level thinking need only be undertaken at the highest levels: deciding what to build and maybe choosing the components to build it. LLMs will take care of the rest.

abcde666777 · 2026-03-28T22:32:05 1774737125

A bit optimistic I'd say. It's put some software engineering within reach of some people who couldn't do it prior. Where 'some' might be a lot, but still far from all.

I was thinking the other day of how things would go if some of my less tech savvy clients tried to vibe code the things I implement for them, and frankly I could only imagine hilarity ensuing. They wouldn't be able to steer it correctly at all and would inevitably get stuck.

Someone needs to experiment with that actually: putting the full set of agentic coding tools in the hands of grandma and recording the outcome.

bitwize · 2026-03-28T23:45:06 1774741506

It's still going to take a knowledgeable person to steer an LLM. The point is that code written entirely by humans is finished as a concept in professional work—if you're writing it yourself you're not working efficiently or employing industry best practice.

abcde666777 · 2026-03-29T02:53:55 1774752835

I think it's dramatic to say it's the end of hand written code. That's like saying it's the end of bespoke suits. There are scenarios where carefully hand written and reviewed code are still going to have merit - for example the software for safety critical systems such as space shuttles and stations, or core logic within self-driving vehicles.

Basically when every single line needs to be reviewed extremely closely the time taken to write the code is not a bottleneck at all, and if using AI you would actually gain a bottleneck in the time spent removing the excess and superfluous code it produces.

And my intuition is that the line between those two kinds of programming - let's call them careful and careless programming to coin an amusing terminology - I think that line may not shrink as far back as some think, and I think it definitely won't shrink to zero.

specvsimpl · 2026-03-29T05:03:39 1774760619

You are aware of software verification? The AI can prove (mathematically) that its code implements the spec.

abcde666777 · 2026-03-29T09:39:16 1774777156

That just takes you back to the debate about the code being the spec.

986aignan · 2026-03-29T12:29:01 1774787341

The code lets you shoot yourself in the foot in a lot more ways than a spec does, though. Few people would make specs that include buffer overflows or SQL injection.

magicalist · 2026-03-29T15:32:59 1774798379

"and don't have any security vulnerabilities" isn't a spec though. As soon as you get specific you're right back in it.

slopinthebag · 2026-03-29T00:39:44 1774744784

That is akin to saying if you aren't using an IDE you are not working efficiently or employing industry best practice, which is insane when you consider people using Vi often run rings around people using IDEs.

AI usage is a useless metric, look at results. Thus far, results and AI usage are uncorrelated.

bitwize · 2026-03-29T18:20:21 1774808421

I keep hearing anecdata that suggest significant to huge productivity increases—"a task that would have taken me weeks now takes hours" is common. There is currently not a whole lot of research that supports that, however:

1) there hasn't been a whole lot of research into AI productivity period;

2) many of the studies that have been done (the 2025 METR study for example) are both methodologically flawed and old, not taking into account the latest frontier models

3) corporate transitions to AI-first/AI-native organizations are nowhere near complete, making companywide productivity gains difficult to assess.

However, it isn't hard to find stories on Hackernews from devs about how much time generative AI has saved them in their work. If the time savings is real, and you refuse to take advantage of it, you are stealing from your employer and need to get with the program.

As for IDEs, if you're working in C# and not using Visual Studio, or Java and not using JetBrains, then no—you are not working as efficiently as you could be.

slopinthebag · 2026-03-29T00:38:07 1774744687

Actually I will argue. Complex systems are akin to a graph, attributes of the system being the nodes and the relationships between those attributes being the edges. The type of mechanistic thinking you're espousing is akin to a directed acyclic graph or a tree, and converting an undirected cyclic graph into a tree requires you to disregard edges and probably nodes as well. This is called reductionism, and scientific reductionism is a cancer for understanding complex phenomena like sociology or economics, and I posit, software as well.

People and corporations have been trying for at least the last five decades to reduce software development to a mechanistic process, in which a system is understandable solely via it's components and subcomponents, which can then be understood and assembled by unskilled labourers. This has failed every time, because by reducing a graph to a DAG or tree, you literally lose information. It's what makes software reuse so difficult, because no one component exists in isolation within a system.

The promise of AI is not that it can build atomic components which can be assembled like my toaster, but rather that it can build complex systems not by ignoring the edges, but managing them. It has not shown this ability yet at scale, and it's not conclusive that current architectures ever will. Saying that LLM's are better than most professional programmers is also trivially false, you do yourself no favours making such outlandish claims.

To tie back into your point about creativity, it's that creativity which allows humans to manage the complexity of systems, their various feedback loops, interactions, and emergent behaviour. It's also what makes this profession broadly worthwhile to its practitioners. Your goal being to reduce it to a mechanistic process is no different from any corporation wishing to replace software engineers with unskilled assembly line workers, and also completely misses the point of why software is difficult to build and why we haven't done that already. Because it's not possible, fundamentally. Of course it's possible AI replaces software developers, but it won't be because of a mechanistic process, but rather because it becomes better at understanding how to navigate these complex phenomena.

This might be besides the point, but I also wish AI boosters such as yourself would disclose any conflict of interests when it comes to discussing AI. Not in a statement, but legally bound, otherwise it's worthless. Because you are one of the biggest AI boosters on this platform and it's hard to imagine the motivation of spending so much time hardlining a specific narrative just for the love of the game, so to speak.

bitwize · 2026-03-30T18:19:13 1774894753

> Saying that LLM's are better than most professional programmers is also trivially false, you do yourself no favours making such outlandish claims.

You grossly underestimate how awful one can be and still call themselves a "professional" in the field. Software engineering has effectively no standard certification of competence, which is part of why it's not actually an engineering field at all. So I stand by my statement that LLMs are better at writing code than most people working professionally as programmers. Again, Hackernews is not a representative sample, let alone the kind of programmer we admire and view as authoritative here on Hackernews. Most programmers require considerable oversight as well as detailed standards to follow in order to produce work without gumming up a code base. If you want to know why so much enterprise stuff is so bloated with heavy frameworks and a twisty maze of best practices like OO, SOLID, GoF patterns, etc. it's for this reason. The LLMs have access to a vast (if compressed/summarized) repository of knowledge about programming problems and commonly employed solutions in a variety of languages, and the ability to draw upon it instantly. Most humans, including myself, do not.

Anyway, as Tim Bryce observed in 2005, based on his father Milt's work in the 70s, most of the creativity and human in software development happens in the business/systems analysis phase, not programming, at least if you're employing a structured, rigorous, proven methodology. Milt Bryce turned systems design from an art into a proven, repeatable science, and with that a view of programming that's largely mechanistic. "There are very few true artists in programming; most programmers are just house painters."

> This might be besides the point, but I also wish AI boosters such as yourself would disclose any conflict of interests when it comes to discussing AI.

I'm not boosting squat. I'm telling it like it is, and talking about decisions in our field that have already been made. It is no longer up for debate that AI use is an integral part of software engineering now, and writing code "the old way", in an editor with maybe autocomplete, refactoring tools, etc., will soon go the way of punchcards. The business class that actually runs things has already decided this. If you're getting suspicious and demanding conflict-of-interest disclosures from someone who spells this out, your understanding is out of date.

kelseyfrog · 2026-03-28T23:36:16 1774740976

As of now, no models have solved a Millennium Prize Problem[1].

1. https://mppbench.com/

raincole · 2026-03-29T05:31:39 1774762299

Most Fields medals winners haven't either, except one.

utopcell · 2026-03-29T05:15:20 1774761320

This is the real Litmus test isn't it? There will be a deafening silence from critics when AI decides P vs NP.

3abiton · 2026-03-28T23:34:21 1774740861

It will be heavily still reliant onexpert human input and interactions. Knuth is an expert, and know how to guide.

smokel · 2026-03-28T21:21:25 1774732885

I think this is mostly about existing legislature, not about technology.

In any other context than when your paycheck depends on it, you would probably not be following orders from a random manager. If your paycheck depended on following the instructions of an AI robot, the world might start to look pretty scary real soon.

jfim · 2026-03-29T03:04:01 1774753441

> If your paycheck depended on following the instructions of an AI robot, the world might start to look pretty scary real soon.

That's already the case, minus AI, for gig workers. Their only agency is to accept or decline a ride/delivery, the rest is follow instructions.

throw3747488 · 2026-03-28T22:47:22 1774738042

AI actually has to follow all rules, even the bad rules. Like when autonomous car drives super carefully.

Imagine mcdonald management would enforce dog related rules. No more filthy muppets! If dog harasses customers, AI would call cops, and sue for restraining order! If dog defecates in middle of restaurant, everything would get desinfected, not just smeared with towels!

Nutters would crucify AI management!

vatsachak · 2026-03-28T21:26:50 1774733210

There's a lot to being a manager

- Coherent customer interaction

- Common sense judgements

- Scheduling

- Quality control

All which are baked into humans but not so much into LLMs

Even if it were legal to have an LLM as a GM, I think it would fair poorly

NamlchakKhandro · 2026-03-28T21:34:54 1774733694

I've never seen you say that

vatsachak · 2026-03-28T21:38:06 1774733886

You will have to take my word that I started saying this in Dec 2024 lol