More

AndyNemmity · 2026-01-30T01:57:26 1769738246

My experience agrees with this.

Which is why I use a skill that is a command, that routes requests to agents and skills.

AndyNemmity · 2026-01-14T20:49:07 1768423747

Writing about AI so far, but who knows. Just started it.

AndyNemmity · 2026-01-12T04:39:24 1768192764

I use Hacker News commenters.

There was someone a while ago who made a funny post about the type of Hacker News commenters. So I have 5 of them that will review things, and ended up being way more effective than I ever imagined they'd be.

│ contrarian-provocateur-roaster │ Challenge premises, explore alternatives │ "Have you considered..."

│ enthusiastic-newcomer-roaster │ Accessibility, onboarding friction │ "Wait, how do I even..."

│ pragmatic-builder-roaster │ Operational reality, production concerns │ "This won't survive 3AM pages" │

│ skeptical-senior-roaster │ Long-term maintenance, sustainability │ "Who maintains this in 2 years?" │

│ well-actually-pedant-roaster │ Terminology precision, verifiability │ "Technically, that's not..." │

p1nkpineapple · 2026-01-13T10:19:03 1768299543

This sounds great :D feel like sharing the prompt?

AndyNemmity · 2026-01-14T16:58:16 1768409896

Here is one of the agents. I prefer large agents, so you can tweak it to your purposes. It also calls some of my skills and other pieces, but it will give you the "gist" of it.

https://gist.github.com/notque/e57cb975a3df7780824ce4085a59a...

AndyNemmity · 2026-01-02T18:27:48 1767378468

I keep having the same conversation with people struggling with Claude Code.

Someone tells me it "forgets" their instructions. Or it hallucinates fixes. Or it ignores the rules they put in CLAUDE.md. And when I ask what their setup looks like, it's always the same thing: a massive system prompt with every rule for every language, stuffed into context.

So I wrote up how I solve this.

AndyNemmity · 2026-01-01T00:50:42 1767228642

These are excellent every year, thank you for all the wonderful work you do.

tkgally · 2026-01-01T02:03:40 1767233020

Same here. Simon is one of the main reasons I’ve been able to (sort of) keep up with developments in AI.

I look forward to learning from his blog posts and HN comments in the year ahead, too.

password4321 · 2026-01-01T04:36:02 1767242162

Don't forget you can pay Simon to keep up with less!

> At the end of every month I send out a much shorter newsletter to anyone who sponsors me for $10 or more on GitHub

https://simonwillison.net/about/#monthly

AndyNemmity · 2025-12-30T04:18:47 1767068327

Exactly right, well said. None of these solutions work in this case for the reasons you outlined.

It will just as easily get around it by running it as a bash command or any number of ways.

AndyNemmity · 2025-12-30T02:07:17 1767060437

and put it in all caps, so it knows you mean business.

wellthisisgreat · 2025-12-30T08:24:30 1767083070

alarm emoji alarm emoji alarm emoji

AndyNemmity · 2025-12-30T02:02:48 1767060168

The funny part is, the vast majority of them are barely doing anything at all.

All of these systems are for managing context.

You can generally tell which ones are actually doing something if they are using skills, with programs in them.

Because then, you're actually attaching some sort of feature to the system.

Otherwise, you're just feeding in different prompts and steps, which can add some value, but okay, it doesn't take much to do that.

Like adding image generation to claude code with google nano banana, a python script that does it.

That's actually adding something claude code doesn't have, instead of just saying "You are an expert in blah"

austinbaggio · 2025-12-30T02:07:51 1767060471

It sounds like you've used quite a few. What programs are you expecting? Assuming you're talking about doing some inference on the data? Or optimizing for some RAG or something?

AndyNemmity · 2025-12-30T02:09:55 1767060595

An example of a skill i gave, adding image generation to nano banana.

another is one claude code ships with, using rip grep.

Those are actual features. It's adding deterministic programs that the llm calls when it needs something.

austinbaggio · 2025-12-30T02:38:17 1767062297

Oh got it - tool use

AndyNemmity · 2025-12-30T02:41:05 1767062465

Exactly. That adds actual value. Some of the 1000s of projects do this. Those pieces add value, if the tool adds value which also isn’t a given

troupo · 2025-12-30T08:17:16 1767082636

> You can generally tell which ones are actually doing something if they are using skills, with programs in them.

> Otherwise, you're just feeding in different prompts and steps

"skills" are literally just .md files with different prompts and steps.

> That's actually adding something claude code doesn't have, instead of just saying "You are an expert in blah"

It's not adding anything but a prompt saying "when asked to do X invoke script Y or do steps Z"

AndyNemmity · 2025-12-30T16:49:45 1767113385

Skill are md files, but they are not just that. They are also scripts. That's what adding things are. You can make a skill that is just a prompt, but that misses the point of the value.

You're packaging the tool with the skill, or multiple tools to do a single thing.

troupo · 2025-12-30T21:29:30 1767130170

In the end it's still an .md file pointing to a script that ends being just a prompt for the agent that the agent may or may not pick up, may or may not discover, may or may not forget after context compaction etc.

There's no inherent magic to skills, or any fundamental difference between them and "just feeding in different prompts and steps". It literally is just feeding different prompt and steps.

AndyNemmity · 2025-12-31T17:50:05 1767203405

I find in my experience that it's trivial to have the skill systematically call the script, and perform the action correctly. This has not been a challenge to me.

Also, the pick up or not pick up, or discover or may not discover is solved as well. It's handled by my router, which I wrote about here - https://vexjoy.com/posts/the-do-router/

So these are solved problems to me. There are many more problems which are not solved, which are the interesting space to continue with.

AndyNemmity · 2025-12-30T01:52:15 1767059535

Yeah, at a certainly level, it's just a ton of fun to do. I think that's why so many of us are playing with it.

It's also deeply interesting because it's essentially unsolved space. It's the same excitement as the beginning of the internet.

None of us know what the answers will be.

AndyNemmity · 2025-12-30T01:23:29 1767057809

It was happening significantly before the rise of AI. It's even more now.

I am not sure where exactly we are headed through all this, but I feel like overall having data be a shared commons has been beneficial.