More

thinkling · 2026-03-06T19:21:35 1772824895

Agreed, would really like to understand what this (setting the LLM up to assume a role to improve performance) is doing under the cover and why it works.

Why aren't the labs training models to pick a mantra appropriate to the task and do this themselves? "Huh, a database question. I am going to pretend I'm a database expert with lots of experience. OK, here we go!"

thinkling · 2026-03-03T23:49:22 1772581762

My read was roughly that agents require constraining scaffolding (CLAUDE.md) and careful phrasing (prompt engineering) which together is vaguely like working in a DSL?

thinkling · 2026-03-02T18:31:05 1772476265

Many apps are missing many keyboard shortcuts that you may be used to if you’ve used the equivalent on the desktop. You’ll need to keep the iPad screen accessible to tap on UX elements. There’s also the issue that shortcuts that do exist may be hard to discover because there’s no menu bar to look in.

spectre3d · 2026-03-02T21:43:43 1772487823

> Many apps are missing many keyboard shortcuts that you may be used to

This is true. To see the ones that are available, hold down the command ⌘ key to get a scrollable list of all of the shortcuts for the app you’re currently using, and use Fn-m or globe key-m to see a list of the system shortcuts.

thinkling · 2026-02-25T23:47:38 1772063258

I don't think hybrids use skateboard designs the way EVs do? The battery for a hybrid is so much smaller, they usually steal space under the rear seat and/or in the trunk afaik.

thinkling · 2026-02-24T01:12:43 1771895563

I fondly remember running PageMaker on my MacPlus. It was as mindblowing as HyperCard was.

thinkling · 2026-02-17T19:11:04 1771355464

For comparisonI think the current leader in pelican drawing is Gemini 3 Deep Think:

https://bsky.app/profile/simonwillison.net/post/3meolxx5s722...

konart · 2026-02-17T19:34:12 1771356852

My take (also Gemini 3 Deep Think): https://gemini.google.com/share/12e672dd39b7

Somehow it's much better now.

jazzyjackson · 2026-02-17T19:51:38 1771357898

I’m not familiar with Gemini, isn’t this just a diffusion model output? The Pelican test is for the llm to produce SVG markup.

konart · 2026-02-17T20:01:04 1771358464

Yeah, I was so amazed by the result I didn't even realize Gemini used Nano Banana while producing the result.

badc0ffee · 2026-02-18T17:53:46 1771437226

The point of the penny-farthing is that you drive the front wheel directly with the pedals, but this seems to have the pedals in a spot where they would drive a chain, although there is no chain?

kingbob000 · 2026-02-17T23:38:38 1771371518

Is that actually better? That pelican has arms sprouting out of its wings

thinkling · 2026-02-16T19:08:29 1771268909

You can see up-thread that the same model will produce different answers for different people or even from run to run.

That seems problematic for a very basic question.

Yes, models can be harnessed with structures that run queries 100x and take the "best" answer, and we can claim that if the best answer gets it right, models therefore "can solve" the problem. But for practical end-user AI use, high error rates are a problem and greatly undermine confidence.

thinkling · 2026-02-09T21:10:39 1770671439

Most importantly, Slack limits the amount of message history you get to keep if you’re not paying. And the payment plans are per-user fees which quickly becomes non-viable for non-commercial use.

thinkling · 2026-02-04T22:10:26 1770243026

Ideally, ethical buyers would cause the market to line up behind ethical products. For that to be possible, we have to have choices available to us. Seems to me Anthropic is making such a choice available to see if buyers will line up behind it.

fogzen · 2026-02-05T00:46:48 1770252408

“Ideally” is doing a lot of heavy lifting here.

thinkling · 2026-01-28T17:33:56 1769621636

The WF store I frequent has lousy cell reception, so add th step “open Settings app and get on store’s wifi” (and who knows what all that lets them track).