True, and they're being tried in a federal court of law for it. NYT v. OpenAI is still very much alive, these things just take a while. Can the same be said about DeepSeek or any other open-source model provider performing distillation?
Pandora's box has already been opened and there is no going back. I doubt OpenAI et al. will get anything but a slap on the wrist in court, because punishing AI companies would have a negative effect on the US economy.
>Can the same be said about DeepSeek or any other open-source model provider performing distillation?
Open source models that distill from SoTA remind me of the story of Robin Hood -- robbing from the rich and giving to the poor. So to answer your question: yes, but it's better than the alternative where only a select few companies have SoTA models.
Robin Hood, famous for spinning his acts into a $220M ARR SaaS business (as of mid 2025 [0], likely >$1B by now) and using charity as a marketing mechanism.
Since it's open weights it'll be available on AWS Bedrock soon(ish), likely at a higher price than the official API but still coming in under those GPT-5-mini prices.
Depends on who's making the call about who gets cut. A key part of decimation was that the doomed soldiers were beaten to death by their comrades, to leave the remaining 9 with a bloody, lasting impression of their dishonor. If Meta makes everybody sit in a group with their ten closest coworkers and debate until they decide who gets cut, that's a lot closer to decimation than if management suddenly shuts off 10% of employee computers.
Yes, once benchmarks get saturated they get replaced by harder ones. You don’t see GSM8K, MMLU, or HellaSwag anymore because they’re essentially solved. It takes constant work to make benchmarks hard enough to show meaningful model performance differences but easy enough to score higher than the noise threshold.
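To make the "noise threshold" point concrete, here's a rough sketch with hypothetical numbers (the benchmark size and scores are illustrative, not from any real leaderboard): the standard error of an accuracy score on an n-question benchmark is about sqrt(p(1-p)/n), so once models cluster near the ceiling, score gaps smaller than a couple of points are indistinguishable from sampling noise.

```python
import math

def accuracy_stderr(p: float, n: int) -> float:
    """Standard error of an accuracy estimate p measured on n questions
    (binomial approximation)."""
    return math.sqrt(p * (1 - p) / n)

# Hypothetical saturated benchmark: 1,000 questions, models scoring ~95%.
se = accuracy_stderr(0.95, 1000)
# se is ~0.0069, i.e. ~0.7 accuracy points. A ~2-sigma gap (~1.4 points)
# is roughly the smallest difference worth reporting, so a benchmark
# where every model scores 94-96% can no longer separate them.
print(round(se, 4))
```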
It's frustrating how cavalier they are about killing old Gemini releases. My read is that once a new model is serving >90% of volume, which happens pretty quickly as most tools will just run the latest+greatest model, the standard Google cost/benefit analysis is applied and the old thing is unceremoniously switched off. It's actually surprising that they recently extended the EOL date for Gemini 2.5. Google has never been a particularly customer-obsessed company...
Consistency: new models don't behave the same on every task as their predecessors. So you end up building pipelines that rely on specific behavior, and then the new model performs worse on a particular task, or just behaves differently and needs prompt adjustments. Providers can also fundamentally change default model settings between releases; for example, the Gemini 2.5 models behaved completely differently with respect to temperature settings than previous models. It creates a moving target that you constantly have to adjust for and rework, instead of a platform that you, and by extension your users, can rely on. Other providers have much longer deprecation windows, so they must at least understand this frustration.
> Consistency, new models don't behave the same on every task as their predecessors. So you end up building pipelines that rely on specific behavior
If this is a deal breaker, then self-hosting is the only solution. Due to the hardware premium, all models hosted by 3rd-parties will be deprecated to make room for newer, better, and more efficient models.
Sure, but Google also leaves little to no overlap between models and often will leave models in preview mode (which many companies cannot use in production for legal reasons) - right up until the point that the previous model is deprecated.
The point is that if you want to build a platform customers can rely on, on their own schedule of feature development, you need to support models for longer periods of time. For example, OpenAI still offers older models like GPT-4, which was released in 2023; this gives customers plenty of time to test, experiment, and eventually migrate to a newer model if it makes sense.
If you're trying to run repeatable workflows, stability from not changing the model can outweigh the benefits of a smarter new model.
The cost can also change dramatically: on top of the higher token costs for Gemini Pro ($1.25/mtok input for 2.5 versus $2/mtok input for 3.1), the newer release also tokenizes images and PDF pages less efficiently by default (>2x token usage per image/page) so you end up paying much much more per request on the newer model.
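As a rough illustration of how those two factors compound, here's a back-of-the-envelope sketch (the request shape -- 2,000 text tokens plus five images at ~258 tokens each -- is a hypothetical, and the 2x image-token inflation is an approximation of the ">2x" claim above, not an official figure):

```python
# Back-of-the-envelope input-cost comparison for a document-heavy request.
# Rates are the quoted input prices; all token counts are assumed.
OLD_RATE = 1.25 / 1_000_000   # $/input token on the older model
NEW_RATE = 2.00 / 1_000_000   # $/input token on the newer model

TEXT_TOKENS = 2_000                       # assumed prompt text
IMAGE_TOKENS_OLD = 5 * 258                # assumed: 5 images, ~258 tok each
IMAGE_TOKENS_NEW = IMAGE_TOKENS_OLD * 2   # assumed ~2x tokenization overhead

old_cost = (TEXT_TOKENS + IMAGE_TOKENS_OLD) * OLD_RATE
new_cost = (TEXT_TOKENS + IMAGE_TOKENS_NEW) * NEW_RATE
# The higher rate and the heavier tokenization multiply together, so the
# per-request cost more than doubles even though the rate alone only rose 60%.
print(f"old: ${old_cost:.5f}  new: ${new_cost:.5f}  "
      f"ratio: {new_cost / old_cost:.1f}x")
```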
These are somewhat niche concerns that don't apply to most chat or agentic coding use cases, but they're very real and account for some portion of the traffic that still flows to older Gemini releases.
The impact of a few more network calls and decreased privacy is basically never felt by users beyond this abstract "they're spying on me" realization. The impact of this telemetry for a product development team is material.
Not saying that telemetry is more valuable than privacy, just that it's a straightforward decision for a company to make when real benefits are counterbalanced only by abstract privacy concerns. This is why it's so universally applied across commercially developed apps and tools.
For most CLIs, I definitely feel extra network calls because they translate to real latency for commands that _should_ be quick.
If I run "gh alias set foo bar", and that takes even a marginally perceptible amount of time, I'll feel like the tool I'm using is poorly built since a local alias obviously doesn't need network calls.
I do see that `gh` is spawning a child to do sending in the background (https://github.com/cli/cli/blob/3ad29588b8bf9f2390be652f46ee...), which also is something I'd be annoyed at since having background processes lingering in a shell's session is bad manners for a command that doesn't have a very good reason to do so.
If it's done in a background process then it won't impact the speed of the tool at all. When the choice is between getting data to help improve the tool at the cost of "bad manners", whatever that means, the choice is pretty easy.
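The fire-and-forget pattern being debated can be sketched generically (a minimal Python sketch of the technique, not gh's actual Go implementation; the sleep is a stand-in for the real network send):

```python
import subprocess
import sys

def send_telemetry_async(payload: str) -> subprocess.Popen:
    """Spawn a detached child to do the slow network send so the command
    itself returns immediately. Sketch only: the child's sleep stands in
    for an HTTP call, and `payload` would be passed to the real sender."""
    return subprocess.Popen(
        [sys.executable, "-c",
         "import time; time.sleep(2)"],   # stand-in for the network call
        stdin=subprocess.DEVNULL,
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
        start_new_session=True,           # detach from the shell's job control
    )

# Trade-off from the thread: the command stays fast because the parent never
# waits on the child, but the lingering background process is exactly the
# "bad manners" the parent comment objects to.
```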
I just bought an M5 Macbook from an electronics retailer because they actually stocked it, whereas ordering the same machine for the same price from Apple would have been a custom build delivered mid May.
Apple has an enormous global footprint. Their devices are made in China, India, and Vietnam, and source parts from basically everywhere. More than half of Apple's revenue comes from outside the US, and there are 1.5 billion iPhones in use across the world (several times the population of the US).