More

nmitchko · 2026-05-07T18:32:14 1778178734

A fantastically simple solution to improving algorithms, I wish I had this years ago in activation engineering: https://blog.n.ichol.ai/llm-activation-engineering-an-easy-f...

How do I access AlphaEvolve?

Yokohiii · 2026-05-07T18:35:25 1778178925

This is just a flex post. Be a billion dollar company or get out.

charleshn · 2026-05-08T01:24:18 1778203458

They'll likely make it available at some point, but for now one can use OpenEvolve [0] which is not quite as good but should be a good start to use the same LLM-driven evolutionary framework.

[0] https://github.com/algorithmicsuperintelligence/openevolve

petra · 2026-05-08T03:50:50 1778212250

There's also: https://github.com/inter-co/science-codeevolve and https://www.turintech.ai/

Yokohiii · 2026-05-08T02:44:28 1778208268

Your link seems completely unrelated. Why would you suggest that?

charleshn · 2026-05-08T02:56:46 1778209006

Not sure what you mean: OpenEvolve is an open source implementation of AlphaEvolve: https://huggingface.co/blog/codelion/openevolve

Yokohiii · 2026-05-08T11:34:07 1778240047

Thanks, wasn't really apparent on the github page.

nmitchko · 2026-04-07T13:26:29 1775568389

I used those two in combination to fix pain after 3x surgeries to repair a torn pec + infection. They work and helped me heal from being at a 3/10 constant pain down to baseline.

Not something I would do at any point for fun. But anecdotally, it's materially better than other alternatives offered/available.

nmitchko · 2026-02-26T15:40:36 1772120436

Can someone make a startup that allows me to do this as an individual?

loeg · 2026-02-26T16:02:00 1772121720

Join Bluesky and you too can lie about whatever you want.

swiftcoder · 2026-02-26T17:03:18 1772125398

It's called a "farm" (note the quotes). You may need a few acres of very cheap rural land, and some chickens. The IRS loves chickens

xnx · 2026-02-26T17:11:20 1772125880

> The IRS loves chickens

Or even just bees: https://mountainsweethoney.com/state-guide-for-beekeeper-tax...

candiddevmike · 2026-02-26T15:43:32 1772120612

Join that startup as a founder, have a million+ exit and you will have the capability to do this as an individual.

pimlottc · 2026-02-26T15:47:48 1772120868

Don't be poor, got it.

yoyohello13 · 2026-02-26T15:51:42 1772121102

Good life advice in general really.

palmotea · 2026-02-26T15:57:06 1772121426

Don't know why so many people are so stupid they don't follow such simple and sensible advice. /s

havefunbesafe · 2026-02-26T16:04:18 1772121858

Effective exit rate tax is around 24%

wang_li · 2026-02-26T16:27:29 1772123249

You don't need a startup. Millions of people have an effective tax rate that is 0% and they have a net tax rate that is negative. They do this simply by having no meaningful skills or knowledge.

dboreham · 2026-02-26T16:03:06 1772121786

Individual Meta employees and shareholders couldn't do this either.

terminalshort · 2026-02-26T17:02:52 1772125372

But they can. Any Meta employee or shareholder is also free to go on Bluesky and tell lies about taxes.

TiredOfLife · 2026-02-26T16:43:58 1772124238

You can make stuff up even on this site.

nmitchko · 2025-10-10T21:36:02 1760132162

In case anyone wants to do this themselves, check out the pipeline here: https://github.com/isc-nmitchko/iris-document-search

Colnomic and nvidia models are great for embedding images and MUVERA can transform those to 1D vectors.

losteric · 2025-10-11T00:40:56 1760143256

> check out the pipeline here

“the pipeline” - seems like this is just a personal hackathon project?

Why these models vs other multimodals? Which “nvidia models”?

nmitchko · 2025-09-22T19:57:09 1758571029

Next steps for AI in general:

  - additional modalities
  - Faster FPS (inferences per second)
  - Reaction time tuning (latency vs quality tradeoff) for visual and audio inputs/outputs
  - built-in planning modules in the architecture (think premotor frontal lobe)
  - time awareness during inference (towards an always inferring / always learning architecture)

nmitchko · on Oct 18, 2024

Interesting they don't compare to open-bio. Page 7 charts are quite weak.

https://huggingface.co/aaditya/Llama3-OpenBioLLM-70B

st-at-picnic · on Oct 18, 2024

Steve here, one of the co-authors. Totally valid on OpenBio. I will say that comparison numbers for this paper were such a challenge, in part because we found that a lot of the LLMs on the Medical LLM leaderboard struggled to follow even slight changes in instructions. On one hand it felt inaccurate to just print '[something very low]% Accuracy' on structuring/abstraction tasks and call it a day, but it also seemed like the amount of engineering effort needed to get non-trivial results from those LLMs was saying something important about how they worked.

I think that's especially true when you look at how well GPT-4o worked out of the box -- it makes clear what you get from the battle-hardening that's done to the big commercial models. For the numbers we did include, the thought was that was the most meaningful signal was that going from 8B to 70B with Llama3 actually gives you a lot in terms of mitigating that brittleness. That goes a step towards explaining the story of what we're seeing, moreso than showing a bunch of comparison LLMs fall over out of the box.

In the end, we presented those models that did best with light tuning and optimization (say a week's worth of iteration or so). I anticipate that we'll have to expand these results to include OpenBio as we work through the conference reviewer gauntlet. Any others you think we definitely should work to include? Would definitely be helpful!

nmitchko · on Oct 18, 2024

No other models that are public worth comparing to... Hippocratic advertises good benchmarks but that might be marketing fluff.

Have you checked out dataset building with nemotron? The nemotron synthetic data builder is quite powerful.

Moreso, check out model merging. It's possible if you merge some of your model against llama3.1 base it may perform much better.

Check out max labonne's work on hugging face

nmitchko · on July 17, 2024

We're excited to share pitchpilot with the HN community. Our beta users have found the embedded audio particularly useful for enterprise sharing. We're keen to keep improving, and our mission is to make communication easier.

In the roadmap is adding video export, digital twin presentations, and real-time presentations. We don't wrap a public LLM, so we don't share any data.

nmitchko · on July 7, 2024

Given that Generative AI can now read brain scans [1] and this, I wonder how far away we are from "you thought negatively about something, the authorities are on their way".

[1] -- https://www.biorxiv.org/content/10.1101/2022.11.18.517004v3

llamaimperative · on July 7, 2024

Well we’re not infinitely far away from it, which is why we need to build political and legal systems that can respect human dignity even in the presence of such technologies.

Be sure to vote :)

hcfman · on July 7, 2024

He is going to build these ? The same people that are build systems expressly to avoid accountability?

belter · on July 7, 2024

The EU will want to scan your brain...for the children....

nmitchko · on June 5, 2024

Tin-foil hat time:

1. First, models will predict pollution. The outcomes will help shape urban policy. But these won't solve crime or stop people from driving.

2. Second, models will predict individual behavior and track person level emissions. The outcomes will force behavior changes, mostly freedom limiting.

3. Third, and finally, models will predict thoughts. The the thought of driving instead of walking might trigger a response.

It's a slippery slope and we need to draw a line between prediction and policy.

mpalmer · on June 5, 2024

That is some heavy-duty foil in your hat there.

Even allowing for the ridiculously massive technical leap from 1 to 2 and then 2 to 3, it doesn't make much sense.

For one thing, if states are determined to enforce individual emissions limits, they can do it today with legislation. You don't need a predictive model. What does the model add?

Also, the only difference between 2 and 3 is whether a person acts on a thought.

So are you suggesting with #3 that predicted thoughts (e.g. not literal mind reading) which a person doesn't act upon will prompt state action?

MSFT_Edging · on June 5, 2024

Why is it that freedom is always tied to the right to pollute as much as possible, as opposed to the right to live in a world with low pollution?

mpalmer · on June 5, 2024

Using the unqualified word "freedom" has an ambiguity that political actors exploit. Freedom to do something is entirely separate to "free to live in a world where ___".

To be honest, I feel the latter sense of the word is a bit of a stretch - semantically, not politically.

But you see it because "freedom" is a powerful word in politics, and rather than argue against "freedom", pundits go up the ladder of abstraction and argue the definition instead.

MSFT_Edging · on June 5, 2024

Sorry, that question was rhetorical to point out the sillyness of equating driving to freedom.

mpalmer · on June 5, 2024

Ah. Well I hope my answer was useful for anyone who didn't take it as rhetorical!

userbinator · on June 5, 2024

Indeed this is the another thing pushing us towards dystopia. Now it's "climate change" . Previously it was drugs and terrorism.

nmitchko · on May 23, 2024

How does this compare to ehealthexchange or other qhins that have many years of experience and charge lower costs?

dgoncharov · on May 23, 2024

> How does this compare to ehealthexchange

Good question! eHealth Exchange (eHEX) is one of 3 national HIEs that we connect to (currently through Carequality). eHEX is mainly focused on connecting to state-level regional HIEs, which cover a different portion of providers than CommonWell, or Carequality do.

For example, Cerner is a major EHR vendor (used by the VA and others) whose data can only be accessed through CommonWell, since they don't participate in other HIEs.

> that have many years of experience

Relatively speaking, modern HIEs are a relatively new concept (Carequality was founded in 2014) - so extra years of experience doesn't necessarily add any value, and usually just results in more legacy tech to deal with!

> charge lower costs?

This isn't necessarily true - since you brought up eHEX, see their pricing page: https://ehealthexchange.org/pricing-payers-vendors-and-for-p...

TL;DR just to get started it's going to cost you $20k + some months to integrate, $12.5k/yr as the base membership fee (up to $400k if you make a lot of money!), and they charge a per-query price.

The caveat here is per-query in eHEX, isn't what a query is in Metriport. They literally mean every single query (remember the HTTP requests to thousands of endpoints to find patient records, each one of those would be a query). So, if you want to integrate with eHEX only to get limited, messy C-CDA data, then you're looking at paying ~$0.80 per full record retrieval for a patient with 2k documents.