The dupe detector works on full URL, not domain. The dupe detector only has effect within 8 hours of the same URL being submitted earlier, unless the earlier submission gets significant discussion then it has effect for 12 months.
Also, we only saw this because someone else emailed us. @replies don't work on HN. Please email us ([email protected]) to ensure we see reports like this, thanks!
Why bombarding him with a single kind of posts instead of showing him various things he might be interested in? That would give the algorithm a chance to learn faster and be more effective.
Besides that, with all the tracking Meta does around the web it’s fair to assume they have a more precise profile of the author they could have used
It wasn't given gcc source code, and was not given internet access. It the extent it could translate gcc source code, it'd need to be able to recall all of the gcc source from its weights.
All of this work is extraordinarily impressive. It is hard to predict the impact of any single research project the week it is released. I doubt we'll ever throw away GCC/LLVM. But, I'd be surprised if the Claude C Compiler didn't have long-term impact on computing down the road.
I occasionally - when I have tokens to spare, a MAX subscription only lasts so far - have Claude working on my Ruby compiler. Far harder language to AOT compile (or even parse correctly). And even 6 months ago it was astounding how well it'd work, even without what I now know about good harnesses...
I think that is the biggest outcome of this: The notes on the orchestration and validation setup they used were far more interesting than the compiler itself. That orchestration setup is already somewhat quaint, but it's still far more advanced than what most AI users use.
Me: Top 0.02%[1] human-level intelligence? Sure. But we aren't there yet.
[1] There are around 8k programming languages that are used (or were used) in practice (that is, they were deemed better than existing ones in some aspects) and there are around 50 million programmers. I use it to estimate how many people did something, which is objectively better than existing products.
reply