David Robertson took a quantization challenge designed for CUDA experts, and solved it in Mojo with AI assistance, and ended up 1.07x to 1.84x faster than the state-of-the-art C++/CUDA implementation.
Co-founder here. There isn't any signup - that was 2+ years ago and we've been iterating a lot with the community and listening to feedback - which has been wonderful. Go freely and install with Pip, UV, Pixi etc -> https://docs.modular.com/mojo/manual/install
FWIW I didnt take the blog as a dunk on CUDA, just as an impressive outcome from the blog writer in Mojo. It's awesome to see this on Hopper - if it makes it go faster thats awesome.
So there is highly efficient matrix transpose in Mojo
All three Mojo kernels outperform their CUDA counterparts, with the naive and swizzle kernels showing significant improvements (20.6% and 14.8% faster respectively), while the final optimized kernel achieves essentially identical performance (slightly better by 4.14 GB/s).
The "flag" here seemed innapropriate given that its true this implementation is indeed faster, and certainly the final iteration could be improved on further. It wasn't wrong to say 14% or even 20%.
Users of the site only have one control available: the flag. There's no way to object only to the title but not to the post, and despite what you say that title hit the trifecta: not the original title, factually incorrect, and clickbait. So I'm not that surprised it got flagged (even if I did not flag it myself).
Email the mods at [email protected]. There's a chance they'll remove the flag and re-up the post.
I do find it amusing that all comments are focused on the US. There are hundreds of countries around the world were TV stations are black boxes and presidential / prime ministerial hopefuls have no platform to express themselves. Having access to a platform that can promote and answer content as users are searching for answers on them (at least how it reads on face value) seems incredibly important. I do however agree that transparency around who gets access to the platform is critical for user trust.
Well consider the number of 3rd party services that would shut or require a significant amount of development to get back online as a result of google shutting down:
Thanks for the question. We try to make the transaction process as fast as possible. As long as you ship your device the same day you do your swap, you can get your new device as fast as 2 days later. USPS Priority Mail shipping is used both ways and is free of charge. The time also varies based on location.