A100 is available for $1.03 from GCP and V100 is available at $.27 per hour from...

jonathanlei · on July 29, 2022

Just a side note here, it seems that these are spot instances, whereas all our VMs are non-interruptible. So there is a bit of difference (e.g. you probably wouldn't do a weeklong Blender render on a GCP VM if it could be interrupted and lose your work, whereas you can definitely run it on TensorDock because our VMs are reserved).

Of course, you can set up data checkpointing to save your data, but overall, it is a bit of an extra hassle to run on spot/interruptible instances, and if you do get interrupted, you are wasting valuable time waiting for stock to free up again.
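
For illustration, here is a minimal sketch of what that checkpointing could look like for a frame-by-frame job, in Python. Everything named here (`run_job`, `render_frame`, the `next_frame` key, the JSON checkpoint file) is a hypothetical stand-in, not a Blender or cloud-provider API. The idea is just to record progress after each unit of work so a restarted instance resumes where it left off.

```python
import json
import os
import tempfile


def load_checkpoint(path):
    """Return the saved state, or a fresh state if no checkpoint exists yet."""
    if os.path.exists(path):
        with open(path) as f:
            return json.load(f)
    return {"next_frame": 0}


def save_checkpoint(path, state):
    """Write the state to a temp file, then rename it over the old checkpoint.

    The rename is atomic, so an interruption mid-write cannot leave a
    half-written checkpoint behind.
    """
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=directory)
    with os.fdopen(fd, "w") as f:
        json.dump(state, f)
    os.replace(tmp_path, path)


def run_job(path, total_frames, render_frame):
    """Run render_frame (hypothetical callback) for every frame not yet done."""
    state = load_checkpoint(path)
    for frame in range(state["next_frame"], total_frames):
        render_frame(frame)
        state["next_frame"] = frame + 1
        save_checkpoint(path, state)
    return state
```

If the instance is reclaimed partway through, calling `run_job` again with the same checkpoint path skips the finished frames. The checkpoint file has to live on storage that survives the instance (a persistent disk or a bucket), and at most one frame of work is lost per interruption.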

tinus_hn · on July 29, 2022

Kind of sad, because a Blender render is exactly the right use case: it doesn’t really matter that much if it takes a little bit longer.

wintermute9124 · on July 28, 2022

I have never seen this tool. Thanks for sharing!

The Alibaba price you cited is for interruptible/spot. For on-demand (uninterruptible) V100s, Oracle is the least expensive on that list at $1.275/hr.

freediver · on July 28, 2022

Thanks, I created this tool precisely to find the cheapest GPUs, then expanded it to everything else.

Interruptible is sufficient for model training, which is what these high-end GPUs are typically used for (and there you want the cheapest possible).

jonathanlei · on July 28, 2022

Wow, cool! Yes - interruptible can be very cheap... I'll add it to our backlog so we do that instead of idly mining.

I was wondering, do you happen to have an API for listing servers? We're launching a marketplace later in August (https://www.tensordock.com/product-marketplace), and we expect pricing to be really really cheap. Like #1 in industry cheap while retaining.

Interruptible, if we add that, would probably be even less than those prices listed.

It'd be really cool if we could auto-update the availability of our GPU servers through an API so that we can list our servers on your tool as well :)

freediver · on July 28, 2022

If your prices really end up being the cheapest in the industry, let me know and we can make it happen.

wintermute9124 · on July 28, 2022

Didn't realize you made this tool. It's super useful.

Some unsolicited feedback, if you're still actively developing:

- You should consider some of the lesser-known cloud providers (e.g. CoreWeave/Lambda Labs/TensorDock).

- Add information about whether the servers support NVLink/NVSwitch. For example, A100s come in 3 flavors: PCIe without NVLink bridges, PCIe with NVLink bridges, and SXM with NVLink/NVSwitch fabric.

freediver · on July 28, 2022

There are hundreds of smaller providers, each with a different API, if they have one at all, so this is not possible for a one-man operation. CloudOptimizer is already the largest cloud comparison tool on the web (12 cloud providers listed).