Managing GPU Rentals with Rsync: Workflow for Volatile Cloud Resources
svana.name·1d·
Discuss: Hacker News
Flag this post

The GPU rental market is a bit of a mess.

First, not even big cloud providers can guarantee GPU availability at all times and places. Sure, you could keep the VM running for months, but that would burn your budget quite quickly, especially if you are a solo developer or work in a small team.

Second, permanent storage is severely limited. Some providers offer persistent drives whose lifecycles are independent of the VMs. But these drives are still usually bound to a specific region. What if that region is currently out of GPUs? That drive is now essentially useless.

I’ve been thinking about how to organize my workflow around these limitations. Here is what I have come up with so far.

Rsync to the rescue

At my job, I have to fine-tune and train a lot of models. My company h…

Similar Posts

Loading similar posts...