Which GPU cloud provider are you actually using for inference?

Hey everyone,

I’ve been looking at providers like RunPod, Vast.ai, Lambda Labs, and a few others, and every time I need GPU capacity I end up spending way too much time comparing them. Prices change, availability changes, and it’s hard to know which providers are actually reliable in practice.

I’m working on a tool that recommends a provider based on your specific use case (model, workload, region, priorities, etc.) instead of just showing a list of prices.

Before I invest more time into it, I’d love to hear how people are handling this today:

- Which provider are you currently using, and what made you choose it?
- Do you regularly switch providers, or mostly stick with one?
- What’s the most frustrating part of choosing a GPU cloud provider?

Any real-world experiences would be super helpful. Thanks!
