Cheaper fine-tuning + hosting for Llama/Mistral: would this help anyone?

I’ve noticed a lot of people here talking about the high cost of fine-tuning and running models like Llama and Mistral.

I’m exploring an idea: what if you could upload your dataset, fine-tune a model, and get back a hosted endpoint — but instead of paying AWS/OpenAI rates, it ran on idle/off-peak GPUs for ~70% less?

Curious to hear from others here:
– Is cost the main blocker for you, or is it complexity/reliability?
– If something like this existed, would you try it?
– What would make it useful (or useless) for your projects?

I’m not selling anything, just testing the waters, and I would love feedback.

Hi there. To be honest, anything that mentions 70% less and hosted GPUs gets my instant attention, and I would be interested in such an option. A form of BYOGPU (in my mind, BuyYourOwnGPU), where we could buy older GPU rigs located in a datacenter and then pay for colocation, networking, and so on, could be an especially interesting model for us. Having the trained model hosted as an endpoint for a fair price would also be nice.