I’ve noticed a lot of people here talking about the high cost of fine-tuning and running models like Llama and Mistral.
I’m exploring an idea: what if you could upload your dataset, fine-tune a model, and get back a hosted endpoint, but running on idle/off-peak GPUs for ~70% less than AWS/OpenAI rates?
Curious to hear from others here:
– Is cost the main blocker for you, or is it complexity/reliability?
– If something like this existed, would you try it?
– What would make it useful (or useless) for your projects?
I’m not selling anything. I'm just testing the waters and would love feedback.