My LLM token bill is getting painful.Besides switching to cheaper models, what have you personally used to reduce cost in real applications?
My LLM token bill is getting painful.Besides switching to cheaper models, what have you personally used to reduce cost in real applications?