​LLMs in Production - A Costly Dilemma!

Published on ● Video Link: https://www.youtube.com/watch?v=Hne95kH5hxk



Duration: 50:45
364 views
8


When deploying machine learning models in production, there are three properties that are commonly desired: generalization, evaluation, and cost-optimality. We conjecture that for machine learning models, it is impossible to optimize all three. I will talk about our framework for cost modeling and evaluation for large language models (LLMs) and present our LLMOps production pipeline.







Tags:
deep learning
machine learning