AI glossary
Inference
Running a model to get a response. As opposed to training the model. Most LLM cost in production is inference.
AI glossary
Running a model to get a response. As opposed to training the model. Most LLM cost in production is inference.