- **Input tokens:** The text you send to the model (prompts, context, documents)
- **Output tokens:** The text the model generates (typically 2-4x more expensive per token than input)
- **Fine-tuning costs:** One-time training costs if you customize a model
- **Infrastructure costs:** For on-premise d