Model Landscape¶
When evaluating models, it helps to separate them into a few working dimensions:
- capability
- latency
- cost
- context window
- tool use reliability
- instruction-following quality
For most production use cases, the best model is rarely the most capable model in absolute terms. The best model is the one that fits the product constraint you actually care about.