Model Landscape

When evaluating models, it helps to compare them along a few working dimensions:

  • capability
  • latency
  • cost
  • context window
  • tool use reliability
  • instruction-following quality

For most production use cases, the best model is rarely the most capable one in absolute terms. It is the one that satisfies the constraints your product actually has, such as a latency budget, a cost ceiling, or a minimum context window.
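
One way to make this concrete is to treat the dimensions above as hard constraints first and maximize capability only among the models that pass. The sketch below is a minimal illustration, not a recommended procedure: the model names, the scores, and the ModelProfile, Constraints, fits, and pick_model names are all hypothetical, and real values would come from your own evaluations and pricing.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class ModelProfile:
    """One candidate model, described along the working dimensions."""
    name: str
    capability: float             # 0-1 score from your own evals (hypothetical)
    p95_latency_ms: int           # measured on your own workload
    cost_per_1k_requests: float   # USD, hypothetical
    context_window: int           # tokens
    tool_use_reliability: float   # 0-1, fraction of well-formed tool calls
    instruction_following: float  # 0-1


@dataclass(frozen=True)
class Constraints:
    """Product constraints; a model must satisfy all of them."""
    max_p95_latency_ms: int
    max_cost_per_1k_requests: float
    min_context_window: int
    min_tool_use_reliability: float = 0.0
    min_instruction_following: float = 0.0


def fits(model: ModelProfile, c: Constraints) -> bool:
    """Return True if the model satisfies every constraint."""
    return (
        model.p95_latency_ms <= c.max_p95_latency_ms
        and model.cost_per_1k_requests <= c.max_cost_per_1k_requests
        and model.context_window >= c.min_context_window
        and model.tool_use_reliability >= c.min_tool_use_reliability
        and model.instruction_following >= c.min_instruction_following
    )


def pick_model(
    candidates: Sequence[ModelProfile], c: Constraints
) -> Optional[ModelProfile]:
    """Filter by constraints, then take the most capable survivor."""
    eligible = [m for m in candidates if fits(m, c)]
    return max(eligible, key=lambda m: m.capability, default=None)


# Hypothetical candidates; every number here is made up for illustration.
CANDIDATES = [
    ModelProfile("large", 0.92, 4200, 30.0, 200_000, 0.95, 0.95),
    ModelProfile("medium", 0.84, 1500, 6.0, 128_000, 0.90, 0.92),
    ModelProfile("small", 0.71, 400, 0.8, 32_000, 0.80, 0.85),
]

# Example product: interactive feature with a 2 s latency budget.
chat_feature = Constraints(
    max_p95_latency_ms=2000,
    max_cost_per_1k_requests=10.0,
    min_context_window=64_000,
)

choice = pick_model(CANDIDATES, chat_feature)
print(choice.name if choice else "no model fits")
```

With these made-up numbers, the most capable candidate is excluded by the latency and cost limits, so the selection falls to the next one down. If no candidate passes, pick_model returns None, which is a signal to relax a constraint rather than to ignore it.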