
List models

GET /v1/models returns the OpenAI list shape: object: "list" plus a data array of { id, object, created, owned_by, tier } entries. Only GET /v1/models itself (no extra path segments) is public in middleware. Without credentials, the resolver filters the list as the free plan. With X-API-Key or Authorization: Bearer (an rk_ key or an access JWT), the plan comes from the key or token, so the list matches what you can call.
curl -sS https://api.routing.run/v1/models
curl -sS https://api.routing.run/v1/models \
  -H "X-API-Key: ${ROUTING_RUN_API_KEY}"
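The same two calls can be sketched in Python with only the standard library. The names build_models_request, model_ids, and list_models are illustrative and not part of any official SDK; the code assumes the response body has the list shape described above.

```python
import json
import urllib.request

BASE_URL = "https://api.routing.run"


def build_models_request(api_key=None):
    """Build a GET /v1/models request; the key is optional for this endpoint."""
    headers = {}
    if api_key:
        # With a key, the list is filtered by the key's plan rather than the free plan.
        headers["X-API-Key"] = api_key
    return urllib.request.Request(f"{BASE_URL}/v1/models", headers=headers, method="GET")


def model_ids(payload):
    """Extract the model ids from a list-shaped response body."""
    if payload.get("object") != "list":
        raise ValueError("expected a response with object == 'list'")
    return [entry["id"] for entry in payload.get("data", [])]


def list_models(api_key=None, timeout=10):
    """Send the request and return the model ids (performs a network call)."""
    with urllib.request.urlopen(build_models_request(api_key), timeout=timeout) as resp:
        return model_ids(json.load(resp))
```

Calling list_models() without a key should mirror the first curl command, and list_models(os.environ["ROUTING_RUN_API_KEY"]) the second.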

Get model details

GET /v1/models/ followed by a model id requires middleware auth (same headers as chat). An unknown id returns the JSON body {"error":"Model not found"}.
curl -sS "https://api.routing.run/v1/models/route/deepseek-v3.2" \
  -H "X-API-Key: ${ROUTING_RUN_API_KEY}"
The success payload includes id, object, created, owned_by, tier (heuristic lite vs premium from config buckets), allowed (bool for your resolved plan), and providers (the ordered upstream list from config).
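A minimal sketch of handling both outcomes on the client side, assuming the body has already been decoded from JSON. ModelDetails, ModelNotFound, and parse_model_details are hypothetical names introduced here, and the element type of providers is left open because the text above does not specify it.

```python
from dataclasses import dataclass
from typing import Any, List


class ModelNotFound(Exception):
    """Raised when the API answers with {"error": "Model not found"}."""


@dataclass
class ModelDetails:
    id: str
    tier: str             # "lite" or "premium", per the heuristic described above
    allowed: bool         # whether the resolved plan may call this model
    providers: List[Any]  # ordered upstream list; element type is not documented here


def parse_model_details(body: dict) -> ModelDetails:
    """Turn a decoded details response into ModelDetails or raise ModelNotFound."""
    if body.get("error") == "Model not found":
        raise ModelNotFound(body["error"])
    return ModelDetails(
        id=body["id"],
        tier=body["tier"],
        allowed=bool(body["allowed"]),
        providers=list(body["providers"]),
    )
```

Checking allowed before sending a chat request avoids a round trip for a model the current plan cannot call.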

Available models

This table lists the published chat model IDs. Authoritative access for your workspace is whatever GET /v1/models returns when authenticated with your rk_ key (your plan may expose only a subset).

Chat models

Model | Source | Description
route/deepseek-r1 | DeepSeek | DeepSeek R1 reasoning
route/deepseek-v3.2 | DeepSeek | DeepSeek V3.2 chat
route/deepseek-v3.2-speciale | DeepSeek | DeepSeek V3.2 Speciale
route/glm-4.7 | Zhipu | GLM-4.7
route/glm-4.7-flash | Zhipu | GLM-4.7 Flash
route/glm-5 | Zhipu | GLM-5
route/glm-5-highspeed | Zhipu | GLM-5 Highspeed
route/glm-5.1 | Zhipu | GLM-5.1
route/glm-5.1-precision | Zhipu | GLM-5.1 Precision
route/kimi-k2.5 | Moonshot | Kimi K2.5
route/kimi-k2.5-highspeed | Moonshot | Kimi K2.5 Highspeed
route/minimax-m2.5 | MiniMax | MiniMax M2.5
route/minimax-m2.7 | MiniMax | MiniMax M2.7
route/minimax-m2.7-highspeed | MiniMax | MiniMax M2.7 Highspeed
route/qwen3.5-9b | Qwen | Qwen3.5 9B
route/qwen3.5-397b-a17b | Qwen | Qwen3.5 397B A17B
route/qwen3.5-plus | Qwen | Qwen3.5 Plus
route/qwen3.6-plus | Qwen | Qwen3.6 Plus
route/qwen3.6-plus-preview | Qwen | Qwen3.6 Plus Preview
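Because the authenticated GET /v1/models response is the authoritative source, a client can compare the ids it plans to use against that response before relying on the table. The sketch below assumes a decoded list-shaped body; split_by_access is a hypothetical helper name.

```python
def split_by_access(wanted_ids, list_payload):
    """Partition wanted ids into (usable, missing) based on a /v1/models body."""
    available = {entry["id"] for entry in list_payload.get("data", [])}
    usable = [model for model in wanted_ids if model in available]
    missing = [model for model in wanted_ids if model not in available]
    return usable, missing
```

Both returned lists keep the order of wanted_ids, so the first usable entry can serve as a preferred default.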

Pricing

GET /v1/pricing is not on the anonymous path list, so pass X-API-Key or Authorization: Bearer for middleware to let the request through. The handler returns static config (the same table for any authenticated caller):
curl -sS https://api.routing.run/v1/pricing \
  -H "X-API-Key: ${ROUTING_RUN_API_KEY}"
{
  "models": [
    {
      "model": "route/deepseek-v3.2",
      "display_name": "DeepSeek V3.2",
      "tier": "free",
      "input_per_million": 0.0,
      "output_per_million": 0.0
    }
  ]
}
Numeric rates come from the deployed model_pricing config; see the dashboard for what you are actually charged after multipliers.
Inference usage is metered per request. Daily caps and credits depend on your plan tier; see the dashboard for current limits.
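As a rough illustration, the listed rates can be turned into a pre-multiplier estimate. This sketch assumes input_per_million and output_per_million are rates per one million tokens, which the field names suggest but the text above does not state; estimate_list_cost is a hypothetical helper, and the result is not what the dashboard will bill.

```python
def estimate_list_cost(pricing, model, input_tokens, output_tokens):
    """Estimate cost from a decoded /v1/pricing body, before any plan multipliers.

    Assumes rates are per 1,000,000 tokens; the currency is whatever the config uses.
    """
    for entry in pricing.get("models", []):
        if entry["model"] == model:
            total = (
                input_tokens * entry["input_per_million"]
                + output_tokens * entry["output_per_million"]
            )
            return total / 1_000_000
    raise KeyError(f"no pricing entry for {model}")
```

For the sample response above, any token counts for route/deepseek-v3.2 yield 0.0 because both rates are zero.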

Image generation

Model | Source | Description
route/minimax-image-1 | MiniMax | MiniMax Image Generation
See Image generation for details.