List models
GET /v1/models returns the OpenAI list shape (object: "list", data array of { id, object, created, owned_by, tier }).
Only the exact path GET /v1/models, with no extra path segments, is public in the middleware.
Authenticated calls do not currently expand this list to your full plan tier. Treat it as the published public catalog, not as the authoritative answer for higher-tier access.
curl -sS https://api.routing.run/v1/models
curl -sS https://api.routing.run/v1/models \
-H "X-API-Key: ${ROUTING_RUN_API_KEY}"
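As a rough Python counterpart to the curl calls above, the sketch below fetches the list and pulls out the model ids. It uses only the standard library; the helper names `fetch_models` and `model_ids` are illustrative and not part of any official client, and the 10-second timeout is an arbitrary choice.

```python
import json
import urllib.request
from typing import List, Optional

BASE_URL = "https://api.routing.run"


def model_ids(payload: dict) -> List[str]:
    """Return the model ids from an OpenAI-style list payload."""
    if payload.get("object") != "list":
        raise ValueError("expected a payload with object == 'list'")
    return [entry["id"] for entry in payload.get("data", [])]


def fetch_models(api_key: Optional[str] = None) -> dict:
    """Call GET /v1/models; the key is optional because this path is public."""
    request = urllib.request.Request(f"{BASE_URL}/v1/models")
    if api_key:
        request.add_header("X-API-Key", api_key)
    with urllib.request.urlopen(request, timeout=10) as response:
        return json.load(response)
```

Calling `model_ids(fetch_models())` would list the public catalog; passing a key should not change the result, since authenticated calls do not expand the list.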
Get model details
GET /v1/models/{id} requires middleware auth (the same headers as chat). An unknown id returns {"error":"Model not found"} as JSON.
curl -sS "https://api.routing.run/v1/models/route/deepseek-v3.2" \
-H "X-API-Key: ${ROUTING_RUN_API_KEY}"
A successful response includes id, object, created, owned_by, tier, and allowed (a boolean that reflects your resolved plan).
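One way to check access from Python is sketched below. It assumes that the model id is placed in the path with its slash intact, as in the curl example, and that error responses carry a JSON body; the HTTP status code for an unknown id is not documented here, so the sketch reads the body rather than the status. The function names are hypothetical.

```python
import json
import urllib.error
import urllib.request
from typing import Optional

BASE_URL = "https://api.routing.run"


def model_url(model_id: str) -> str:
    """Build the details URL; the id keeps its slash, as in the curl example."""
    return f"{BASE_URL}/v1/models/{model_id}"


def is_allowed(payload: dict) -> Optional[bool]:
    """Return the allowed flag, or None for the documented not-found error."""
    if payload.get("error") == "Model not found":
        return None
    return bool(payload["allowed"])


def fetch_model(model_id: str, api_key: str) -> dict:
    """Call the details endpoint with the X-API-Key header."""
    request = urllib.request.Request(
        model_url(model_id), headers={"X-API-Key": api_key}
    )
    try:
        with urllib.request.urlopen(request, timeout=10) as response:
            return json.load(response)
    except urllib.error.HTTPError as err:
        # Assumption: the error response body is JSON.
        return json.load(err)
```

With a valid key, `is_allowed(fetch_model("route/deepseek-v3.2", key))` would give `True` or `False` for a known model and `None` for an unknown one.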
Available models
These are the current published chat model IDs from routing config.
| Route | Provider | Context |
|---|---|---|
| route/minimax-m2.5 | minimax/opencode | 100k |
| route/minimax-m2.7 | opencode/minimax | 100k |
| route/kimi-k2.5 | crof | 131k |
| route/kimi-k2.6 | crof | 131k |
| route/glm-5 | crof | 200k |
| route/deepseek-v3.2 | crof/chutes | 163k |
| route/deepseek-v4-pro | crof | 163k |
| route/qwen3.5-plus | opencode | 100k |
| route/mistral-large-3 | routing-inference | See model metadata |
| route/mistral-medium-2505 | routing-inference | See model metadata |
| route/mistral-small-2503 | routing-inference | See model metadata |
Plan groups
These groups show the current docs catalog by plan.
Free
| Model |
|---|
| route/kimi-k2.5 |
| route/glm-5 |
| route/deepseek-v3.2 |
| route/qwen3.5-9b |
| route/qwen3.5-397b-a17b |
| route/gemma-4-31b-it |
Lite adds
| Model |
|---|
| route/minimax-m2.5 |
| route/minimax-m2.7 |
| route/minimax-m2.7-highspeed |
| route/kimi-k2.5-highspeed |
| route/glm-5.1 |
| route/glm-4.7 |
| route/glm-4.7-flash |
Max adds
| Model |
|---|
| route/qwen3.5-plus |
| route/qwen3.6-plus |
| route/glm-5.1-precision |
| route/glm-5-highspeed |
| route/kimi-k2.6-precision |
| route/mimo-v2-omni |
| route/mimo-v2-pro |
| route/deepseek-v3.2-speciale |
| route/deepseek-r1 |
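The headings "Lite adds" and "Max adds" suggest that each plan inherits the models of the plan before it. The sketch below encodes that reading as an assumption and computes the full set per plan from the three tables above. Premium and Ultra are left out because this section does not say where they sit in the order.

```python
from typing import List

# Model ids copied from the Free, Lite, and Max tables above.
PLAN_ADDS = {
    "free": [
        "route/kimi-k2.5", "route/glm-5", "route/deepseek-v3.2",
        "route/qwen3.5-9b", "route/qwen3.5-397b-a17b", "route/gemma-4-31b-it",
    ],
    "lite": [
        "route/minimax-m2.5", "route/minimax-m2.7", "route/minimax-m2.7-highspeed",
        "route/kimi-k2.5-highspeed", "route/glm-5.1", "route/glm-4.7",
        "route/glm-4.7-flash",
    ],
    "max": [
        "route/qwen3.5-plus", "route/qwen3.6-plus", "route/glm-5.1-precision",
        "route/glm-5-highspeed", "route/kimi-k2.6-precision", "route/mimo-v2-omni",
        "route/mimo-v2-pro", "route/deepseek-v3.2-speciale", "route/deepseek-r1",
    ],
}

# Assumption: plans are cumulative in this order.
PLAN_ORDER = ["free", "lite", "max"]


def models_for_plan(plan: str) -> List[str]:
    """Return every model available on a plan, including inherited ones."""
    index = PLAN_ORDER.index(plan)  # raises ValueError for an unknown plan
    models: List[str] = []
    for name in PLAN_ORDER[: index + 1]:
        models.extend(PLAN_ADDS[name])
    return models
```

For per-key answers, the allowed field on the model details endpoint remains the better source; this table lookup only mirrors the docs catalog.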
Premium, Max, and Ultra
Mistral chat, TTS, STT, image, embedding, and rerank models are available on Premium, Max, and Ultra plans. See OpenAI compatibility, Audio, Image generation, Embeddings and rerank, and Limits.
| Model | Endpoint | Use case |
|---|---|---|
| route/cohere-embed-v3-english-3 | /v1/embeddings | English semantic search and RAG |
| route/cohere-embed-v3-multilingual-3 | /v1/embeddings | Multilingual semantic search and RAG |
| route/qwen3-embedding-8b | /v1/embeddings | General-purpose embeddings |
| route/cohere-rerank-v4.0-pro | /v1/rerank | Reranking search results and RAG contexts |
| route/mistral-large-3 | /v1/chat/completions | Premium chat completion and writing |
| route/mistral-medium-2505 | /v1/chat/completions | Summarization and balanced generation |
| route/mistral-small-2503 | /v1/chat/completions | Fast summarization and lightweight generation |
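Because each model in the table above is tied to one endpoint, a client may want to pick the path from the model id instead of hard-coding it at every call site. A minimal sketch, with the mapping copied from the table and a hypothetical helper name:

```python
# Model-to-endpoint mapping copied from the table above.
ENDPOINT_BY_MODEL = {
    "route/cohere-embed-v3-english-3": "/v1/embeddings",
    "route/cohere-embed-v3-multilingual-3": "/v1/embeddings",
    "route/qwen3-embedding-8b": "/v1/embeddings",
    "route/cohere-rerank-v4.0-pro": "/v1/rerank",
    "route/mistral-large-3": "/v1/chat/completions",
    "route/mistral-medium-2505": "/v1/chat/completions",
    "route/mistral-small-2503": "/v1/chat/completions",
}


def endpoint_for(model_id: str) -> str:
    """Return the endpoint path for a model listed in the table."""
    try:
        return ENDPOINT_BY_MODEL[model_id]
    except KeyError:
        raise ValueError(f"{model_id} is not listed in the table") from None
```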
Pricing
GET /v1/pricing is not on the anonymous path list, so pass X-API-Key or Authorization: Bearer for the middleware to let the request through. The handler returns static config, so every authenticated caller sees the same table:
curl -sS https://api.routing.run/v1/pricing \
-H "X-API-Key: ${ROUTING_RUN_API_KEY}"
{
  "models": [
    {
      "model": "route/deepseek-v3.2",
      "display_name": "DeepSeek V3.2",
      "tier": "free",
      "input_per_million": 0.0,
      "output_per_million": 0.0
    }
  ]
}
Numeric rates come from the deployed model_pricing config; see the dashboard for what you are actually charged after multipliers.
Inference usage is metered per request. Daily caps and credits depend on your plan tier; see the dashboard for current limits.
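To turn the per-million rates into a rough figure for one request, the sketch below looks up a model in a pricing payload and scales the rates by token counts. It gives the base rate only: plan multipliers are not applied, and the result is in whatever unit the configured rates use. The function names and the token counts in the usage note are illustrative.

```python
from typing import Optional


def find_pricing(pricing: dict, model_id: str) -> Optional[dict]:
    """Return the pricing entry for a model, or None if it is not listed."""
    for entry in pricing.get("models", []):
        if entry.get("model") == model_id:
            return entry
    return None


def estimate_base_cost(entry: dict, input_tokens: int, output_tokens: int) -> float:
    """Base cost before any multipliers, from the per-million rates."""
    return (
        input_tokens * entry["input_per_million"]
        + output_tokens * entry["output_per_million"]
    ) / 1_000_000
```

With the sample response above, `estimate_base_cost(find_pricing(pricing, "route/deepseek-v3.2"), 1200, 300)` comes out as 0.0, since both rates are zero on the free tier.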
Audio, image generation, embeddings, and rerank
Audio models are used by POST /v1/audio/speech and POST /v1/audio/transcriptions.
Image models are used by POST /v1/images/generations.
Embedding models are used by POST /v1/embeddings.
Rerank models are used by POST /v1/rerank.
See Audio, Image generation, and Embeddings and rerank for details.