Supported Models
Holon includes built-in configuration for 47 providers and 236 models.
This page is auto-generated from the Holon source code (src/model_catalog.rs and src/config.rs).
Run cargo run --bin holon-docgen -- models > docs/website/reference/models.md to regenerate.
Provider Setup
Each provider requires an API key or credential to use. Set the listed environment variable before running Holon.
| Provider | Transport | Base URL | Auth Env Variable(s) |
|---|---|---|---|
anthropic | Anthropic Messages | https://api.anthropic.com | ANTHROPIC_AUTH_TOKEN |
arcee | OpenAI Chat Completions | https://api.arcee.ai/api/v1 | ARCEE_API_KEY |
bigmodel | Anthropic Messages | https://open.bigmodel.cn/api/anthropic | BIGMODEL_API_KEY |
bigmodel-anthropic | Anthropic Messages | https://open.bigmodel.cn/api/anthropic | BIGMODEL_API_KEY |
bigmodel-openai | OpenAI Chat Completions | https://open.bigmodel.cn/api/paas/v4 | BIGMODEL_API_KEY |
byteplus | OpenAI Chat Completions | https://ark.ap-southeast.bytepluses.com/api/v3 | BYTEPLUS_API_KEY |
byteplus-coding | OpenAI Chat Completions | https://ark.ap-southeast.bytepluses.com/api/coding/v3 | BYTEPLUS_CODING_API_KEY or BYTEPLUS_API_KEY |
chutes | OpenAI Chat Completions | https://llm.chutes.ai/v1 | CHUTES_API_KEY |
deepseek | Anthropic Messages | https://api.deepseek.com/anthropic | DEEPSEEK_API_KEY |
deepseek-anthropic | Anthropic Messages | https://api.deepseek.com/anthropic | DEEPSEEK_API_KEY |
deepseek-openai | OpenAI Chat Completions | https://api.deepseek.com/v1 | DEEPSEEK_API_KEY |
fireworks | OpenAI Chat Completions | https://api.fireworks.ai/inference/v1 | FIREWORKS_API_KEY |
huggingface | OpenAI Chat Completions | https://router.huggingface.co/v1 | HUGGINGFACE_API_KEY or HF_TOKEN |
kilocode | OpenAI Chat Completions | https://api.kilo.ai/api/gateway | KILOCODE_API_KEY |
litellm | OpenAI Chat Completions | http://localhost:4000 | LITELLM_API_KEY |
minimax | Anthropic Messages | https://api.minimax.io/anthropic | MINIMAX_API_KEY |
minimax-portal | Anthropic Messages | https://api.minimax.io/anthropic | MINIMAX_API_KEY |
mistral | OpenAI Chat Completions | https://api.mistral.ai/v1 | MISTRAL_API_KEY |
moonshot | OpenAI Chat Completions | https://api.moonshot.ai/v1 | MOONSHOT_API_KEY |
nearai | OpenAI Chat Completions | https://cloud-api.near.ai/v1 | NEARAI_API_KEY |
nvidia | OpenAI Chat Completions | https://integrate.api.nvidia.com/v1 | NVIDIA_API_KEY |
openai | OpenAI Responses | https://api.openai.com/v1 | OPENAI_API_KEY |
openai-codex | OpenAI Codex | https://chatgpt.com/backend-api/codex | — |
opencode-go | OpenAI Chat Completions | https://opencode.ai/zen/go/v1 | OPENCODE_GO_API_KEY |
openrouter | OpenAI Chat Completions | https://openrouter.ai/api/v1 | OPENROUTER_API_KEY |
qianfan | OpenAI Chat Completions | https://qianfan.baidubce.com/v2 | QIANFAN_API_KEY |
qwen | OpenAI Chat Completions | https://coding-intl.dashscope.aliyuncs.com/v1 | QWEN_API_KEY or DASHSCOPE_API_KEY |
stepfun | OpenAI Chat Completions | https://api.stepfun.ai/v1 | STEPFUN_API_KEY |
stepfun-plan | OpenAI Chat Completions | https://api.stepfun.ai/step_plan/v1 | STEPFUN_PLAN_API_KEY or STEPFUN_API_KEY |
synthetic | Anthropic Messages | https://api.synthetic.new/anthropic | SYNTHETIC_API_KEY |
tencent-tokenhub | OpenAI Chat Completions | https://tokenhub.tencentmaas.com/v1 | TOKENHUB_API_KEY |
together | OpenAI Chat Completions | https://api.together.xyz/v1 | TOGETHER_API_KEY |
venice | OpenAI Chat Completions | https://api.venice.ai/api/v1 | VENICE_API_KEY |
vercel-ai-gateway | Anthropic Messages | https://ai-gateway.vercel.sh | AI_GATEWAY_API_KEY or VERCEL_AI_GATEWAY_API_KEY |
vllm | OpenAI Chat Completions | http://127.0.0.1:8000/v1 | — |
volcengine | OpenAI Chat Completions | https://ark.cn-beijing.volces.com/api/v3 | VOLCENGINE_API_KEY or ARK_API_KEY |
volcengine-coding | OpenAI Chat Completions | https://ark.cn-beijing.volces.com/api/coding/v3 | VOLCENGINE_CODING_API_KEY or VOLCENGINE_API_KEY or ARK_API_KEY |
xai | OpenAI Chat Completions | https://api.x.ai/v1 | XAI_API_KEY |
xiaomi | Anthropic Messages | https://api.xiaomimimo.com/anthropic | XIAOMI_API_KEY |
xiaomi-anthropic | Anthropic Messages | https://api.xiaomimimo.com/anthropic | XIAOMI_API_KEY |
xiaomi-openai | OpenAI Chat Completions | https://api.xiaomimimo.com/v1 | XIAOMI_API_KEY |
xiaomi-token-plan | Anthropic Messages | https://token-plan-cn.xiaomimimo.com/anthropic | XIAOMI_TOKEN_PLAN_API_KEY |
xiaomi-token-plan-anthropic | Anthropic Messages | https://token-plan-cn.xiaomimimo.com/anthropic | XIAOMI_TOKEN_PLAN_API_KEY |
xiaomi-token-plan-openai | OpenAI Chat Completions | https://token-plan-cn.xiaomimimo.com/v1 | XIAOMI_TOKEN_PLAN_API_KEY |
zai | Anthropic Messages | https://api.z.ai/api/anthropic | ZAI_API_KEY |
zai-anthropic | Anthropic Messages | https://api.z.ai/api/anthropic | ZAI_API_KEY |
zai-openai | OpenAI Chat Completions | https://api.z.ai/api/paas/v4 | ZAI_API_KEY |
Model Catalog
The table below lists every built-in model with its context window, max output tokens, and capabilities.
| Provider | Model | Usage | Context Window | Max Output | Reasoning | Image |
|---|---|---|---|---|---|---|
anthropic | claude-haiku-4-5 | anthropic/claude-haiku-4-5 | 200000 | 32000 | — | ✅ |
anthropic | claude-opus-4-5 | anthropic/claude-opus-4-5 | 200000 | 64000 | ✅ | ✅ |
anthropic | claude-opus-4-6 | anthropic/claude-opus-4-6 | 1000000 | 128000 | ✅ | ✅ |
anthropic | claude-opus-4-7 | anthropic/claude-opus-4-7 | 1000000 | 128000 | ✅ | ✅ |
anthropic | claude-sonnet-4-5 | anthropic/claude-sonnet-4-5 | 200000 | 64000 | ✅ | ✅ |
anthropic | claude-sonnet-4-6 | anthropic/claude-sonnet-4-6 | 200000 | 32000 | — | ✅ |
arcee | trinity-large-preview | arcee/trinity-large-preview | 131072 | 16384 | — | — |
arcee | trinity-large-thinking | arcee/trinity-large-thinking | 262144 | 80000 | ✅ | — |
arcee | trinity-mini | arcee/trinity-mini | 131072 | 80000 | — | — |
bigmodel | glm-4.5 | bigmodel/glm-4.5 | 131072 | 98304 | ✅ | — |
bigmodel | glm-4.5-air | bigmodel/glm-4.5-air | 131072 | 98304 | ✅ | — |
bigmodel | glm-4.5-flash | bigmodel/glm-4.5-flash | 131072 | 98304 | ✅ | — |
bigmodel | glm-4.5v | bigmodel/glm-4.5v | 64000 | 16384 | ✅ | ✅ |
bigmodel | glm-4.6 | bigmodel/glm-4.6 | 204800 | 131072 | ✅ | — |
bigmodel | glm-4.6v | bigmodel/glm-4.6v | 128000 | 32768 | ✅ | ✅ |
bigmodel | glm-4.7 | bigmodel/glm-4.7 | 204800 | 131072 | ✅ | — |
bigmodel | glm-4.7-flash | bigmodel/glm-4.7-flash | 200000 | 131072 | ✅ | — |
bigmodel | glm-4.7-flashx | bigmodel/glm-4.7-flashx | 200000 | 128000 | ✅ | — |
bigmodel | glm-5 | bigmodel/glm-5 | 202800 | 131100 | ✅ | — |
bigmodel | glm-5-turbo | bigmodel/glm-5-turbo | 202800 | 131100 | ✅ | — |
bigmodel | glm-5.1 | bigmodel/glm-5.1 | 202800 | 131100 | ✅ | — |
bigmodel | glm-5v-turbo | bigmodel/glm-5v-turbo | 202800 | 131100 | ✅ | ✅ |
bigmodel-anthropic | glm-4.5 | bigmodel-anthropic/glm-4.5 | 131072 | 98304 | ✅ | — |
bigmodel-anthropic | glm-4.5-air | bigmodel-anthropic/glm-4.5-air | 131072 | 98304 | ✅ | — |
bigmodel-anthropic | glm-4.5-flash | bigmodel-anthropic/glm-4.5-flash | 131072 | 98304 | ✅ | — |
bigmodel-anthropic | glm-4.5v | bigmodel-anthropic/glm-4.5v | 64000 | 16384 | ✅ | ✅ |
bigmodel-anthropic | glm-4.6 | bigmodel-anthropic/glm-4.6 | 204800 | 131072 | ✅ | — |
bigmodel-anthropic | glm-4.6v | bigmodel-anthropic/glm-4.6v | 128000 | 32768 | ✅ | ✅ |
bigmodel-anthropic | glm-4.7 | bigmodel-anthropic/glm-4.7 | 204800 | 131072 | ✅ | — |
bigmodel-anthropic | glm-4.7-flash | bigmodel-anthropic/glm-4.7-flash | 200000 | 131072 | ✅ | — |
bigmodel-anthropic | glm-4.7-flashx | bigmodel-anthropic/glm-4.7-flashx | 200000 | 128000 | ✅ | — |
bigmodel-anthropic | glm-5 | bigmodel-anthropic/glm-5 | 202800 | 131100 | ✅ | — |
bigmodel-anthropic | glm-5-turbo | bigmodel-anthropic/glm-5-turbo | 202800 | 131100 | ✅ | — |
bigmodel-anthropic | glm-5.1 | bigmodel-anthropic/glm-5.1 | 202800 | 131100 | ✅ | — |
bigmodel-anthropic | glm-5v-turbo | bigmodel-anthropic/glm-5v-turbo | 202800 | 131100 | ✅ | ✅ |
bigmodel-openai | glm-4.5 | bigmodel-openai/glm-4.5 | 131072 | 98304 | ✅ | — |
bigmodel-openai | glm-4.5-air | bigmodel-openai/glm-4.5-air | 131072 | 98304 | ✅ | — |
bigmodel-openai | glm-4.5-flash | bigmodel-openai/glm-4.5-flash | 131072 | 98304 | ✅ | — |
bigmodel-openai | glm-4.5v | bigmodel-openai/glm-4.5v | 64000 | 16384 | ✅ | ✅ |
bigmodel-openai | glm-4.6 | bigmodel-openai/glm-4.6 | 204800 | 131072 | ✅ | — |
bigmodel-openai | glm-4.6v | bigmodel-openai/glm-4.6v | 128000 | 32768 | ✅ | ✅ |
bigmodel-openai | glm-4.7 | bigmodel-openai/glm-4.7 | 204800 | 131072 | ✅ | — |
bigmodel-openai | glm-4.7-flash | bigmodel-openai/glm-4.7-flash | 200000 | 131072 | ✅ | — |
bigmodel-openai | glm-4.7-flashx | bigmodel-openai/glm-4.7-flashx | 200000 | 128000 | ✅ | — |
bigmodel-openai | glm-5 | bigmodel-openai/glm-5 | 202800 | 131100 | ✅ | — |
bigmodel-openai | glm-5-turbo | bigmodel-openai/glm-5-turbo | 202800 | 131100 | ✅ | — |
bigmodel-openai | glm-5.1 | bigmodel-openai/glm-5.1 | 202800 | 131100 | ✅ | — |
bigmodel-openai | glm-5v-turbo | bigmodel-openai/glm-5v-turbo | 202800 | 131100 | ✅ | ✅ |
byteplus | moonshotai/kimi-k2.5 | byteplus/moonshotai/kimi-k2.5 | 262144 | 32768 | ✅ | ✅ |
byteplus | seed-1-8-251228 | byteplus/seed-1-8-251228 | 256000 | 4096 | — | ✅ |
byteplus | zai-org/glm-4.7 | byteplus/zai-org/glm-4.7 | 204800 | 131072 | ✅ | — |
byteplus-coding | ark-code-latest | byteplus-coding/ark-code-latest | 256000 | 65536 | ✅ | — |
chutes | deepseek-ai/DeepSeek-V3.2-TEE | chutes/deepseek-ai/DeepSeek-V3.2-TEE | 131072 | 65536 | ✅ | — |
chutes | moonshotai/Kimi-K2.5-TEE | chutes/moonshotai/Kimi-K2.5-TEE | 262144 | 65535 | ✅ | ✅ |
chutes | openai/gpt-oss-120b-TEE | chutes/openai/gpt-oss-120b-TEE | 131072 | 65536 | ✅ | — |
chutes | zai-org/GLM-4.7-TEE | chutes/zai-org/GLM-4.7-TEE | 202752 | 65535 | ✅ | — |
deepseek | deepseek-chat | deepseek/deepseek-chat | 131072 | 8192 | — | — |
deepseek | deepseek-reasoner | deepseek/deepseek-reasoner | 131072 | 65536 | ✅ | — |
deepseek | deepseek-v4-flash | deepseek/deepseek-v4-flash | 1000000 | 384000 | ✅ | — |
deepseek | deepseek-v4-pro | deepseek/deepseek-v4-pro | 1000000 | 384000 | ✅ | — |
deepseek-anthropic | deepseek-chat | deepseek-anthropic/deepseek-chat | 131072 | 8192 | — | — |
deepseek-anthropic | deepseek-reasoner | deepseek-anthropic/deepseek-reasoner | 131072 | 65536 | ✅ | — |
deepseek-anthropic | deepseek-v4-flash | deepseek-anthropic/deepseek-v4-flash | 1000000 | 384000 | ✅ | — |
deepseek-anthropic | deepseek-v4-pro | deepseek-anthropic/deepseek-v4-pro | 1000000 | 384000 | ✅ | — |
deepseek-openai | deepseek-chat | deepseek-openai/deepseek-chat | 131072 | 8192 | — | — |
deepseek-openai | deepseek-reasoner | deepseek-openai/deepseek-reasoner | 131072 | 65536 | ✅ | — |
deepseek-openai | deepseek-v4-flash | deepseek-openai/deepseek-v4-flash | 1000000 | 384000 | ✅ | — |
deepseek-openai | deepseek-v4-pro | deepseek-openai/deepseek-v4-pro | 1000000 | 384000 | ✅ | — |
fireworks | accounts/fireworks/models/kimi-k2p6 | fireworks/accounts/fireworks/models/kimi-k2p6 | 262144 | 262144 | — | ✅ |
fireworks | accounts/fireworks/routers/kimi-k2p5-turbo | fireworks/accounts/fireworks/routers/kimi-k2p5-turbo | 256000 | 256000 | — | ✅ |
huggingface | moonshotai/Kimi-K2-Instruct | huggingface/moonshotai/Kimi-K2-Instruct | 262144 | 32768 | — | — |
kilocode | kilo/auto | kilocode/kilo/auto | 1000000 | 128000 | ✅ | ✅ |
litellm | claude-opus-4-6 | litellm/claude-opus-4-6 | 200000 | 128000 | ✅ | ✅ |
minimax | MiniMax-M2.7 | minimax/MiniMax-M2.7 | 200000 | 128000 | ✅ | — |
minimax | MiniMax-M2.7-highspeed | minimax/MiniMax-M2.7-highspeed | 200000 | 128000 | ✅ | — |
minimax-portal | MiniMax-M2.7 | minimax-portal/MiniMax-M2.7 | 200000 | 128000 | ✅ | — |
mistral | codestral-latest | mistral/codestral-latest | 256000 | 4096 | — | — |
mistral | devstral-medium-latest | mistral/devstral-medium-latest | 262144 | 32768 | — | — |
mistral | magistral-small | mistral/magistral-small | 128000 | 40000 | ✅ | — |
mistral | mistral-large-latest | mistral/mistral-large-latest | 262144 | 16384 | — | ✅ |
mistral | mistral-medium-2508 | mistral/mistral-medium-2508 | 262144 | 8192 | — | ✅ |
mistral | mistral-small-latest | mistral/mistral-small-latest | 128000 | 16384 | ✅ | ✅ |
mistral | pixtral-large-latest | mistral/pixtral-large-latest | 128000 | 32768 | — | ✅ |
moonshot | kimi-k2-thinking | moonshot/kimi-k2-thinking | 262144 | 262144 | ✅ | — |
moonshot | kimi-k2-thinking-turbo | moonshot/kimi-k2-thinking-turbo | 262144 | 262144 | ✅ | — |
moonshot | kimi-k2-turbo | moonshot/kimi-k2-turbo | 256000 | 16384 | — | — |
moonshot | kimi-k2.5 | moonshot/kimi-k2.5 | 262144 | 262144 | — | ✅ |
moonshot | kimi-k2.6 | moonshot/kimi-k2.6 | 262144 | 262144 | — | ✅ |
nearai | Qwen/Qwen3-VL-30B-A3B-Instruct | nearai/Qwen/Qwen3-VL-30B-A3B-Instruct | 256000 | 65536 | ✅ | ✅ |
nearai | Qwen/Qwen3.5-122B-A10B | nearai/Qwen/Qwen3.5-122B-A10B | 131072 | 65536 | ✅ | — |
nearai | Qwen/Qwen3.6-35B-A3B-FP8 | nearai/Qwen/Qwen3.6-35B-A3B-FP8 | 262144 | 65536 | ✅ | — |
nearai | google/gemma-4-31B-it | nearai/google/gemma-4-31B-it | 262144 | 32768 | — | — |
nearai | zai-org/GLM-5.1-FP8 | nearai/zai-org/GLM-5.1-FP8 | 202752 | 131100 | ✅ | — |
nvidia | minimaxai/minimax-m2.5 | nvidia/minimaxai/minimax-m2.5 | 196608 | 8192 | — | — |
nvidia | moonshotai/kimi-k2.5 | nvidia/moonshotai/kimi-k2.5 | 262144 | 8192 | — | — |
nvidia | nvidia/nemotron-3-super-120b-a12b | nvidia/nvidia/nemotron-3-super-120b-a12b | 262144 | 8192 | — | — |
nvidia | z-ai/glm5 | nvidia/z-ai/glm5 | 202752 | 8192 | — | — |
openai | gpt-5.2 | openai/gpt-5.2 | 272000 | 128000 | ✅ | ✅ |
openai | gpt-5.3 | openai/gpt-5.3 | 128000 | — | ✅ | ✅ |
openai | gpt-5.4 | openai/gpt-5.4 | 272000 | — | ✅ | ✅ |
openai | gpt-5.4-mini | openai/gpt-5.4-mini | 128000 | — | ✅ | ✅ |
openai | gpt-5.5 | openai/gpt-5.5 | 272000 | 128000 | ✅ | ✅ |
openai-codex | gpt-5.2 | openai-codex/gpt-5.2 | 272000 | 128000 | ✅ | ✅ |
openai-codex | gpt-5.3-codex | openai-codex/gpt-5.3-codex | 272000 | — | ✅ | ✅ |
openai-codex | gpt-5.3-codex-spark | openai-codex/gpt-5.3-codex-spark | 128000 | — | ✅ | ✅ |
openai-codex | gpt-5.4 | openai-codex/gpt-5.4 | 272000 | — | ✅ | ✅ |
openai-codex | gpt-5.4-mini | openai-codex/gpt-5.4-mini | 272000 | 128000 | ✅ | ✅ |
openai-codex | gpt-5.5 | openai-codex/gpt-5.5 | 272000 | 128000 | ✅ | ✅ |
opencode-go | deepseek-v4-flash | opencode-go/deepseek-v4-flash | 1000000 | 384000 | ✅ | — |
opencode-go | deepseek-v4-pro | opencode-go/deepseek-v4-pro | 1000000 | 384000 | ✅ | — |
openrouter | auto | openrouter/auto | 200000 | 8192 | — | ✅ |
openrouter | moonshotai/kimi-k2.6 | openrouter/moonshotai/kimi-k2.6 | 262144 | 262144 | ✅ | ✅ |
openrouter | openrouter/healer-alpha | openrouter/openrouter/healer-alpha | 262144 | 65536 | ✅ | ✅ |
openrouter | openrouter/hunter-alpha | openrouter/openrouter/hunter-alpha | 1048576 | 65536 | ✅ | — |
qianfan | deepseek-v3.2 | qianfan/deepseek-v3.2 | 98304 | 32768 | ✅ | — |
qianfan | ernie-5.0-thinking-preview | qianfan/ernie-5.0-thinking-preview | 119000 | 64000 | ✅ | ✅ |
qwen | MiniMax-M2.5 | qwen/MiniMax-M2.5 | 1000000 | 65536 | ✅ | — |
qwen | glm-4.7 | qwen/glm-4.7 | 202752 | 16384 | — | — |
qwen | glm-5 | qwen/glm-5 | 202752 | 16384 | — | — |
qwen | kimi-k2.5 | qwen/kimi-k2.5 | 262144 | 32768 | — | ✅ |
qwen | qwen3-coder-next | qwen/qwen3-coder-next | 262144 | 65536 | — | — |
qwen | qwen3-coder-plus | qwen/qwen3-coder-plus | 1000000 | 65536 | — | — |
qwen | qwen3-max-2026-01-23 | qwen/qwen3-max-2026-01-23 | 262144 | 65536 | — | — |
qwen | qwen3.5-plus | qwen/qwen3.5-plus | 1000000 | 65536 | — | ✅ |
qwen | qwen3.6-plus | qwen/qwen3.6-plus | 1000000 | 65536 | — | ✅ |
stepfun | step-3.5-flash | stepfun/step-3.5-flash | 262144 | 65536 | ✅ | — |
stepfun-plan | step-3.5-flash | stepfun-plan/step-3.5-flash | 262144 | 65536 | ✅ | — |
stepfun-plan | step-3.5-flash-2603 | stepfun-plan/step-3.5-flash-2603 | 262144 | 65536 | ✅ | — |
synthetic | hf:MiniMaxAI/MiniMax-M2.5 | synthetic/hf:MiniMaxAI/MiniMax-M2.5 | 192000 | 65536 | — | — |
synthetic | hf:Qwen/Qwen3-235B-A22B-Instruct-2507 | synthetic/hf:Qwen/Qwen3-235B-A22B-Instruct-2507 | 256000 | 8192 | — | — |
synthetic | hf:Qwen/Qwen3-235B-A22B-Thinking-2507 | synthetic/hf:Qwen/Qwen3-235B-A22B-Thinking-2507 | 256000 | 8192 | ✅ | — |
synthetic | hf:Qwen/Qwen3-Coder-480B-A35B-Instruct | synthetic/hf:Qwen/Qwen3-Coder-480B-A35B-Instruct | 256000 | 8192 | — | — |
synthetic | hf:Qwen/Qwen3-VL-235B-A22B-Instruct | synthetic/hf:Qwen/Qwen3-VL-235B-A22B-Instruct | 250000 | 8192 | — | ✅ |
synthetic | hf:deepseek-ai/DeepSeek-R1-0528 | synthetic/hf:deepseek-ai/DeepSeek-R1-0528 | 128000 | 8192 | — | — |
synthetic | hf:deepseek-ai/DeepSeek-V3 | synthetic/hf:deepseek-ai/DeepSeek-V3 | 128000 | 8192 | — | — |
synthetic | hf:deepseek-ai/DeepSeek-V3-0324 | synthetic/hf:deepseek-ai/DeepSeek-V3-0324 | 128000 | 8192 | — | — |
synthetic | hf:deepseek-ai/DeepSeek-V3.1 | synthetic/hf:deepseek-ai/DeepSeek-V3.1 | 128000 | 8192 | — | — |
synthetic | hf:deepseek-ai/DeepSeek-V3.1-Terminus | synthetic/hf:deepseek-ai/DeepSeek-V3.1-Terminus | 128000 | 8192 | — | — |
synthetic | hf:deepseek-ai/DeepSeek-V3.2 | synthetic/hf:deepseek-ai/DeepSeek-V3.2 | 159000 | 8192 | — | — |
synthetic | hf:meta-llama/Llama-3.3-70B-Instruct | synthetic/hf:meta-llama/Llama-3.3-70B-Instruct | 128000 | 8192 | — | — |
synthetic | hf:meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | synthetic/hf:meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | 524000 | 8192 | — | — |
synthetic | hf:moonshotai/Kimi-K2-Instruct-0905 | synthetic/hf:moonshotai/Kimi-K2-Instruct-0905 | 256000 | 8192 | — | — |
synthetic | hf:moonshotai/Kimi-K2-Thinking | synthetic/hf:moonshotai/Kimi-K2-Thinking | 256000 | 8192 | ✅ | — |
synthetic | hf:moonshotai/Kimi-K2.5 | synthetic/hf:moonshotai/Kimi-K2.5 | 256000 | 8192 | ✅ | ✅ |
synthetic | hf:openai/gpt-oss-120b | synthetic/hf:openai/gpt-oss-120b | 128000 | 8192 | — | — |
synthetic | hf:zai-org/GLM-4.5 | synthetic/hf:zai-org/GLM-4.5 | 128000 | 128000 | — | — |
synthetic | hf:zai-org/GLM-4.6 | synthetic/hf:zai-org/GLM-4.6 | 198000 | 128000 | — | — |
synthetic | hf:zai-org/GLM-4.7 | synthetic/hf:zai-org/GLM-4.7 | 198000 | 128000 | — | — |
synthetic | hf:zai-org/GLM-5 | synthetic/hf:zai-org/GLM-5 | 256000 | 128000 | ✅ | ✅ |
tencent-tokenhub | hy3-preview | tencent-tokenhub/hy3-preview | 256000 | 64000 | ✅ | — |
together | deepseek-ai/DeepSeek-R1 | together/deepseek-ai/DeepSeek-R1 | 131072 | 8192 | ✅ | — |
together | deepseek-ai/DeepSeek-V3.1 | together/deepseek-ai/DeepSeek-V3.1 | 131072 | 8192 | — | — |
together | meta-llama/Llama-3.3-70B-Instruct-Turbo | together/meta-llama/Llama-3.3-70B-Instruct-Turbo | 131072 | 8192 | — | — |
together | meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | together/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | 20000000 | 32768 | — | ✅ |
together | meta-llama/Llama-4-Scout-17B-16E-Instruct | together/meta-llama/Llama-4-Scout-17B-16E-Instruct | 10000000 | 32768 | — | ✅ |
together | moonshotai/Kimi-K2-Instruct-0905 | together/moonshotai/Kimi-K2-Instruct-0905 | 262144 | 8192 | — | — |
together | moonshotai/Kimi-K2.5 | together/moonshotai/Kimi-K2.5 | 262144 | 32768 | ✅ | ✅ |
together | zai-org/GLM-4.7 | together/zai-org/GLM-4.7 | 202752 | 8192 | — | — |
venice | claude-opus-4-6 | venice/claude-opus-4-6 | 1000000 | 128000 | ✅ | ✅ |
venice | claude-sonnet-4-6 | venice/claude-sonnet-4-6 | 1000000 | 128000 | ✅ | ✅ |
vercel-ai-gateway | anthropic/claude-opus-4.6 | vercel-ai-gateway/anthropic/claude-opus-4.6 | 1000000 | 128000 | ✅ | ✅ |
vercel-ai-gateway | moonshotai/kimi-k2.6 | vercel-ai-gateway/moonshotai/kimi-k2.6 | 262144 | 262144 | ✅ | ✅ |
vercel-ai-gateway | openai/gpt-5.4 | vercel-ai-gateway/openai/gpt-5.4 | 200000 | 128000 | ✅ | ✅ |
vercel-ai-gateway | openai/gpt-5.4-pro | vercel-ai-gateway/openai/gpt-5.4-pro | 200000 | 128000 | ✅ | ✅ |
vllm | meta-llama/Meta-Llama-3-8B-Instruct | vllm/meta-llama/Meta-Llama-3-8B-Instruct | 131072 | 8192 | — | — |
volcengine | deepseek-v3-2-251201 | volcengine/deepseek-v3-2-251201 | 128000 | 4096 | — | ✅ |
volcengine | doubao-seed-1-8-251228 | volcengine/doubao-seed-1-8-251228 | 256000 | 4096 | — | ✅ |
volcengine | doubao-seed-code-preview-251028 | volcengine/doubao-seed-code-preview-251028 | 256000 | 4096 | — | ✅ |
volcengine-coding | ark-code-latest | volcengine-coding/ark-code-latest | 256000 | 65536 | ✅ | — |
volcengine-coding | doubao-seed-code-preview-251028 | volcengine-coding/doubao-seed-code-preview-251028 | 256000 | 4096 | — | — |
xai | grok-3 | xai/grok-3 | 131072 | 8192 | — | — |
xai | grok-3-fast | xai/grok-3-fast | 131072 | 8192 | — | — |
xai | grok-3-mini | xai/grok-3-mini | 131072 | 8192 | ✅ | — |
xai | grok-3-mini-fast | xai/grok-3-mini-fast | 131072 | 8192 | ✅ | — |
xai | grok-4 | xai/grok-4 | 256000 | 64000 | ✅ | — |
xai | grok-4-1-fast | xai/grok-4-1-fast | 2000000 | 30000 | ✅ | ✅ |
xai | grok-4-fast | xai/grok-4-fast | 2000000 | 30000 | ✅ | ✅ |
xai | grok-4-fast-non-reasoning | xai/grok-4-fast-non-reasoning | 2000000 | 30000 | — | ✅ |
xai | grok-code-fast-1 | xai/grok-code-fast-1 | 256000 | 10000 | ✅ | — |
xiaomi | mimo-v2-flash | xiaomi/mimo-v2-flash | 262144 | 8192 | — | — |
xiaomi | mimo-v2-omni | xiaomi/mimo-v2-omni | 262144 | 32000 | ✅ | ✅ |
xiaomi | mimo-v2-pro | xiaomi/mimo-v2-pro | 1048576 | 32000 | ✅ | — |
xiaomi-anthropic | mimo-v2-flash | xiaomi-anthropic/mimo-v2-flash | 262144 | 8192 | — | — |
xiaomi-anthropic | mimo-v2-omni | xiaomi-anthropic/mimo-v2-omni | 262144 | 32000 | ✅ | ✅ |
xiaomi-anthropic | mimo-v2-pro | xiaomi-anthropic/mimo-v2-pro | 1048576 | 32000 | ✅ | — |
xiaomi-openai | mimo-v2-flash | xiaomi-openai/mimo-v2-flash | 262144 | 8192 | — | — |
xiaomi-openai | mimo-v2-omni | xiaomi-openai/mimo-v2-omni | 262144 | 32000 | ✅ | ✅ |
xiaomi-openai | mimo-v2-pro | xiaomi-openai/mimo-v2-pro | 1048576 | 32000 | ✅ | — |
xiaomi-token-plan | mimo-v2-flash | xiaomi-token-plan/mimo-v2-flash | 262144 | 8192 | — | — |
xiaomi-token-plan | mimo-v2-omni | xiaomi-token-plan/mimo-v2-omni | 262144 | 32000 | ✅ | ✅ |
xiaomi-token-plan | mimo-v2-pro | xiaomi-token-plan/mimo-v2-pro | 1048576 | 32000 | ✅ | — |
xiaomi-token-plan-anthropic | mimo-v2-flash | xiaomi-token-plan-anthropic/mimo-v2-flash | 262144 | 8192 | — | — |
xiaomi-token-plan-anthropic | mimo-v2-omni | xiaomi-token-plan-anthropic/mimo-v2-omni | 262144 | 32000 | ✅ | ✅ |
xiaomi-token-plan-anthropic | mimo-v2-pro | xiaomi-token-plan-anthropic/mimo-v2-pro | 1048576 | 32000 | ✅ | — |
xiaomi-token-plan-openai | mimo-v2-flash | xiaomi-token-plan-openai/mimo-v2-flash | 262144 | 8192 | — | — |
xiaomi-token-plan-openai | mimo-v2-omni | xiaomi-token-plan-openai/mimo-v2-omni | 262144 | 32000 | ✅ | ✅ |
xiaomi-token-plan-openai | mimo-v2-pro | xiaomi-token-plan-openai/mimo-v2-pro | 1048576 | 32000 | ✅ | — |
zai | glm-4.5 | zai/glm-4.5 | 131072 | 98304 | ✅ | — |
zai | glm-4.5-air | zai/glm-4.5-air | 131072 | 98304 | ✅ | — |
zai | glm-4.5-flash | zai/glm-4.5-flash | 131072 | 98304 | ✅ | — |
zai | glm-4.5v | zai/glm-4.5v | 64000 | 16384 | ✅ | ✅ |
zai | glm-4.6 | zai/glm-4.6 | 204800 | 131072 | ✅ | — |
zai | glm-4.6v | zai/glm-4.6v | 128000 | 32768 | ✅ | ✅ |
zai | glm-4.7 | zai/glm-4.7 | 204800 | 131072 | ✅ | — |
zai | glm-4.7-flash | zai/glm-4.7-flash | 200000 | 131072 | ✅ | — |
zai | glm-4.7-flashx | zai/glm-4.7-flashx | 200000 | 128000 | ✅ | — |
zai | glm-5 | zai/glm-5 | 202800 | 131100 | ✅ | — |
zai | glm-5-turbo | zai/glm-5-turbo | 202800 | 131100 | ✅ | — |
zai | glm-5.1 | zai/glm-5.1 | 202800 | 131100 | ✅ | — |
zai | glm-5v-turbo | zai/glm-5v-turbo | 202800 | 131100 | ✅ | ✅ |
zai-anthropic | glm-4.5 | zai-anthropic/glm-4.5 | 131072 | 98304 | ✅ | — |
zai-anthropic | glm-4.5-air | zai-anthropic/glm-4.5-air | 131072 | 98304 | ✅ | — |
zai-anthropic | glm-4.5-flash | zai-anthropic/glm-4.5-flash | 131072 | 98304 | ✅ | — |
zai-anthropic | glm-4.5v | zai-anthropic/glm-4.5v | 64000 | 16384 | ✅ | ✅ |
zai-anthropic | glm-4.6 | zai-anthropic/glm-4.6 | 204800 | 131072 | ✅ | — |
zai-anthropic | glm-4.6v | zai-anthropic/glm-4.6v | 128000 | 32768 | ✅ | ✅ |
zai-anthropic | glm-4.7 | zai-anthropic/glm-4.7 | 204800 | 131072 | ✅ | — |
zai-anthropic | glm-4.7-flash | zai-anthropic/glm-4.7-flash | 200000 | 131072 | ✅ | — |
zai-anthropic | glm-4.7-flashx | zai-anthropic/glm-4.7-flashx | 200000 | 128000 | ✅ | — |
zai-anthropic | glm-5 | zai-anthropic/glm-5 | 202800 | 131100 | ✅ | — |
zai-anthropic | glm-5-turbo | zai-anthropic/glm-5-turbo | 202800 | 131100 | ✅ | — |
zai-anthropic | glm-5.1 | zai-anthropic/glm-5.1 | 202800 | 131100 | ✅ | — |
zai-anthropic | glm-5v-turbo | zai-anthropic/glm-5v-turbo | 202800 | 131100 | ✅ | ✅ |
zai-openai | glm-4.5 | zai-openai/glm-4.5 | 131072 | 98304 | ✅ | — |
zai-openai | glm-4.5-air | zai-openai/glm-4.5-air | 131072 | 98304 | ✅ | — |
zai-openai | glm-4.5-flash | zai-openai/glm-4.5-flash | 131072 | 98304 | ✅ | — |
zai-openai | glm-4.5v | zai-openai/glm-4.5v | 64000 | 16384 | ✅ | ✅ |
zai-openai | glm-4.6 | zai-openai/glm-4.6 | 204800 | 131072 | ✅ | — |
zai-openai | glm-4.6v | zai-openai/glm-4.6v | 128000 | 32768 | ✅ | ✅ |
zai-openai | glm-4.7 | zai-openai/glm-4.7 | 204800 | 131072 | ✅ | — |
zai-openai | glm-4.7-flash | zai-openai/glm-4.7-flash | 200000 | 131072 | ✅ | — |
zai-openai | glm-4.7-flashx | zai-openai/glm-4.7-flashx | 200000 | 128000 | ✅ | — |
zai-openai | glm-5 | zai-openai/glm-5 | 202800 | 131100 | ✅ | — |
zai-openai | glm-5-turbo | zai-openai/glm-5-turbo | 202800 | 131100 | ✅ | — |
zai-openai | glm-5.1 | zai-openai/glm-5.1 | 202800 | 131100 | ✅ | — |
zai-openai | glm-5v-turbo | zai-openai/glm-5v-turbo | 202800 | 131100 | ✅ | ✅ |
