List supported model IDs for inference

| Model ID | Provider | Context |
|---|---|---|
| claude-opus-4-6 | Anthropic | 1M |
| claude-opus-4-5 | Anthropic | 200k |
| claude-sonnet-4-6 | Anthropic | 1M |
| claude-sonnet-4-5 | Anthropic | 200k |
| claude-sonnet-4 | Anthropic | 200k |
| claude-haiku-4-5 | Anthropic | 200k |
| claude-haiku-3-5 | Anthropic | 200k |
| gpt-5.2 | OpenAI | 400k |
| gpt-5-3-codex | OpenAI | 400k |
| gpt-5 | OpenAI | 400k |
| gpt-5-mini | OpenAI | 400k |
| gpt-5-nano | OpenAI | 400k |
| gpt-4o | OpenAI | 128k |
| gpt-4o-mini | OpenAI | 128k |
| gpt-4.1 | OpenAI | 1M |
| grok-4-1-fast-reasoning | xAI | 2M |
| grok-4-1-fast-non-reasoning | xAI | 2M |
| grok-code-fast-1 | xAI | 256k |
| gemini-3-1-pro-preview | | 1M |
| gemini-3-pro-preview | | 1M |
| gemini-3-flash-preview | | 1M |
| gemini-2.5-pro | | 1M |
| gemini-2.5-flash | | 1M |
| gemini-2.5-flash-lite | | 1M |
| deepseek-v3-2 | Fireworks | 128k |
| deepseek-r1 | Bedrock | 128k |
| gpt-oss-120b | Bedrock | 128k |
| gpt-oss-20b | Bedrock | 128k |
| minimax-m2.5 | Fireworks AI | 192k |
| kimi-k2.5 | Fireworks AI | 262k |
| kimi-k2-thinking | Bedrock | 262k |
| zai-glm-5 | Fireworks AI | 198k |
| zai-glm-4.7 | Bedrock | 128k |
| zai-glm-4.7-flash | Bedrock | 128k |
| f1 | Firmware | 128k |
| f1-flash | Firmware | 1M |
| f1-pro | Firmware | 1M |

| Model ID | Provider | Price per 1M tokens |
|---|---|---|
| text-embedding-3-small | OpenAI | $0.02 |
| text-embedding-3-large | OpenAI | $0.13 |
| text-embedding-ada-002 | OpenAI | $0.10 |
| text-embedding-004 | | Free |
| voyage-4-large | Voyage AI | $0.12 |
| voyage-4 | Voyage AI | $0.06 |
| voyage-4-lite | Voyage AI | $0.02 |
| voyage-context-3 | Voyage AI | $0.18 |
| voyage-code-3 | Voyage AI | $0.18 |
| voyage-finance-2 | Voyage AI | $0.12 |
| voyage-law-2 | Voyage AI | $0.12 |
| voyage-code-2 | Voyage AI | $0.12 |
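
The per-1M-token prices in the embeddings table translate into a request cost by simple proportion. The sketch below works through that arithmetic for a few of the listed models. It assumes billing is strictly linear in token count, with no minimum charge or rounding, which the table does not confirm; the function and dictionary names are illustrative only.

```python
# Prices copied from the embeddings table (USD per 1M tokens).
EMBEDDING_PRICE_PER_1M = {
    "text-embedding-3-small": 0.02,
    "text-embedding-3-large": 0.13,
    "voyage-4": 0.06,
}


def embedding_cost_usd(model_id: str, tokens: int) -> float:
    """Estimate cost as tokens / 1,000,000 * listed price.

    Assumption: linear billing with no minimums or rounding.
    """
    return tokens / 1_000_000 * EMBEDDING_PRICE_PER_1M[model_id]


if __name__ == "__main__":
    # 2.5M tokens on text-embedding-3-large: 2.5 * $0.13
    print(f"${embedding_cost_usd('text-embedding-3-large', 2_500_000):.4f}")
```

Under these assumptions, embedding 2.5M tokens with `text-embedding-3-large` comes to about $0.325.
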

| Model ID | Provider | Price per 1M tokens |
|---|---|---|
| rerank-2.5 | Voyage AI | $0.05 |
| rerank-2.5-lite | Voyage AI | $0.02 |

| Model ID | Provider | Price |
|---|---|---|
| whisper-1 | OpenAI | $0.006/min |
| elevenlabs-scribe-v2 | ElevenLabs | $0.40/hr |
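
The two transcription prices use different units (per minute and per hour), so comparing them requires a common basis. The sketch below converts both to a cost for a given audio duration in minutes. It assumes usage is prorated exactly with no rounding up to whole minutes or hours, which the table does not state; the function names are illustrative.

```python
WHISPER_USD_PER_MIN = 0.006   # whisper-1, from the table
SCRIBE_USD_PER_HOUR = 0.40    # elevenlabs-scribe-v2, from the table


def whisper_cost_usd(minutes: float) -> float:
    """Cost of whisper-1 for the given audio length (assumes exact proration)."""
    return minutes * WHISPER_USD_PER_MIN


def scribe_cost_usd(minutes: float) -> float:
    """Cost of elevenlabs-scribe-v2, converting minutes to hours first."""
    return minutes / 60 * SCRIBE_USD_PER_HOUR


if __name__ == "__main__":
    for minutes in (10, 90):
        print(minutes, round(whisper_cost_usd(minutes), 4), round(scribe_cost_usd(minutes), 4))
```

On an hourly basis `whisper-1` works out to 60 × $0.006 = $0.36, slightly below the $0.40 listed for `elevenlabs-scribe-v2`.
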

| Model ID | Provider | Price per 1M chars |
|---|---|---|
| elevenlabs-tts-multilingual | ElevenLabs | $170.00 |
| elevenlabs-tts-v3 | ElevenLabs | $170.00 |

| Model ID | Provider | Price |
|---|---|---|
| dall-e-3 | OpenAI | $0.12 per image |
| imagen-4 | | $0.04 per image |
| imagen-4-ultra | | $0.06 per image |
| imagen-4-fast | | $0.02 per image |
| gemini-3-pro-image-preview | | Token-based |
| gemini-3-1-flash-image-preview | | Token-based |
| gemini-2.5-flash-image | | Token-based |


Send a GET request with your Firmware API key in the Authorization header. The response lists the supported model IDs.

A 401 response means the API key is missing or invalid. A 429 response means you hit a usage or plan limit.
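
As a rough illustration of the request and the two error cases, here is a minimal sketch using only the Python standard library. The endpoint URL is a hypothetical placeholder (the source does not give one), and sending the key as a `Bearer` token is an assumption about the header format; `build_models_request`, `explain_status`, and `list_models` are names made up for this example.

```python
import json
import urllib.error
import urllib.request

# Hypothetical placeholder: replace with the real endpoint from your account docs.
MODELS_URL = "https://api.example.invalid/v1/models"


def build_models_request(api_key: str, url: str = MODELS_URL) -> urllib.request.Request:
    """Build the GET request with the key in the Authorization header.

    Assumption: the key is sent as a Bearer token.
    """
    return urllib.request.Request(
        url,
        method="GET",
        headers={"Authorization": f"Bearer {api_key}"},
    )


def explain_status(status: int) -> str:
    """Map the two documented error codes to a readable message."""
    if status == 401:
        return "API key is missing or invalid"
    if status == 429:
        return "usage or plan limit reached"
    return f"unexpected HTTP status {status}"


def list_models(api_key: str, url: str = MODELS_URL):
    """Send the request and return the decoded JSON body."""
    try:
        with urllib.request.urlopen(build_models_request(api_key, url), timeout=10) as resp:
            return json.load(resp)
    except urllib.error.HTTPError as err:
        raise RuntimeError(explain_status(err.code)) from err
```

Calling `list_models("YOUR_KEY")` against the real endpoint would either return the parsed response or raise a `RuntimeError` carrying the 401 or 429 explanation. Whether a 429 should be retried depends on whether it reflects a rate limit or a plan cap, which is not specified here.
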