model_router
azure_ai · chat model
Input
$0.1400 / 1M tokens
Output
N/A
Cached input
N/A
Context
N/A
Pricing
| Item | Raw value (per token) | Normalized |
|---|---|---|
| Input | 1.4e-7 | $0.1400 / 1M tokens |
| Output | 0 | N/A |
| Embedding | 1.4e-7 | $0.1400 / 1M tokens |
Token limits
Capabilities
| Capability | Supported |
|---|---|
| Vision | No |
| Function calling | No |
| Parallel function calling | No |
| Tool choice | No |
| Prompt caching | No |
| Reasoning | No |
| Response schema | No |
| System messages | No |
| Audio input | No |
| Audio output | No |
| Web search | No |
| PDF input | No |
| Video input | No |
Similar models
Models with comparable pricing in the same mode (chat).
| Model | Provider | Input | Output | Context | Coding | Features |
|---|---|---|---|---|---|---|
| codestral-embed | vercel_ai_gateway | $0.1500 / 1M tokens | N/A | 0 | N/A | |
| embed-v4.0 | vercel_ai_gateway | $0.1200 / 1M tokens | N/A | 0 | N/A | |
| command-r7b-12-2024 | cohere_chat | $0.1500 / 1M tokens | $0.0375 / 1M tokens | 4.1K | N/A | Tools |
| mistral-embed | vercel_ai_gateway | $0.1000 / 1M tokens | N/A | 0 | N/A | |
| DeepSeek-R1-Distill-Qwen-14B | nscale | $0.0700 / 1M tokens | $0.0700 / 1M tokens | N/A | N/A | |
| baichuan-m2-32b | novita | $0.0700 / 1M tokens | $0.0700 / 1M tokens | 131.1K | N/A |
Sources
| Pricing source | LiteLLM model cost map |
| Synced at | 2026-05-27 |
| Manual review | Not reviewed |
Raw LiteLLM fields
{
"input_cost_per_token": 1.4e-7,
"output_cost_per_token": 0,
"litellm_provider": "azure_ai",
"mode": "chat",
"source": "https://azure.microsoft.com/en-us/pricing/details/ai-services/",
"comment": "Flat cost of $0.14 per M input tokens for Azure AI Foundry Model Router infrastructure. Use pattern: azure_ai/model_router/<deployment-name> where deployment-name is your Azure deployment (e.g., azure-model-router)"
}