Portkey Models is a free, open-source pricing database for LLMs. No API key is required; you can query any model’s pricing directly.
Quick Start
Get pricing for any model with a single API call:
curl https://api.portkey.ai/model-configs/pricing/openai/gpt-4o
Need to discover all models for a provider? Instead of querying each model individually, you can fetch pricing for all models at once:
curl https://configs.portkey.ai/pricing/{provider}.json
Understanding Pricing Units
Prices are in cents per token, not dollars.
| API Response (¢ per token) | Per 1K Tokens | Per 1M Tokens |
| --- | --- | --- |
| 0.003 | $0.03 | $30 |
| 0.00025 | $0.0025 | $2.50 |
| 0.0001 | $0.001 | $1.00 |
Calculate cost in dollars:
const costInDollars = (tokens * priceFromAPI) / 100;
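As a worked example of the conversion above, the sketch below wraps the formula in a function and applies it to a hypothetical request. The function name, the token counts, and the choice of rates (0.00025 and 0.001 cents per token) are illustrative assumptions, not part of the API.

```typescript
// Prices returned by the API are USD cents per token.
const CENTS_PER_DOLLAR = 100;

/** Convert a token count and a per-token price in cents into a dollar cost. */
function costInDollars(tokens: number, priceInCentsPerToken: number): number {
  return (tokens * priceInCentsPerToken) / CENTS_PER_DOLLAR;
}

// Hypothetical request: 1,200 input tokens and 350 output tokens.
const inputCost = costInDollars(1200, 0.00025);
const outputCost = costInDollars(350, 0.001);

// Total request cost in USD, rounded for display.
console.log((inputCost + outputCost).toFixed(6));
```

Keeping the division by 100 in one place makes it harder to mix up cents and dollars when several rates are combined.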
API Reference
Get Model Pricing
Returns pricing configuration for a specific model.
GET https://api.portkey.ai/model-configs/pricing/{provider}/{model}
Path Parameters
provider: Provider identifier. Use lowercase with hyphens. Examples: openai, anthropic, google, azure-openai, bedrock, together-ai, groq, deepseek, x-ai, mistral-ai, cohere, fireworks-ai, perplexity-ai, anyscale, deepinfra, cerebras
model: Model identifier. Use the exact model name as specified by the provider. Examples: gpt-4o, gpt-4-turbo, claude-3-5-sonnet-20241022, claude-3-opus-20240229, gemini-2.0-flash-001, gemini-1.5-pro
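The two path parameters can be combined into a request URL as sketched below. The helper names are hypothetical, percent-encoding the path segments is an assumption about how the service treats special characters, and the code assumes a runtime with a global fetch (a browser or a recent Node.js release).

```typescript
const PRICING_BASE = "https://api.portkey.ai/model-configs/pricing";

/** Build the pricing URL for a provider/model pair (hypothetical helper). */
function pricingUrl(provider: string, model: string): string {
  // Assumption: path segments are percent-encoded in case an identifier
  // contains characters such as ":" or "/".
  return `${PRICING_BASE}/${encodeURIComponent(provider)}/${encodeURIComponent(model)}`;
}

/** Fetch the pricing JSON; the response shape is left untyped in this sketch. */
async function fetchPricing(provider: string, model: string): Promise<unknown> {
  const res = await fetch(pricingUrl(provider, model));
  if (!res.ok) {
    throw new Error(`Pricing lookup failed: HTTP ${res.status}`);
  }
  return res.json();
}
```

For example, `fetchPricing("openai", "gpt-4o")` requests the same URL as the curl command in Quick Start.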
Response Schema
Token-based pricing. All prices in USD cents per token.
Input token pricing. Price in USD cents per token. Example: 0.00025 = $0.0025 per 1K tokens
Output token pricing. Price in USD cents per token.
Prompt cache read pricing (when available). Price in USD cents per token.
Prompt cache write pricing (when available). Price in USD cents per token.
Audio input token pricing (for multimodal models). Price in USD cents per token.
Audio output token pricing (for multimodal models). Price in USD cents per token.
Provider-specific pricing for features beyond tokens. Common units:
web_search — Web search tool usage
file_search — File search tool usage
thinking_token — Chain-of-thought reasoning tokens
image_token — Image generation/processing
video_duration_seconds_* — Video generation (by resolution)
Image generation pricing by quality and size (for image models). Structure: { "quality": { "resolution": { "price": number } } }
Cost calculation formulas for complex pricing scenarios.
Formula for calculating request (input) cost.
Formula for calculating response (output) cost.
Currency code. Always USD.
Fine-tuning pricing (when available).
Example Responses
OpenAI GPT-4o
{
  "pay_as_you_go": {
    "request_token": { "price": 0.00025 },
    "response_token": { "price": 0.001 },
    "cache_write_input_token": { "price": 0 },
    "cache_read_input_token": { "price": 0.000125 },
    "additional_units": {
      "web_search": { "price": 1 },
      "file_search": { "price": 0.25 }
    }
  },
  "calculate": {
    "request": {
      "operation": "sum",
      "operands": [
        { "operation": "multiply", "operands": [{ "value": "input_tokens" }, { "value": "rates.request_token" }] },
        { "operation": "multiply", "operands": [{ "value": "cache_write_tokens" }, { "value": "rates.cache_write_input_token" }] },
        { "operation": "multiply", "operands": [{ "value": "cache_read_tokens" }, { "value": "rates.cache_read_input_token" }] }
      ]
    },
    "response": {
      "operation": "multiply",
      "operands": [{ "value": "output_tokens" }, { "value": "rates.response_token" }]
    }
  },
  "currency": "USD"
}
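The calculate object in the response above is a small expression tree, and one way to apply it is a recursive evaluator like the sketch below. It rests on assumptions the page does not spell out: a value starting with "rates." is taken to refer to the matching price under pay_as_you_go, any other value is taken to be a usage counter supplied by the caller, and only the sum and multiply operations seen in the example are handled. All type and function names are hypothetical.

```typescript
type Formula =
  | { value: string }
  | { operation: "sum" | "multiply"; operands: Formula[] };

type Rates = Record<string, { price: number }>;
type Usage = Record<string, number>;

/** Evaluate a formula tree; the result is in USD cents. */
function evaluate(node: Formula, usage: Usage, rates: Rates): number {
  if ("value" in node) {
    // Assumption: "rates.<name>" looks up a price; anything else is a usage
    // counter. Missing entries are treated as 0 in this sketch.
    if (node.value.startsWith("rates.")) {
      return rates[node.value.slice("rates.".length)]?.price ?? 0;
    }
    return usage[node.value] ?? 0;
  }
  const parts = node.operands.map((operand) => evaluate(operand, usage, rates));
  return node.operation === "sum"
    ? parts.reduce((a, b) => a + b, 0)
    : parts.reduce((a, b) => a * b, 1);
}

// Token rates and request formula copied from the example response.
const rates: Rates = {
  request_token: { price: 0.00025 },
  response_token: { price: 0.001 },
  cache_write_input_token: { price: 0 },
  cache_read_input_token: { price: 0.000125 },
};
const requestFormula: Formula = {
  operation: "sum",
  operands: [
    { operation: "multiply", operands: [{ value: "input_tokens" }, { value: "rates.request_token" }] },
    { operation: "multiply", operands: [{ value: "cache_write_tokens" }, { value: "rates.cache_write_input_token" }] },
    { operation: "multiply", operands: [{ value: "cache_read_tokens" }, { value: "rates.cache_read_input_token" }] },
  ],
};
const responseFormula: Formula = {
  operation: "multiply",
  operands: [{ value: "output_tokens" }, { value: "rates.response_token" }],
};

// Hypothetical usage counters for one request.
const usage: Usage = { input_tokens: 1000, cache_write_tokens: 0, cache_read_tokens: 2000, output_tokens: 500 };
const requestCents = evaluate(requestFormula, usage, rates);
const responseCents = evaluate(responseFormula, usage, rates);
console.log(((requestCents + responseCents) / 100).toFixed(4)); // total in USD
```

Evaluating the tree rather than hard-coding the arithmetic means a model with a different formula needs no code change, provided it uses only the operations handled here.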
Get Model Configuration
Returns general configuration and capabilities for a specific model.
GET https://api.portkey.ai/model-configs/general/{provider}/{model}
Path Parameters
provider: Provider identifier (e.g., openai, anthropic, google)
model: Model identifier (e.g., gpt-4o, claude-3-opus)
Response Schema
Model parameters with their constraints.
Parameter name (e.g., max_tokens, temperature, top_p)
Default value for the parameter.
Parameter type (e.g., boolean, string, array-of-strings)
Available options for enum-type parameters.
Model type classification.
Primary model type: chat, text, embedding, image, audio, moderation
Supported features: tools, image, cache_control, json_mode
Message configuration.
Supported message roles: system, user, assistant, developer
If true, model is not available in Portkey playground.
If true, this is the default model for the provider.
Example Responses
OpenAI GPT-4o
{
  "params": [
    {
      "key": "max_tokens",
      "maxValue": 16384
    },
    {
      "key": "response_format",
      "defaultValue": null,
      "options": [
        { "value": null, "name": "Text" },
        { "value": "json_object", "name": "JSON Object" },
        { "value": "json_schema", "name": "JSON Schema" }
      ],
      "type": "string"
    },
    {
      "key": "temperature",
      "minValue": 0,
      "maxValue": 2,
      "defaultValue": 1
    }
  ],
  "type": {
    "primary": "chat",
    "supported": ["tools", "image"]
  }
}
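The params array above can be used to check a request parameter before sending it, as in the sketch below. The interface mirrors only the fields visible in the example response, the function name is hypothetical, and treating minValue and maxValue as inclusive bounds is an assumption.

```typescript
interface ParamSpec {
  key: string;
  minValue?: number;
  maxValue?: number;
  defaultValue?: unknown;
  options?: { value: unknown; name: string }[];
  type?: string;
}

/** Return an error message, or null when the value satisfies the constraints. */
function checkParam(specs: ParamSpec[], key: string, value: unknown): string | null {
  const spec = specs.find((s) => s.key === key);
  if (!spec) return `unknown parameter "${key}"`;
  if (typeof value === "number") {
    // Assumption: bounds are inclusive.
    if (spec.minValue !== undefined && value < spec.minValue) return `${key} is below ${spec.minValue}`;
    if (spec.maxValue !== undefined && value > spec.maxValue) return `${key} is above ${spec.maxValue}`;
  }
  if (spec.options && !spec.options.some((o) => o.value === value)) {
    return `${key} must be one of the listed options`;
  }
  return null;
}

// Parameter specs copied from the example response.
const params: ParamSpec[] = [
  { key: "max_tokens", maxValue: 16384 },
  {
    key: "response_format",
    defaultValue: null,
    options: [
      { value: null, name: "Text" },
      { value: "json_object", name: "JSON Object" },
      { value: "json_schema", name: "JSON Schema" },
    ],
    type: "string",
  },
  { key: "temperature", minValue: 0, maxValue: 2, defaultValue: 1 },
];

console.log(checkParam(params, "temperature", 2.5));
console.log(checkParam(params, "response_format", "json_object"));
```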
Get All Models for a Provider
Returns pricing configuration for all models from a specific provider in a single response.
GET https://configs.portkey.ai/pricing/{provider}.json
Use this endpoint to discover all available models and their pricing for a provider, instead of querying each model individually.
Path Parameters
provider: Provider identifier. Use lowercase with hyphens. Examples: openai, anthropic, google, bedrock, azure-openai, together-ai, groq, deepseek, x-ai, mistral-ai
Response Schema
Returns a JSON object where:
Keys are model identifiers
Values contain pricing_config objects with the same structure as the individual model pricing endpoint
A default key provides the base pricing template
Pricing configuration for a specific model. Contains the same pricing structure as the individual model endpoint: pay_as_you_go, calculate, currency, etc.
Example Request
curl --request GET \
  --url https://configs.portkey.ai/pricing/bedrock.json
Sample Response (Bedrock)
{
  "default": {
    "pricing_config": {
      "pay_as_you_go": {
        "request_token": { "price": 0 },
        "response_token": { "price": 0 }
      },
      "calculate": {
        "request": {
          "operation": "multiply",
          "operands": [
            { "value": "input_tokens" },
            { "value": "rates.request_token" }
          ]
        },
        "response": {
          "operation": "multiply",
          "operands": [
            { "value": "output_tokens" },
            { "value": "rates.response_token" }
          ]
        }
      },
      "currency": "USD"
    }
  },
  "anthropic.claude-3-5-sonnet-20241022-v2:0": {
    "pricing_config": {
      "pay_as_you_go": {
        "request_token": { "price": 0.0003 },
        "response_token": { "price": 0.0015 },
        "cache_write_input_token": { "price": 0.000375 },
        "cache_read_input_token": { "price": 0.00003 }
      }
    }
  },
  "anthropic.claude-3-haiku-20240307-v1:0": {
    "pricing_config": {
      "pay_as_you_go": {
        "request_token": { "price": 0.000025 },
        "response_token": { "price": 0.000125 }
      }
    }
  },
  "meta.llama3-1-405b-instruct-v1:0": {
    "pricing_config": {
      "pay_as_you_go": {
        "request_token": { "price": 0.000532 },
        "response_token": { "price": 0.0016 }
      }
    }
  },
  "amazon.nova-pro-v1:0": {
    "pricing_config": {
      "pay_as_you_go": {
        "request_token": { "price": 0.00008 },
        "response_token": { "price": 0.00002 },
        "cache_read_input_token": { "price": 0.00004 },
        "cache_write_input_token": { "price": 0.00016 }
      }
    }
  }
}
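In the sample above, the model entries omit calculate and currency while the default entry carries them. The sketch below lists the model identifiers and fills a model's missing fields from default. Overlaying the model entry on top of default is an assumption about how the base template is meant to be used; the page only says that default provides the base pricing template. The type and function names are hypothetical.

```typescript
interface PricingConfig {
  pay_as_you_go?: Record<string, { price: number }>;
  calculate?: unknown;
  currency?: string;
}
type ProviderPricing = Record<string, { pricing_config: PricingConfig }>;

/** All model identifiers in a provider file, excluding the "default" template. */
function listModels(all: ProviderPricing): string[] {
  return Object.keys(all).filter((key) => key !== "default");
}

/** Assumption: a model's config is the default template overlaid with its own fields. */
function resolveConfig(all: ProviderPricing, model: string): PricingConfig | undefined {
  const entry = all[model];
  if (!entry) return undefined;
  const base = all["default"]?.pricing_config ?? {};
  return {
    ...base,
    ...entry.pricing_config,
    pay_as_you_go: { ...base.pay_as_you_go, ...entry.pricing_config.pay_as_you_go },
  };
}

// Trimmed copy of the Bedrock sample response.
const bedrock: ProviderPricing = {
  default: {
    pricing_config: {
      pay_as_you_go: { request_token: { price: 0 }, response_token: { price: 0 } },
      currency: "USD",
    },
  },
  "anthropic.claude-3-haiku-20240307-v1:0": {
    pricing_config: {
      pay_as_you_go: { request_token: { price: 0.000025 }, response_token: { price: 0.000125 } },
    },
  },
};

console.log(listModels(bedrock));
console.log(resolveConfig(bedrock, "anthropic.claude-3-haiku-20240307-v1:0"));
```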
Additional Units Reference
Provider-specific pricing for features beyond standard token costs:
| Unit | Description | Providers | Price Range (¢) |
| --- | --- | --- | --- |
| web_search | Web search tool usage | OpenAI, Azure, Google, Vertex AI, Perplexity | 0.5 - 3.5 |
| file_search | File search tool usage | OpenAI, Azure | 0.25 |
| search | Google search grounding | Google, Vertex AI | 1.4 - 3.5 |
| thinking_token | Chain-of-thought reasoning | Google, Vertex AI | 0.00004 - 0.0012 |
| image_token | Image processing tokens | Google, Vertex AI | 0.003 |
| image_1k | Image generation (1K units) | Google | 3.9 |
| megapixels | Image generation by megapixel | Together AI | 0.0027 - 0.08 |
| video_seconds | Video generation | Vertex AI | 10 - 50 |
| video_duration_seconds_720_1280 | Video (720p) | OpenAI Sora | 10 - 30 |
| video_duration_seconds_1080_1920 | Video (1080p) | OpenAI Sora | 50 |
| routing_units | Azure OpenAI routing | Azure OpenAI | 0.000014 |
| input_image | Image input | Vertex AI | 0.01 |
| input_video_essential | Video input (essential) | Vertex AI | 0.05 |
| input_video_standard | Video input (standard) | Vertex AI | 0.1 |
| input_video_plus | Video input (plus) | Vertex AI | 0.2 |
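A rough sketch of how additional-unit charges might be added to a request's cost follows. It assumes each price is in cents per single unit of usage (for instance, one web search call), which the table's cent-denominated column suggests but does not state outright. The function name and the usage counts are hypothetical; the two rates are taken from the GPT-4o example response.

```typescript
type UnitRates = Record<string, { price: number }>;

/** Sum count × price over the units used; the result is in USD cents. */
function additionalUnitsCostCents(unitsUsed: Record<string, number>, unitRates: UnitRates): number {
  let total = 0;
  for (const [unit, count] of Object.entries(unitsUsed)) {
    const rate = unitRates[unit];
    if (rate === undefined) {
      throw new Error(`No price listed for unit "${unit}"`);
    }
    total += count * rate.price;
  }
  return total;
}

// Rates from the additional_units object in the GPT-4o example response.
const unitRates: UnitRates = {
  web_search: { price: 1 },
  file_search: { price: 0.25 },
};

// Hypothetical usage: three web searches and two file searches.
const extraCents = additionalUnitsCostCents({ web_search: 3, file_search: 2 }, unitRates);
console.log((extraCents / 100).toFixed(4)); // extra cost in USD
```

Throwing on an unknown unit, rather than treating it as free, keeps a missing price from silently understating the cost.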
Perplexity Context-Based Pricing
Perplexity has context-dependent web search pricing:
| Unit | Price (¢) |
| --- | --- |
| web_search_low_context | 0.5 - 0.6 |
| web_search_medium_context | 0.8 - 1.0 |
| web_search_high_context | 1.2 - 1.4 |
Supported Providers
AI21, Anthropic, Anyscale, Azure AI, Azure OpenAI, AWS Bedrock, Cerebras, Cohere, Dashscope, Deepbricks, DeepInfra, DeepSeek, Fireworks AI, GitHub, Google, Groq, Inference.net, Jina, Lambda, Lemonfox AI, Mistral AI, MonsterAPI, Nebius, Nomic, Novita AI, OpenAI, OpenRouter, Oracle, PaLM, Perplexity AI, Predibase, Reka AI, Sagemaker, Segmind, Stability AI, Together AI, Vertex AI, Workers AI, X.AI, Zhipu
Use with Portkey
Portkey Models powers automatic cost tracking for all requests through the Portkey Gateway. When you make requests via Portkey, costs are calculated automatically using this pricing data.
If you have negotiated enterprise rates, you can override the default pricing:
Custom Pricing: Set custom input/output costs to match your contracts.