Chat

Available Providers
ProviderModelVersionPriceBilling unit
amazonamazon.nova-lite-v1:0llmengine (v2)0.24 (per 1000000 token)1 token
amazonamazon.nova-micro-v1:0llmengine (v2)0.14 (per 1000000 token)1 token
amazonamazon.nova-pro-v1:0llmengine (v2)3.2 (per 1000000 token)1 token
anthropicclaude-opus-4-6v15e-06 (per 1 token)1 token
anthropicclaude-sonnet-4-20250514v11.5e-05 (per 1 token)1 token
anthropicclaude-opus-4-1-20250805v17.5e-05 (per 1 token)1 token
anthropicclaude-opus-4-5v12.5e-05 (per 1 token)1 token
anthropicclaude-3-7-sonnet-latestv11.5e-05 (per 1 token)1 token
anthropicclaude-3-5-haiku-20241022v14e-06 (per 1 token)1 token
anthropicclaude-3-5-haiku-latestv15e-06 (per 1 token)1 token
anthropicclaude-haiku-4-5-20251001v15e-06 (per 1 token)1 token
anthropicclaude-haiku-4-5v15e-06 (per 1 token)1 token
anthropicclaude-3-7-sonnet-20250219v11.5e-05 (per 1 token)1 token
anthropicclaude-3-haiku-20240307v11.25e-06 (per 1 token)1 token
anthropicclaude-4-opus-20250514v17.5e-05 (per 1 token)1 token
anthropicclaude-4-sonnet-20250514v11.5e-05 (per 1 token)1 token
anthropicclaude-sonnet-4-5v11.5e-05 (per 1 token)1 token
anthropicclaude-sonnet-4-5-20250929v11.5e-05 (per 1 token)1 token
anthropicclaude-opus-4-1v17.5e-05 (per 1 token)1 token
anthropicclaude-opus-4-20250514v17.5e-05 (per 1 token)1 token
anthropicclaude-opus-4-5-20251101v12.5e-05 (per 1 token)1 token
coherecommand-r7b-12-2024llmengine (v2)0.15 (per 1000000 token)1 token
coherecommand-r-08-2024llmengine (v2)0.6 (per 1000000 token)1 token
deepseekdeepseek-chatllmengine (v2)1.1e-06 (per 1 token)1 token
deepseekdeepseek-reasonerllmengine (v2)2.19e-06 (per 1 token)1 token
deepseekdeepseek-coderllmengine (v2)2.8e-07 (per 1 token)1 token
metameta.llama3-1-405b-instruct-v1:0llmengine (v2)2.4 (per 1000000 token)1 token
metameta.llama3-1-70b-instruct-v1:0llmengine (v2)0.72 (per 1000000 token)1 token
metameta.llama3-1-8b-instruct-v1:0llmengine (v2)0.22 (per 1000000 token)1 token
mistralmagistral-medium-2506llmengine (v2)5e-06 (per 1 token)1 token
mistralmagistral-small-2506llmengine (v2)1.5e-06 (per 1 token)1 token
mistralmistral-medium-latestllmengine (v2)2e-06 (per 1 token)1 token
mistralpixtral-large-latestllmengine (v2)6e-06 (per 1 token)1 token
mistralmistral-small-latestllmengine (v2)3e-07 (per 1 token)1 token
mistralcodestral-latestllmengine (v2)3e-06 (per 1 token)1 token
mistralmistral-large-latestllmengine (v2)6e-06 (per 1 token)1 token
mistralcodestral-2405llmengine (v2)3e-06 (per 1 token)1 token
mistralcodestral-2508llmengine (v2)9e-07 (per 1 token)1 token
mistraldevstral-medium-2507llmengine (v2)2e-06 (per 1 token)1 token
mistraldevstral-small-2505llmengine (v2)3e-07 (per 1 token)1 token
mistraldevstral-small-2507llmengine (v2)3e-07 (per 1 token)1 token
mistrallabs-devstral-small-2512llmengine (v2)3e-07 (per 1 token)1 token
mistraldevstral-2512llmengine (v2)2e-06 (per 1 token)1 token
mistralmagistral-medium-2509llmengine (v2)5e-06 (per 1 token)1 token
mistralmagistral-medium-latestllmengine (v2)5e-06 (per 1 token)1 token
mistralmagistral-small-latestllmengine (v2)1.5e-06 (per 1 token)1 token
mistralmistral-large-2402llmengine (v2)1.2e-05 (per 1 token)1 token
mistralmistral-large-2407llmengine (v2)9e-06 (per 1 token)1 token
mistralmistral-large-2411llmengine (v2)6e-06 (per 1 token)1 token
mistralmistral-mediumllmengine (v2)8.1e-06 (per 1 token)1 token
mistralmistral-medium-2312llmengine (v2)8.1e-06 (per 1 token)1 token
mistralmistral-medium-2505llmengine (v2)2e-06 (per 1 token)1 token
mistralmistral-smallllmengine (v2)3e-07 (per 1 token)1 token
mistralmistral-tinyllmengine (v2)2.5e-07 (per 1 token)1 token
mistralopen-mistral-7bllmengine (v2)2.5e-07 (per 1 token)1 token
mistralopen-mistral-nemollmengine (v2)3e-07 (per 1 token)1 token
mistralopen-mistral-nemo-2407llmengine (v2)3e-07 (per 1 token)1 token
mistralopen-mixtral-8x22bllmengine (v2)6e-06 (per 1 token)1 token
mistralopen-mixtral-8x7bllmengine (v2)7e-07 (per 1 token)1 token
mistralpixtral-12b-2409llmengine (v2)1.5e-07 (per 1 token)1 token
mistralpixtral-large-2411llmengine (v2)6e-06 (per 1 token)1 token
openaio4-minillmengine (v2)4.4e-06 (per 1 token)1 token
openaigpt-4.1-mini-2025-04-14llmengine (v2)1.6e-06 (per 1 token)1 token
openaigpt-4.1-nano-2025-04-14llmengine (v2)4e-07 (per 1 token)1 token
openaigpt-5.1llmengine (v2)1e-05 (per 1 token)1 token
openaigpt-5llmengine (v2)1e-05 (per 1 token)1 token
openaigpt-5-minillmengine (v2)2e-06 (per 1 token)1 token
openaigpt-5-nanollmengine (v2)4e-07 (per 1 token)1 token
openaigpt-5-chat-latestllmengine (v2)1e-05 (per 1 token)1 token
openaigpt-5-prollmengine (v2)100.0 (per 1000000 token)1 token
openaio4-mini-2025-04-16llmengine (v2)4.4e-06 (per 1 token)1 token
openaio3llmengine (v2)8e-06 (per 1 token)1 token
openaigpt-4llmengine (v2)6e-05 (per 1 token)1 token
openaigpt-4ollmengine (v2)1e-05 (per 1 token)1 token
openaigpt-4o-minillmengine (v2)6e-07 (per 1 token)1 token
openaigpt-4o-2024-05-13llmengine (v2)1.5e-05 (per 1 token)1 token
openaigpt-4-turbollmengine (v2)3e-05 (per 1 token)1 token
openaio1-2024-12-17llmengine (v2)6e-05 (per 1 token)1 token
openaio1llmengine (v2)6e-05 (per 1 token)1 token
openaio3-minillmengine (v2)4.4e-06 (per 1 token)1 token
openaio3-mini-2025-01-31llmengine (v2)4.4e-06 (per 1 token)1 token
openaigpt-4o-2024-08-06llmengine (v2)1e-05 (per 1 token)1 token
openaigpt-4o-mini-2024-07-18llmengine (v2)6e-07 (per 1 token)1 token
openaigpt-3.5-turbollmengine (v2)1.5e-06 (per 1 token)1 token
openaigpt-5.2llmengine (v2)1.4e-05 (per 1 token)1 token
openaichatgpt-4o-latestllmengine (v2)1.5e-05 (per 1 token)1 token
openaigpt-3.5-turbo-0125llmengine (v2)1.5e-06 (per 1 token)1 token
openaigpt-3.5-turbo-1106llmengine (v2)2e-06 (per 1 token)1 token
openaigpt-3.5-turbo-16kllmengine (v2)4e-06 (per 1 token)1 token
openaigpt-4-0125-previewllmengine (v2)3e-05 (per 1 token)1 token
openaigpt-4-0314llmengine (v2)6e-05 (per 1 token)1 token
openaigpt-4-0613llmengine (v2)6e-05 (per 1 token)1 token
openaigpt-4-1106-previewllmengine (v2)3e-05 (per 1 token)1 token
openaigpt-4-turbo-2024-04-09llmengine (v2)3e-05 (per 1 token)1 token
openaigpt-4-turbo-previewllmengine (v2)3e-05 (per 1 token)1 token
openaigpt-4.1llmengine (v2)8e-06 (per 1 token)1 token
openaigpt-4.1-2025-04-14llmengine (v2)8e-06 (per 1 token)1 token
openaigpt-4.1-minillmengine (v2)1.6e-06 (per 1 token)1 token
openaigpt-4.1-nanollmengine (v2)4e-07 (per 1 token)1 token
openaigpt-4o-2024-11-20llmengine (v2)1e-05 (per 1 token)1 token
openaigpt-4o-mini-search-previewllmengine (v2)6e-07 (per 1 token)1 token
openaigpt-4o-mini-search-preview-2025-03-11llmengine (v2)6e-07 (per 1 token)1 token
openaigpt-4o-search-previewllmengine (v2)1e-05 (per 1 token)1 token
openaigpt-4o-search-preview-2025-03-11llmengine (v2)1e-05 (per 1 token)1 token
openaigpt-5.1-2025-11-13llmengine (v2)1e-05 (per 1 token)1 token
openaigpt-5.1-chat-latestllmengine (v2)1e-05 (per 1 token)1 token
openaigpt-5.2-2025-12-11llmengine (v2)1.4e-05 (per 1 token)1 token
openaigpt-5.2-chat-latestllmengine (v2)1.4e-05 (per 1 token)1 token
openaigpt-5-2025-08-07llmengine (v2)1e-05 (per 1 token)1 token
openaigpt-5-mini-2025-08-07llmengine (v2)2e-06 (per 1 token)1 token
openaigpt-5-nano-2025-08-07llmengine (v2)4e-07 (per 1 token)1 token
openaio3-2025-04-16llmengine (v2)8e-06 (per 1 token)1 token
together_aiQwen/Qwen2.5-72B-Instruct-Turbollmengine (v2)1.2e-06 (per 1 token)1 token
together_aimeta-llama/Meta-Llama-3.1-8B-Instruct-Turbollmengine (v2)1.8e-07 (per 1 token)1 token
together_aimeta-llama/Meta-Llama-3.1-70B-Instruct-Turbollmengine (v2)8.8e-07 (per 1 token)1 token
together_aiQwen/Qwen2.5-72B-Instruct-Turbollmengine (v2)1.2 (per 1000000 token)1 token
together_aimeta-llama/Llama-3.3-70B-Instruct-Turbollmengine (v2)8.8e-07 (per 1 token)1 token
together_aimeta-llama/Llama-3.3-70B-Instruct-Turbollmengine (v2)8.8e-07 (per 1 token)1 token
together_aimistralai/Mixtral-8x7B-Instruct-v0.1llmengine (v2)6e-07 (per 1 token)1 token
together_aiopenai/gpt-oss-120bllmengine (v2)6e-07 (per 1 token)1 token
together_aiQwen/Qwen3-235B-A22B-Thinking-2507llmengine (v2)3e-06 (per 1 token)1 token
together_aiQwen/Qwen3-235B-A22B-fp8-tputllmengine (v2)6e-07 (per 1 token)1 token
together_aiQwen/Qwen3-Coder-480B-A35B-Instruct-FP8llmengine (v2)2e-06 (per 1 token)1 token
together_aideepseek-ai/DeepSeek-R1llmengine (v2)7e-06 (per 1 token)1 token
together_aideepseek-ai/DeepSeek-R1-0528-tputllmengine (v2)2.19e-06 (per 1 token)1 token
together_aideepseek-ai/DeepSeek-V3llmengine (v2)1.25e-06 (per 1 token)1 token
together_aideepseek-ai/DeepSeek-V3.1llmengine (v2)1.7e-06 (per 1 token)1 token
together_aimeta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8llmengine (v2)8.5e-07 (per 1 token)1 token
together_aimeta-llama/Llama-4-Scout-17B-16E-Instructllmengine (v2)5.9e-07 (per 1 token)1 token
together_aimeta-llama/Meta-Llama-3.1-405B-Instruct-Turbollmengine (v2)3.5e-06 (per 1 token)1 token
together_aimeta-llama/Meta-Llama-3.1-70B-Instruct-Turbollmengine (v2)8.8e-07 (per 1 token)1 token
together_aimeta-llama/Meta-Llama-3.1-8B-Instruct-Turbollmengine (v2)1.8e-07 (per 1 token)1 token
together_aimistralai/Mixtral-8x7B-Instruct-v0.1llmengine (v2)6e-07 (per 1 token)1 token
together_aimoonshotai/Kimi-K2-Instructllmengine (v2)3e-06 (per 1 token)1 token
together_aiopenai/gpt-oss-20bllmengine (v2)2e-07 (per 1 token)1 token
together_aizai-org/GLM-4.5-Air-FP8llmengine (v2)1.1e-06 (per 1 token)1 token
together_aizai-org/GLM-4.6llmengine (v2)2.2e-06 (per 1 token)1 token
together_aimoonshotai/Kimi-K2-Instruct-0905llmengine (v2)3e-06 (per 1 token)1 token
together_aiQwen/Qwen3-Next-80B-A3B-Instructllmengine (v2)1.5e-06 (per 1 token)1 token
together_aiQwen/Qwen3-Next-80B-A3B-Thinkingllmengine (v2)1.5e-06 (per 1 token)1 token
together_aiQwen/Qwen3-235B-A22B-Instruct-2507-tputllmengine (v2)6e-06 (per 1 token)1 token
together_aimeta-llama/Meta-Llama-3.1-405B-Instruct-Turbollmengine (v2)3.5e-06 (per 1 token)1 token
xaigrok-2-vision-1212llmengine (v2)1e-05 (per 1 token)1 token
xaigrok-4-0709llmengine (v2)1.5e-05 (per 1 token)1 token
xaigrok-4-0709llmengine (v2)1.5e-05 (per 1 token)1 token
xaigrok-2-vision-1212llmengine (v2)1e-05 (per 1 token)1 token
xaigrok-2-vision-latestllmengine (v2)1e-05 (per 1 token)1 token
xaigrok-2-visionllmengine (v2)1e-05 (per 1 token)1 token
xaigrok-2-visionllmengine (v2)1e-05 (per 1 token)1 token
xaigrok-2-vision-latestllmengine (v2)1e-05 (per 1 token)1 token
xaigrok-3llmengine (v2)1.5e-05 (per 1 token)1 token
xaigrok-3-betallmengine (v2)1.5e-05 (per 1 token)1 token
xaigrok-3-fast-betallmengine (v2)2.5e-05 (per 1 token)1 token
xaigrok-3-fast-latestllmengine (v2)2.5e-05 (per 1 token)1 token
xaigrok-3-latestllmengine (v2)1.5e-05 (per 1 token)1 token
xaigrok-3-minillmengine (v2)5e-07 (per 1 token)1 token
xaigrok-3-mini-betallmengine (v2)5e-07 (per 1 token)1 token
xaigrok-3-mini-fastllmengine (v2)4e-06 (per 1 token)1 token
xaigrok-3-mini-fast-betallmengine (v2)4e-06 (per 1 token)1 token
xaigrok-3-mini-fast-latestllmengine (v2)4e-06 (per 1 token)1 token
xaigrok-3-mini-latestllmengine (v2)5e-07 (per 1 token)1 token
xaigrok-4llmengine (v2)1.5e-05 (per 1 token)1 token
xaigrok-4-fast-reasoningllmengine (v2)5e-07 (per 1 token)1 token
xaigrok-4-fast-non-reasoningllmengine (v2)5e-07 (per 1 token)1 token
xaigrok-4-latestllmengine (v2)1.5e-05 (per 1 token)1 token
xaigrok-4-1-fastllmengine (v2)5e-07 (per 1 token)1 token
xaigrok-4-1-fast-reasoningllmengine (v2)5e-07 (per 1 token)1 token
xaigrok-4-1-fast-reasoning-latestllmengine (v2)5e-07 (per 1 token)1 token
xaigrok-4-1-fast-non-reasoningllmengine (v2)5e-07 (per 1 token)1 token
xaigrok-4-1-fast-non-reasoning-latestllmengine (v2)5e-07 (per 1 token)1 token
xaigrok-code-fastllmengine (v2)1.5e-06 (per 1 token)1 token
xaigrok-code-fast-1llmengine (v2)1.5e-06 (per 1 token)1 token
xaigrok-code-fast-1-0825llmengine (v2)1.5e-06 (per 1 token)1 token
googlegemini-2.5-flashllmengine (v2)2.5e-06 (per 1 token)1 token
googlegemini-3-pro-previewllmengine (v2)1.2e-05 (per 1 token)1 token
googlegemini-2.5-flash-image-previewllmengine (v2)2.5e-06 (per 1 token)1 token
googlegemini-3.1-pro-previewllmengine (v2)1.2e-05 (per 1 token)1 token
googlegemini-2.5-flash-litellmengine (v2)4e-07 (per 1 token)1 token
googlegemini-3-pro-image-previewllmengine (v2)12.0 (per 1000000 token)1 token
googlegemini-3-flash-previewllmengine (v2)3e-06 (per 1 token)1 token
googlegemini-2.0-flash-litellmengine (v2)3e-07 (per 1 token)1 token
googlegemini-2.0-flashllmengine (v2)4e-07 (per 1 token)1 token
googlegemini-2.0-flash-001llmengine (v2)4e-07 (per 1 token)1 token
googlegemini-2.5-flash-lite-preview-09-2025llmengine (v2)4e-07 (per 1 token)1 token
googlegemini-2.5-flash-preview-09-2025llmengine (v2)2.5e-06 (per 1 token)1 token
googlegemini-flash-latestllmengine (v2)2.5e-06 (per 1 token)1 token
googlegemini-flash-lite-latestllmengine (v2)4e-07 (per 1 token)1 token
googlegemini-2.5-prollmengine (v2)1e-05 (per 1 token)1 token
googlegemma-3-27b-itllmengine (v2)0.0 (per 1 token)1 token
groqllama-3.1-8b-instantv18e-08 (per 1 token)1 token
groqopenai/gpt-oss-120bv17.5e-07 (per 1 token)1 token
groqllama-3.3-70b-versatilev17.9e-07 (per 1 token)1 token
groqllama-3.3-70b-versatilev17.9e-07 (per 1 token)1 token
groqllama-3.1-8b-instantv18e-08 (per 1 token)1 token
groqmeta-llama/llama-guard-4-12bv12e-07 (per 1 token)1 token
groqmeta-llama/llama-4-maverick-17b-128e-instructv16e-07 (per 1 token)1 token
groqmeta-llama/llama-4-scout-17b-16e-instructv13.4e-07 (per 1 token)1 token
groqmoonshotai/kimi-k2-instruct-0905v13e-06 (per 1 token)1 token
groqopenai/gpt-oss-20bv15e-07 (per 1 token)1 token
groqqwen/qwen3-32bv15.9e-07 (per 1 token)1 token
microsoftgpt-4oAzure AI Foundry5.0 (per 1000000 token)1 token
microsofto3-miniAzure AI Foundry4.4 (per 1000000 token)1 token
microsofto1-miniAzure AI Foundry12.0 (per 1000000 token)1 token
microsoftgpt-4o-miniAzure AI Foundry0.66 (per 1000000 token)1 token
microsoftgpt-4Azure AI Foundry60.0 (per 1000000 token)1 token
microsoftgpt-35-turbo-16kAzure AI Foundry4.0 (per 1000000 token)1 token
microsoftgpt-35-turboAzure AI Foundry1.5 (per 1000000 token)1 token
minimaxminimax-m1v12.2 (per 1000000 token)1 token
minimaxminimax-text-01v11.1 (per 1000000 token)1 token
minimaxMiniMax-M2.1v11.2e-06 (per 1 token)1 token
minimaxMiniMax-M2.1-lightningv12.4e-06 (per 1 token)1 token
minimaxMiniMax-M2v11.2e-06 (per 1 token)1 token
bytedanceseed-1-6-250915llmengine (v2)2.0 (per 1000000 token)1 token
perplexityaisonarllmengine (v2)1.0 (per 1000000 token)1 token
perplexityaisonarllmengine (v2)1e-06 (per 1 token)1 token
perplexityaisonar-prollmengine (v2)1.5e-05 (per 1 token)1 token
perplexityaisonar-deep-researchllmengine (v2)8e-06 (per 1 token)1 token
perplexityaisonar-prollmengine (v2)1.5e-05 (per 1 token)1 token
perplexityaisonar-reasoning-prollmengine (v2)8e-06 (per 1 token)1 token
deepinfraGryphe/MythoMax-L2-13bv19e-08 (per 1 token)1 token
deepinfraNousResearch/Hermes-3-Llama-3.1-405Bv11e-06 (per 1 token)1 token
deepinfraNousResearch/Hermes-3-Llama-3.1-70Bv13e-07 (per 1 token)1 token
deepinfraQwen/QwQ-32Bv14e-07 (per 1 token)1 token
deepinfraQwen/Qwen2.5-72B-Instructv13.9e-07 (per 1 token)1 token
deepinfraQwen/Qwen2.5-7B-Instructv11e-07 (per 1 token)1 token
deepinfraQwen/Qwen2.5-VL-32B-Instructv16e-07 (per 1 token)1 token
deepinfraQwen/Qwen3-14Bv12.4e-07 (per 1 token)1 token
deepinfraQwen/Qwen3-235B-A22Bv15.4e-07 (per 1 token)1 token
deepinfraQwen/Qwen3-235B-A22B-Instruct-2507v16e-07 (per 1 token)1 token
deepinfraQwen/Qwen3-235B-A22B-Thinking-2507v12.9e-06 (per 1 token)1 token
deepinfraQwen/Qwen3-30B-A3Bv12.9e-07 (per 1 token)1 token
deepinfraQwen/Qwen3-32Bv12.8e-07 (per 1 token)1 token
deepinfraQwen/Qwen3-Coder-480B-A35B-Instructv11.6e-06 (per 1 token)1 token
deepinfraQwen/Qwen3-Coder-480B-A35B-Instruct-Turbov11.2e-06 (per 1 token)1 token
deepinfraQwen/Qwen3-Next-80B-A3B-Instructv11.4e-06 (per 1 token)1 token
deepinfraSao10K/L3-8B-Lunaris-v1-Turbov15e-08 (per 1 token)1 token
deepinfraSao10K/L3.1-70B-Euryale-v2.2v17.5e-07 (per 1 token)1 token
deepinfraSao10K/L3.3-70B-Euryale-v2.3v17.5e-07 (per 1 token)1 token
deepinfraanthropic/claude-3-7-sonnet-latestv11.65e-05 (per 1 token)1 token
deepinfraanthropic/claude-4-opusv18.25e-05 (per 1 token)1 token
deepinfraanthropic/claude-4-sonnetv11.65e-05 (per 1 token)1 token
deepinfradeepseek-ai/DeepSeek-R1v12.4e-06 (per 1 token)1 token
deepinfranvidia/Llama-3.3-Nemotron-Super-49B-v1.5v14e-07 (per 1 token)1 token
deepinfradeepseek-ai/DeepSeek-R1-0528v12.15e-06 (per 1 token)1 token
deepinfradeepseek-ai/DeepSeek-R1-Distill-Llama-70Bv16e-07 (per 1 token)1 token
deepinfradeepseek-ai/DeepSeek-R1-Distill-Qwen-32Bv12.7e-07 (per 1 token)1 token
deepinfradeepseek-ai/DeepSeek-R1-Turbov13e-06 (per 1 token)1 token
deepinfradeepseek-ai/DeepSeek-V3v18.9e-07 (per 1 token)1 token
deepinfradeepseek-ai/DeepSeek-V3-0324v18.8e-07 (per 1 token)1 token
deepinfradeepseek-ai/DeepSeek-V3.1v11e-06 (per 1 token)1 token
deepinfradeepseek-ai/DeepSeek-V3.1-Terminusv11e-06 (per 1 token)1 token
deepinfragoogle/gemini-2.0-flash-001v14e-07 (per 1 token)1 token
deepinfragoogle/gemini-2.5-flashv12.5e-06 (per 1 token)1 token
deepinfragoogle/gemini-2.5-prov11e-05 (per 1 token)1 token
deepinfragoogle/gemma-3-12b-itv11e-07 (per 1 token)1 token
deepinfragoogle/gemma-3-27b-itv11.6e-07 (per 1 token)1 token
deepinfragoogle/gemma-3-4b-itv18e-08 (per 1 token)1 token
deepinframeta-llama/Llama-3.2-11B-Vision-Instructv14.9e-08 (per 1 token)1 token
deepinframeta-llama/Llama-3.2-3B-Instructv12e-08 (per 1 token)1 token
deepinframeta-llama/Llama-3.3-70B-Instructv14e-07 (per 1 token)1 token
deepinframeta-llama/Llama-3.3-70B-Instruct-Turbov13.9e-07 (per 1 token)1 token
deepinframeta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8v16e-07 (per 1 token)1 token
deepinframeta-llama/Llama-4-Scout-17B-16E-Instructv13e-07 (per 1 token)1 token
deepinframeta-llama/Llama-Guard-3-8Bv15.5e-08 (per 1 token)1 token
deepinframeta-llama/Llama-Guard-4-12Bv11.8e-07 (per 1 token)1 token
deepinframeta-llama/Meta-Llama-3-8B-Instructv16e-08 (per 1 token)1 token
deepinframeta-llama/Meta-Llama-3.1-70B-Instructv14e-07 (per 1 token)1 token
deepinframeta-llama/Meta-Llama-3.1-70B-Instruct-Turbov12.8e-07 (per 1 token)1 token
deepinframeta-llama/Meta-Llama-3.1-8B-Instructv15e-08 (per 1 token)1 token
deepinframeta-llama/Meta-Llama-3.1-8B-Instruct-Turbov13e-08 (per 1 token)1 token
deepinframicrosoft/WizardLM-2-8x22Bv14.8e-07 (per 1 token)1 token
deepinframicrosoft/phi-4v11.4e-07 (per 1 token)1 token
deepinframistralai/Mistral-Nemo-Instruct-2407v14e-08 (per 1 token)1 token
deepinframistralai/Mistral-Small-24B-Instruct-2501v18e-08 (per 1 token)1 token
deepinframistralai/Mistral-Small-3.2-24B-Instruct-2506v12e-07 (per 1 token)1 token
deepinframistralai/Mixtral-8x7B-Instruct-v0.1v14e-07 (per 1 token)1 token
deepinframoonshotai/Kimi-K2-Instructv12e-06 (per 1 token)1 token
deepinframoonshotai/Kimi-K2-Instruct-0905v12e-06 (per 1 token)1 token
deepinfranvidia/Llama-3.1-Nemotron-70B-Instructv16e-07 (per 1 token)1 token
deepinfranvidia/NVIDIA-Nemotron-Nano-9B-v2v11.6e-07 (per 1 token)1 token
deepinfraopenai/gpt-oss-120bv14.5e-07 (per 1 token)1 token
deepinfraopenai/gpt-oss-20bv11.5e-07 (per 1 token)1 token
deepinfrazai-org/GLM-4.5v11.6e-06 (per 1 token)1 token
deepinfralizpreciatior/lzlv_70b_fp16_hfv19e-07 (per 1 token)1 token
deepinfraGryphe/MythoMax-L2-13bv12.2e-07 (per 1 token)1 token
deepinframistralai/Mistral-7B-Instruct-v0.1v11.3e-07 (per 1 token)1 token
deepinframeta-llama/Llama-2-70b-chat-hfv19e-07 (per 1 token)1 token
deepinfracognitivecomputations/dolphin-2.6-mixtral-8x7bv12.7e-07 (per 1 token)1 token
deepinfraPhind/Phind-CodeLlama-34B-v2v16e-07 (per 1 token)1 token
deepinframistralai/Mixtral-8x7B-Instruct-v0.1v12.7e-07 (per 1 token)1 token
deepinframeta-llama/Llama-2-13b-chat-hfv12.2e-07 (per 1 token)1 token
deepinframeta-llama/Meta-Llama-3-8B-Instructv18e-08 (per 1 token)1 token
deepinframeta-llama/Meta-Llama-3-70B-Instructv17.9e-07 (per 1 token)1 token
deepinframeta-llama/Meta-Llama-3.1-405B-Instructv19e-07 (per 1 token)1 token
deepinfraopenchat/openchat_3.5v11.3e-07 (per 1 token)1 token
deepinfradeepseek-ai/DeepSeek-R1-0528-Turbov13e-06 (per 1 token)1 token
cerebrasllama3.1-8bllmengine (v2)1e-07 (per 1 token)1 token
cerebrasgpt-oss-120bllmengine (v2)6.9e-07 (per 1 token)1 token
cloudflare@cf/meta/llama-2-7b-chat-fp16llmengine (v2)1.923e-06 (per 1 token)1 token
cloudflare@cf/meta/llama-2-7b-chat-int8llmengine (v2)1.923e-06 (per 1 token)1 token
cloudflare@cf/mistral/mistral-7b-instruct-v0.1llmengine (v2)1.923e-06 (per 1 token)1 token
cloudflare@hf/thebloke/codellama-7b-instruct-awqllmengine (v2)1.923e-06 (per 1 token)1 token
databricksdatabricks-claude-3-7-sonnetllmengine (v2)1.5e-05 (per 1 token)1 token
databricksdatabricks-claude-haiku-4-5llmengine (v2)5e-06 (per 1 token)1 token
databricksdatabricks-claude-opus-4-1llmengine (v2)7.5e-05 (per 1 token)1 token
databricksdatabricks-claude-opus-4-5llmengine (v2)2.5e-05 (per 1 token)1 token
databricksdatabricks-claude-sonnet-4llmengine (v2)1.5e-05 (per 1 token)1 token
databricksdatabricks-claude-sonnet-4-5llmengine (v2)1.5e-05 (per 1 token)1 token
databricksdatabricks-gemini-2-5-flashllmengine (v2)2.5e-06 (per 1 token)1 token
databricksdatabricks-gemini-2-5-prollmengine (v2)1e-05 (per 1 token)1 token
databricksdatabricks-gemma-3-12bllmengine (v2)5e-07 (per 1 token)1 token
databricksdatabricks-gpt-5llmengine (v2)1e-05 (per 1 token)1 token
databricksdatabricks-gpt-5-1llmengine (v2)1e-05 (per 1 token)1 token
databricksdatabricks-gpt-5-minillmengine (v2)2e-06 (per 1 token)1 token
databricksdatabricks-gpt-5-nanollmengine (v2)4e-07 (per 1 token)1 token
databricksdatabricks-gpt-oss-120bllmengine (v2)6e-07 (per 1 token)1 token
databricksdatabricks-gpt-oss-20bllmengine (v2)3e-07 (per 1 token)1 token
databricksdatabricks-llama-4-maverickllmengine (v2)1.5e-06 (per 1 token)1 token
databricksdatabricks-meta-llama-3-1-405b-instructllmengine (v2)1.5e-05 (per 1 token)1 token
databricksdatabricks-meta-llama-3-1-8b-instructllmengine (v2)4.5e-07 (per 1 token)1 token
databricksdatabricks-meta-llama-3-3-70b-instructllmengine (v2)1.5e-06 (per 1 token)1 token
fireworks_aiaccounts/fireworks/models/qwen2p5-vl-32b-instructllmengine (v2)9e-07 (per 1 token)1 token
fireworks_aiaccounts/fireworks/models/qwen3-235b-a22bllmengine (v2)8.8e-07 (per 1 token)1 token
fireworks_aiaccounts/fireworks/models/qwen3-235b-a22b-instruct-2507llmengine (v2)8.8e-07 (per 1 token)1 token
fireworks_aiaccounts/fireworks/models/qwen3-235b-a22b-thinking-2507llmengine (v2)8.8e-07 (per 1 token)1 token
fireworks_aiaccounts/fireworks/models/qwen3-8bllmengine (v2)2e-07 (per 1 token)1 token
fireworks_aiaccounts/fireworks/models/qwen3-vl-235b-a22b-instructllmengine (v2)8.8e-07 (per 1 token)1 token
fireworks_aiaccounts/fireworks/models/qwen3-vl-235b-a22b-thinkingllmengine (v2)8.8e-07 (per 1 token)1 token
fireworks_aiaccounts/fireworks/models/qwen3-vl-30b-a3b-instructllmengine (v2)6e-07 (per 1 token)1 token
fireworks_aiaccounts/fireworks/models/qwen3-vl-30b-a3b-thinkingllmengine (v2)6e-07 (per 1 token)1 token
fireworks_aiaccounts/fireworks/models/deepseek-r1-0528llmengine (v2)8e-06 (per 1 token)1 token
fireworks_aiaccounts/fireworks/models/deepseek-v3-0324llmengine (v2)9e-07 (per 1 token)1 token
fireworks_aiaccounts/fireworks/models/deepseek-v3p1llmengine (v2)1.68e-06 (per 1 token)1 token
fireworks_aiaccounts/fireworks/models/deepseek-v3p1-terminusllmengine (v2)1.68e-06 (per 1 token)1 token
fireworks_aiaccounts/fireworks/models/deepseek-v3p2llmengine (v2)1.68e-06 (per 1 token)1 token
fireworks_aiaccounts/fireworks/models/glm-4p6llmengine (v2)2.19e-06 (per 1 token)1 token
fireworks_aiaccounts/fireworks/models/gpt-oss-120bllmengine (v2)6e-07 (per 1 token)1 token
fireworks_aiaccounts/fireworks/models/gpt-oss-20bllmengine (v2)2e-07 (per 1 token)1 token
fireworks_aiaccounts/fireworks/models/kimi-k2-instruct-0905llmengine (v2)2.5e-06 (per 1 token)1 token
fireworks_aiaccounts/fireworks/models/kimi-k2-thinkingllmengine (v2)2.5e-06 (per 1 token)1 token
fireworks_aiaccounts/fireworks/models/qwen3-coder-480b-a35b-instructllmengine (v2)1.8e-06 (per 1 token)1 token
fireworks_aiaccounts/fireworks/models/llama-v3p3-70b-instructllmengine (v2)9e-07 (per 1 token)1 token
fireworks_aiaccounts/fireworks/models/minimax-m2llmengine (v2)1.2e-06 (per 1 token)1 token
fireworks_aiaccounts/fireworks/models/mixtral-8x22b-instructllmengine (v2)1.2e-06 (per 1 token)1 token
ovhcloudDeepSeek-R1-Distill-Llama-70Bllmengine (v2)6.7e-07 (per 1 token)1 token
ovhcloudLlama-3.1-8B-Instructllmengine (v2)1e-07 (per 1 token)1 token
ovhcloudMeta-Llama-3_3-70B-Instructllmengine (v2)6.7e-07 (per 1 token)1 token
ovhcloudMistral-7B-Instruct-v0.3llmengine (v2)1e-07 (per 1 token)1 token
ovhcloudMistral-Nemo-Instruct-2407llmengine (v2)1.3e-07 (per 1 token)1 token
ovhcloudMistral-Small-3.2-24B-Instruct-2506llmengine (v2)2.8e-07 (per 1 token)1 token
ovhcloudMixtral-8x7B-Instruct-v0.1llmengine (v2)6.3e-07 (per 1 token)1 token
ovhcloudQwen2.5-Coder-32B-Instructllmengine (v2)8.7e-07 (per 1 token)1 token
ovhcloudQwen2.5-VL-72B-Instructllmengine (v2)9.1e-07 (per 1 token)1 token
ovhcloudQwen3-32Bllmengine (v2)2.3e-07 (per 1 token)1 token
ovhcloudgpt-oss-120bllmengine (v2)4e-07 (per 1 token)1 token
ovhcloudgpt-oss-20bllmengine (v2)1.5e-07 (per 1 token)1 token

Default Models
NameValue
amazonamazon.nova-pro-v1:0
anthropicclaude-3-7-sonnet-latest
coherecommand-r
deepseekdeepseek-chat
metameta.llama3-1-70b-instruct-v1:0
mistralmistral-large-latest
openaigpt-4o
together_aiQwen/Qwen2.5-72B-Instruct-Turbo
xaigrok-2-latest
googlegemini-2.0-flash
groqllama-3.3-70b-versatile
microsoftgpt-4o
minimaxMiniMax-M1
bytedanceseed-1-6-250915
perplexityaisonar
deepinfranvidia/Llama-3.3-Nemotron-Super-49B-v1.5
cerebrasgpt-oss-120b
Recent Requests
Log in to see full request history
TimeStatusUser Agent
Retrieving recent requests…
LoadingLoading…
Body Params
messages
array of objects
required

A list containing all the conversations between the user and the assistant. Each item in the list should be a dictionary with two keys: 'role' and 'message'.

role: Specifies the role of the speaker and can have the values 'user', 'system', 'assistant' or 'tool'. The system role instructs the way the model should answer, e.g. 'You are a helpful assistant'. The user role specifies the user query and assistant is the model's response. The tool role is for external tools that can be used in the conversation.

message: A list of dictionaries. Each dictionary in the 'message' list must contain the keys 'type' and 'content'.

Structure

  • type: Specifies the type of content and can be 'image_url' or 'text'.
  • content: A dictionary with the actual content based on the 'type':
    • If 'type' is 'image_url', 'content' must contain 'image_url' and must not contain 'text'.
    • If 'type' is 'text', 'content' must contain 'text' and must not contain 'image_url'.

Example

[
  {
    "role": "user",
    "content": [
        {
          "type": "text",
          "text": "Describe this image"
        },
        {
        "type": "image_url",
        "image_url": {
          "url": "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg"
          }
        }
    ]
  }
]
messages*
string
required
length ≥ 1

The OpenAI model to use for the chat completion. This field is required and specifies which language model will process the conversation.

Example values: 'gpt-3.5-turbo', 'gpt-4', 'gpt-4-turbo'

string
enum

Choices:

  • 'low': Minimal reasoning, quick responses
  • 'medium': Balanced reasoning approach
  • 'high': In-depth, comprehensive reasoning

Example: 'high' for complex problem-solving tasks

  • low - low
  • medium - medium
  • high - high
Allowed:
metadata
array of objects

Optional list of metadata associated with the chat request. Can be used to provide additional context or tracking information.

Example:

{
  "metadata": [
    {"key": "conversation_id", "value": "chat_12345"},
    {"key": "source", "value": "customer_support"}
  ]
}
metadata
double
-2 to 2

Controls repetitiveness of model responses by penalizing frequent tokens. Ranges from -2.0 to 2.0.

Values:

  • Positive values: Reduce token repetition
  • Negative values: Encourage repetition
  • 0.0: Default behavior

Example: 1.5 to significantly reduce repeated phrases

logit_bias
object

Modify the likelihood of specific tokens appearing in the response. A dictionary where keys are token IDs and values are bias scores.

Example:

{
  "logit_bias": {
    "50256": -100,  # Reduce probability of end-of-text token
    "15": 5  # Slightly increase probability of a specific token
  }
}
boolean

If set to True, returns log probabilities of the most likely tokens. Useful for advanced token probability analysis.

Example: True to get detailed token likelihood information

integer
0 to 20

Number of top log probabilities to return with each token. Must be between 0 and 20.

Example: 5 to get top 5 most likely tokens for each position

integer
≥ 1

Maximum number of tokens to generate in the completion. Must be at least 1.

Example: 150 to limit response to approximately 100-150 words

integer
≥ 1

Number of chat completion choices to generate.

**Example**: 3 to generate multiple alternative responses
modalities
array of strings

List of supported input/output modalities for the chat.

Example:

{
  "modalities": ["text", "image", "audio"]
}
modalities
prediction
object

Optional field for storing prediction-related information. Flexible dictionary to capture model's predictive metadata.

Example:

{
  "prediction": {
    "confidence_score": 0.85,
    "top_prediction": "response_category"
  }
}
audio
object

Optional dictionary for audio-related parameters or metadata.

Example:

{
  "audio": {
    "language": "en-US",
    "transcription_format": "srt"
  }
}
double
-2 to 2

Adjusts likelihood of discussing new topics by penalizing existing tokens. Ranges from -2.0 to 2.0.

Values:

  • Positive values: Encourage more diverse topics
  • Negative values: Keep discussion more focused
  • 0.0: Default behavior

Example: 1.0 to promote topic diversity

response_format
object

Specify the desired response format for the completion.

Example:

{
  "response_format": {
    "type": "json_object",
    "schema": {...}
  }
}
integer

Set a seed for deterministic sampling to reproduce consistent results.

Example: 42 for a reproducible random generation process

string
enum

Choices:

  • 'auto': Automatically select appropriate tier
  • 'default': Use default service configuration
  • auto - auto
  • default - default
Allowed:
stop
array of strings

List of strings that will cause the model to stop generating.

Example:

{
  "stop": ["\n", "Human:", "AI:"]
}
stop
boolean
Defaults to false

If True, returns tokens as they are generated in a streaming format. Default is False.

Example: True for real-time token streaming

stream_options
object

Additional configuration for streaming responses.

Example:

{
  "stream_options": {
    "include_usage": true
  }
}
double
0 to 2

Controls randomness in token selection. Ranges from 0.0 to 2.0.

Values:

  • 0.0: Most deterministic, focused responses
  • 1.0: Balanced randomness
  • 2.0: Most creative, unpredictable responses

Example: 0.7 for a good balance of creativity and focus

double
0 to 1

Nucleus sampling threshold for token selection. Ranges from 0.0 to 1.0. Default is 1.0.

Values:

  • 1.0: Consider all tokens
  • Lower values: More focused, deterministic sampling

Example: 0.9 to select from top 90% most probable tokens

tools
array

List of tools or function definitions available to the model.

Example:

{
  "tools": [
    {
      "type": "function",
      "function": {
        "name": "get_weather",
        "description": "Retrieve current weather"
      }
    }
  ]
}
tools
string
length ≥ 1

Specify how tools should be used in the completion.

Example values:

  • 'auto': Model decides when to use tools
  • 'none': Disable tool usage
  • Specific tool name to always use a particular tool
boolean

Allow the model to make multiple tool calls in parallel.

Example: True to enable concurrent tool invocations

string
length ≥ 1

Optional identifier for the end-user to help track and monitor API usage.

Example: 'user_123456'

string
length ≥ 1

Control how function calls are handled.

Example values:

  • 'auto': Default behavior
  • 'none': Disable function calls
  • Specific function name to force its execution
functions
array of objects

List of function definitions available to the model.

Example:

{
  "functions": [
    {
      "name": "get_current_weather",
      "description": "Get the current weather for a location",
      "parameters": {...}
    }
  ]
}
functions
thinking
object

Configuration for enabling Claude's extended thinking. When enabled, responses include thinking content blocks showing Claude's thinking process before the final answer. Requires a minimum budget of 1,024 tokens and counts towards your max_tokens limit.

Example:

{
  'thinking': {
    'type': 'enabled'
    'budget_tokens': '1024'  }
}
web_search_options
object

Options for web search integration. Example: json web_search_options={ "search_context_size": "medium" # Options: "low", "medium", "high" }

Responses

Language
Credentials
Bearer
JWT
LoadingLoading…
Response
Click Try It! to start a request and see the response here! Or choose an example:
application/json