Input token counts and costs for over 160 AI models.

166 AI models, to be exact. The full list is below.

No credit card required.

Author Model Price per Input Token
AI21
Jamba 1.5 Large $0.000002
Jamba 1.5 Mini $0.0000002
Amazon
Titan Text Premier $0.0000005
Titan Text Express $0.0000002
Titan Text Lite $0.00000015
Titan Text Embeddings $0.0000001
Titan Text Embeddings V2 $0.00000002
Anthropic
Claude 2.0 $0.000008
Claude 2.1 $0.000008
Claude Instant 1.2 $0.0000008
Claude 3 Haiku $0.00000025
Claude 3 Opus $0.000015
Claude 3 Sonnet $0.000003
Claude 3.5 Haiku $0.0000008
Claude 3.5 Sonnet $0.000003
Cohere
Command R $0.00000015
Command R 08-2024 $0.00000015
Command R+ $0.0000025
Command R+ 08-2024 $0.0000025
Databricks
Dolly V1 6B n/a
Dolly V2 3B n/a
Dolly V2 7B n/a
Dolly V2 12B n/a
DeepSeek
DeepSeek LLM 7B $0.00000014
DeepSeek LLM 7B Chat $0.00000014
DeepSeek LLM 67B $0.00000014
DeepSeek LLM 67B Chat $0.00000014
DeepSeek V1 1.3B $0.00000014
DeepSeek V1 1.3B Chat $0.00000014
DeepSeek V1 7B $0.00000014
DeepSeek V1 7B Chat $0.00000014
DeepSeek V2 $0.00000014
DeepSeek V2 Chat $0.00000014
DeepSeek V2 Chat 0628 $0.00000014
DeepSeek V2 Lite $0.00000014
DeepSeek V2 Lite Chat $0.00000014
DeepSeek V2.5 $0.00000014
Google
Gemini 1.0 Nano n/a
Gemini 1.0 Pro $0.0000005
Gemini 1.5 Pro $0.00000125
Gemini 1.5 Flash $0.000000075
Gemini 1.5 Flash 8B $0.0000000375
Gemma 2B n/a
Gemma 7B n/a
Gemma 2 2B n/a
Gemma 2 9B n/a
Gemma 2 27B n/a
CodeGemma 2B n/a
CodeGemma 7B n/a
ShieldGemma 2B n/a
ShieldGemma 9B n/a
ShieldGemma 27B n/a
IBM
Granite 7B n/a
Granite 3B Code 2K n/a
Granite 3B Code 128K n/a
Granite 8B Code 4K n/a
Granite 8B Code 128K n/a
Granite 20B Code 8K n/a
Granite 34B Code 8K n/a
Granite 3.0 2B n/a
Granite 3.0 8B n/a
Granite Guardian HAP 38M n/a
Granite Guardian HAP 125M n/a
Granite Guardian 3.0 2B n/a
Granite Guardian 3.0 8B n/a
Meta
Llama 2 7B $0.00000052
Llama 2 7B Chat $0.00000052
Llama 2 13B $0.00000052
Llama 2 13B Chat $0.00000081
Llama 2 70B $0.00000052
Llama 2 70B Chat $0.00000177
Llama 3 8B n/a
Llama 3 70B n/a
Llama 3.1 8B n/a
Llama 3.1 70B n/a
Llama 3.1 405B n/a
Llama 3.2 1B n/a
Llama 3.2 3B n/a
Llama 3.2 11B Vision n/a
Llama 3.2 90B Vision n/a
CodeLlama 7B n/a
CodeLlama 13B n/a
CodeLlama 34B n/a
CodeLlama 70B n/a
Llama Guard 7B n/a
Llama Guard 2 8B n/a
Llama Guard 3 1B n/a
Llama Guard 3 8B n/a
Llama Guard 3 11B Vision n/a
Prompt Guard 86M n/a
Microsoft
Phi-3 Mini 4K $0.00000013
Phi-3 Mini 128K $0.00000013
Phi-3 Medium 4K $0.00000017
Phi-3 Medium 128K $0.00000017
Phi-3 Vision 128K n/a
Phi-3.5 Mini $0.00000013
Phi-3.5 Vision $0.00000013
Mistral
Mistral 7B 0.1 $0.00000025
Mistral 7B 0.3 $0.00000025
Mistral Large $0.000002
Mistral Nemo $0.00000015
Codestral 22B 0.1 $0.0000002
Codestral Mamba 7B 0.1 $0.0000002
Mathstral 7B 0.1 n/a
Nvidia
NVLM D 72B n/a
OpenAI
GPT-3.5 Turbo $0.000001
GPT-3.5 Turbo 16K $0.000003
GPT-4 $0.00003
GPT-4 Turbo $0.00001
GPT-4o $0.0000025
GPT-4o Mini $0.00000015
Text Embedding Ada 002 $0.0000001
Text Embedding 3 Small $0.00000002
Text Embedding 3 Large $0.00000013
Qwen
Qwen 1.5 0.5B n/a
Qwen 1.5 0.5B Chat n/a
Qwen 1.5 1.8B n/a
Qwen 1.5 1.8B Chat n/a
Qwen 1.5 4B n/a
Qwen 1.5 4B Chat n/a
Qwen 1.5 7B n/a
Qwen 1.5 7B Chat n/a
Qwen 1.5 14B n/a
Qwen 1.5 14B Chat n/a
Qwen 1.5 32B n/a
Qwen 1.5 32B Chat n/a
Qwen 1.5 72B n/a
Qwen 1.5 72B Chat n/a
Qwen 1.5 110B n/a
Qwen 1.5 110B Chat n/a
Qwen 2 0.5B n/a
Qwen 2 1.5B n/a
Qwen 2 7B n/a
Qwen 2 72B n/a
Qwen 2.5 0.5B n/a
Qwen 2.5 1.5B n/a
Qwen 2.5 3B n/a
Qwen 2.5 7B n/a
Qwen 2.5 14B n/a
Qwen 2.5 32B n/a
Qwen 2.5 72B n/a
Snowflake
Arctic n/a
Arctic Embed Extra Small n/a
Arctic Embed Small n/a
Arctic Embed Medium n/a
Arctic Embed Medium Long n/a
Arctic Embed Medium V1.5 n/a
Arctic Embed Large n/a
Stability
Stable LM 2 12B n/a
Stable LM 2 12B Chat n/a
Stable LM Zephyr 3B n/a
Stable Beluga 7B n/a
Stable Beluga 13B n/a
Stable Beluga 2 n/a
Stable Code 3B n/a
TIIUAE
Falcon 7B n/a
Falcon 11B n/a
Falcon 40B n/a
Falcon 180B n/a
Falcon 180B Chat n/a
Falcon Mamba 7B n/a
xAI
Grok-1 n/a
Zyphra
Zamba 7B n/a
Zamba 2 1.2B n/a
Zamba 2 2.7B n/a
Zamba 2 7B n/a
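
As a rough illustration of how the per-token prices above translate into a bill, the following Python sketch multiplies an input token count by the listed price. The names PRICE_PER_INPUT_TOKEN and input_cost, and the handful of models chosen, are assumptions made for this example rather than part of any published API. The prices are copied from the table, and a model listed as n/a is represented as None.

```python
from decimal import Decimal
from typing import Optional

# Prices in USD per input token, copied from the table above.
# Only a few models are included here; this dictionary is illustrative.
PRICE_PER_INPUT_TOKEN = {
    "Claude 3.5 Sonnet": Decimal("0.000003"),
    "GPT-4o": Decimal("0.0000025"),
    "GPT-4o Mini": Decimal("0.00000015"),
    "Gemini 1.5 Flash": Decimal("0.000000075"),
    "Llama 3 8B": None,  # listed as n/a in the table
}


def input_cost(model: str, input_tokens: int) -> Optional[Decimal]:
    """Return the estimated input cost in USD, or None if no price is listed.

    Raises KeyError for a model that is not in the dictionary and
    ValueError for a negative token count.
    """
    if input_tokens < 0:
        raise ValueError("input_tokens must be non-negative")
    price = PRICE_PER_INPUT_TOKEN[model]
    if price is None:
        return None
    # Decimal avoids the rounding noise that floats show at these magnitudes.
    return price * input_tokens


if __name__ == "__main__":
    for name in PRICE_PER_INPUT_TOKEN:
        cost = input_cost(name, 1_000_000)
        label = "n/a" if cost is None else f"${cost:.4f}"
        print(f"{name}: {label} per 1,000,000 input tokens")
```

At the listed prices, 1,000,000 input tokens sent to GPT-4o would cost about $2.50, and the same volume sent to GPT-4o Mini about $0.15. Output tokens are priced separately and are not covered by this table.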

Prompt engineering made simple.

Our platform helps AI engineers build better prompts faster, saving time, reducing costs, and improving AI outcomes.

Purpose-built for AI engineers.

Empowering AI engineers with innovative tools to streamline development, increase productivity, and improve results.

Prompt Library
Organize and manage your prompts in shared workspaces for easy access and collaboration.
Prompt Versioning
Create multiple versions of your prompts, each of which can be optimized to improve AI performance.
Prompt Generation
Auto-generate high-quality, immediately usable prompts based on your use case to save hours of time.
Prompt Scoring
Score your prompts from 0 to 100 against predefined and custom sets of criteria.
Prompt Balance
Gain insights and recommendations about your prompt structure based on phrase categorization.
Prompt Heatmaps
Visualize which phrases in your prompts are given the most (or least) attention by AI models.