Groq is known for fast inference powered by its custom LPU (Language Processing Unit) hardware. Its API provides low-latency access to popular open-source models such as Llama, Mixtral, and Gemma, which makes it well suited to real-time applications.
Models
Pricing
| Model | Input Price (per 1M tokens) | Output Price (per 1M tokens) | Context Window (tokens) |
|---|---|---|---|
| Llama 3.3 70B | $0.59 | $0.79 | 128.0K |
| Llama 3.1 8B | $0.05 | $0.08 | 128.0K |
| Mixtral 8x7B | $0.24 | $0.24 | 32.8K |
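To make the table easier to apply, here is a minimal sketch of a cost estimate for a given token volume. It assumes the prices above are in USD per 1M tokens, as the per-token figures listed for Llama 3.3 70B on this page suggest. The names `PRICES_PER_MILLION` and `estimate_cost` are hypothetical helpers for illustration and are not part of any Groq SDK.

```python
from decimal import Decimal

# USD per 1M tokens as (input, output), copied from the pricing table.
# Assumption: every row uses the same per-1M-token unit.
PRICES_PER_MILLION = {
    "Llama 3.3 70B": (Decimal("0.59"), Decimal("0.79")),
    "Llama 3.1 8B": (Decimal("0.05"), Decimal("0.08")),
    "Mixtral 8x7B": (Decimal("0.24"), Decimal("0.24")),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated cost in USD for the given token counts."""
    input_price, output_price = PRICES_PER_MILLION[model]
    total = input_price * input_tokens + output_price * output_tokens
    return total / Decimal(1_000_000)


# Example: 2M input tokens and 500K output tokens on Llama 3.3 70B.
example_cost = estimate_cost("Llama 3.3 70B", 2_000_000, 500_000)
print(f"Estimated cost: ${example_cost}")
```

`Decimal` is used instead of `float` so that small per-token prices do not pick up binary rounding error when summed. Listed prices can change, so treat the dictionary as a snapshot of this table rather than a live rate card.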
Test Groq APIs directly in our interactive playground with your own API key.
Category
Pricing Model
Pay per Token
Pricing
Input Price
$0.59 per 1M tokens
Output Price
$0.79 per 1M tokens
Context Window
128.0K tokens
Country
US