Loading...
Loading...
Cloudflare Workers AI runs machine learning models on Cloudflare's global edge network, providing low-latency inference close to users worldwide. Supports text generation, embeddings, image generation, and speech-to-text with a simple REST API. Pay-per-use pricing with generous free tier.
/v1/chat/completionsGenerate text using models running on Cloudflare edge network.
Models
Pricing
| Model | Input Price | Output Price | Context Window |
|---|---|---|---|
| Llama 3.1 8B (free: 1B tokens/day) | Free | Free | 32.0K |
| Llama 3.3 70B (paid) | Free | Free | 128.0K |
Test Cloudflare Workers AI APIs directly in our interactive playground with your own API key.
Category
Pricing Model
Free TierPricing
Input Price
Free per 1M tokens
Output Price
Free per 1M tokens
Context Window
32.0K tokens
Country
US