All LLM APIs

streamingfunction-callingtool-use+9 more

Anthropic

Creator of Claude — safety-focused AI with strong reasoning and long context.

USEst. 20211 endpoints

streamingtool-usevision+4 more

streamingfunction-callingvision+7 more

Google DeepMind

Gemini models — natively multimodal with strong performance across tasks.

USEst. 20101 endpoints

Multi-modalFree Tier

DeepSeek

DeepSeek-V4 series: flagship V4 Pro for best reasoning, V4 Flash for speed and efficiency.

streamingfunction-callingjson-mode+2 more

Meta AI

Open-source Llama models — powerful, free, and community-driven.

USEst. 20041 endpoints

ChatFree Tier

Mistral AI

European AI leader with efficient and powerful open-weight models.

FREst. 20231 endpoints

Cohere

Enterprise-grade AI platform with strong RAG and embedding capabilities.

CAEst. 20191 endpoints

streamingtool-usejson-mode+5 more

Zhipu AI

Leading Chinese AI company with GLM series models and strong enterprise adoption.

CNEst. 20191 endpoints

Moonshot AI

Creator of Kimi — China's popular long-context AI assistant with 2M token support.

streamingfunction-callingtool-use+4 more

streamingfunction-callingtool-use+7 more

Alibaba Cloud

Tongyi Qianwen (Qwen) models with strong multilingual and multimodal support.

CNEst. 19991 endpoints

Multi-modalPay per Token

Baidu AI

ERNIE Bot — Baidu's flagship LLM with deep Chinese language understanding.

CNEst. 20001 endpoints

streamingfunction-callingvision+4 more

MiniMax

Chinese AI unicorn with abab series models and Hailuo AI productivity tools.

CNEst. 20211 endpoints

streamingfunction-callingfine-tuning+2 more

streamingfunction-callingvision+3 more

xAI

Elon Musk's AI company — creator of Grok with real-time knowledge and witty personality.

USEst. 20231 endpoints

ChatSubscription

Perplexity

AI-powered answer engine with real-time web search and citation-backed responses.

streamingweb-searchcitation+1 more

Together AI

Fast inference platform for open-source models with competitive pricing.

streamingfunction-callingjson-mode+3 more

Baichuan AI

Chinese AI company with strong Baichuan models optimized for enterprise use cases.

streamingfunction-callingjson-mode+4 more

Image GenerationPay per Token

Stability AI

Pioneer in open generative AI for images, video, and audio.

UKEst. 20191 endpoints

streamingfine-tuning

StepFun

Chinese AI company with Step series models focusing on multimodal understanding.

streamingfunction-callingvision+3 more

Multi-modalPay per Token

streamingfine-tuningimage-input+3 more

Replicate

Cloud platform for running open-source ML models with simple API.

USEst. 20191 endpoints

ChatUsage Based

Groq

Lightning-fast inference with LPU hardware — lowest latency in the market.

USEst. 20161 endpoints

streamingfunction-callingtool-use+3 more

OpenRouter

Unified API gateway for 200+ LLM models with a single API key and standardized format.

USEst. 20231 endpoints

API GatewayPay per Token

DeepInfra

Fast and affordable serverless inference for open-source LLMs with pay-per-token pricing.

streamingjson-modeembeddings+1 more

API GatewayPay per Token

Fireworks AI

Enterprise-grade fast inference platform with fine-tuning and LoRA support for open models.

streamingfunction-callingtool-use+6 more

API GatewayPay per Token

streamingembeddingsimage-input+3 more

Hugging Face

The world's largest ML model hub with hosted inference API for thousands of models.

USEst. 20161 endpoints

API GatewayFree Tier

streamingjson-modeembeddings+3 more

Cloudflare Workers AI

Edge inference platform running models on Cloudflare's global network with low latency.

USEst. 20101 endpoints

API GatewayFree Tier