Loading...
Loading...
Browse and compare all available Large Language Model APIs. Use filters to find the perfect match.
Category
Pricing Model
Capabilities
25 APIs found
Creator of GPT-4, GPT-4o, and the most widely used LLM API platform.
Creator of Claude — safety-focused AI with strong reasoning and long context.
Gemini models — natively multimodal with strong performance across tasks.
DeepSeek-V4 series: flagship V4 Pro for best reasoning, V4 Flash for speed and efficiency.
Open-source Llama models — powerful, free, and community-driven.
European AI leader with efficient and powerful open-weight models.
Enterprise-grade AI platform with strong RAG and embedding capabilities.
Leading Chinese AI company with GLM series models and strong enterprise adoption.
Creator of Kimi — China's popular long-context AI assistant with 2M token support.
Tongyi Qianwen (Qwen) models with strong multilingual and multimodal support.
ERNIE Bot — Baidu's flagship LLM with deep Chinese language understanding.
Chinese AI unicorn with abab series models and Hailuo AI productivity tools.
Elon Musk's AI company — creator of Grok with real-time knowledge and witty personality.
AI-powered answer engine with real-time web search and citation-backed responses.
Fast inference platform for open-source models with competitive pricing.
Chinese AI company with strong Baichuan models optimized for enterprise use cases.
Pioneer in open generative AI for images, video, and audio.
Chinese AI company with Step series models focusing on multimodal understanding.
Cloud platform for running open-source ML models with simple API.
Lightning-fast inference with LPU hardware — lowest latency in the market.
Unified API gateway for 200+ LLM models with a single API key and standardized format.
Fast and affordable serverless inference for open-source LLMs with pay-per-token pricing.
Enterprise-grade fast inference platform with fine-tuning and LoRA support for open models.
The world's largest ML model hub with hosted inference API for thousands of models.
Edge inference platform running models on Cloudflare's global network with low latency.