LLMEndpoint

LLM API Provider Directory

Browse official APIs, inference platforms, aggregators, and OpenAI-compatible providers by model support, pricing notes, capabilities, and transparency signals.

Short answer

Use the directory for broad market scanning, not final selection. Filter down to a small shortlist, then switch to compare pages, provider reviews, and cost modeling before choosing a production provider.

22 providers indexed across official APIs, inference platforms, aggregators, and OpenAI-compatible endpoints.

Use the category chips for a quick pass, then switch to detailed filters when you are narrowing a shortlist.

8Official APIs

Usually strongest for direct vendor trust

7Inference providers

Often useful for cost and speed research

17OpenAI-compatible

Useful for migration and fallback planning

Start from the path that sounds like your job

If the full table feels too broad, use one of these entry points first.

I want the safest production starting point

Start in Official APIs first, then compare OpenAI, Anthropic, and Gemini before exploring cheaper or more flexible routes.

Start with official APIs

I want lower-cost or faster open-model routes

Go to inference providers if your main goal is cheaper open-model serving, speed experiments, or broader model access.

Browse inference providers

I want OpenAI-compatible migration options

Use the compatibility category if the team wants a lower-friction migration path before rewriting app logic.

Open compatibility options

I want routing, fallback, or gateway control

Go to aggregators and gateways if your problem is multi-provider operations rather than picking one direct model vendor.

See aggregators

How to use the directory well

The fastest path is filter, shortlist, compare, then model cost.

Workflow

Start from your app's use case

Use the finder first if you are not yet sure which provider category belongs in the shortlist.

Use finder for category direction

Workflow

Compare finalists after filtering

Once the directory narrows the field, move into side-by-side comparisons rather than staying in the table.

Shift from browse mode to compare mode

Workflow

Turn browsing into a shortlist process

Use the shortlist guide when you need a more structured path from directory research to a buying decision.

Use a shortlist framework

22 providers match the current filters.

Use category chips for a quick pass, then combine filters to narrow the shortlist.

OpenAI-compatible Tool calling Vision Low-cost option

Provider	Category	Supported models	OpenAI-compatible	Starting price	Context	Tool calling	Vision	Streaming	Status	Trust	Links
OpenAI	Official APIs	GPT, reasoning models, embeddings, image	Yes	Budget to premium GPT tiers	Short to very long, model based	Yes	Yes	Yes	Available	12/15	Review Docs Compare
Anthropic	Official APIs	Claude, Claude Haiku, Claude Sonnet, Claude Opus	No	Mid to premium Claude tiers	Long context options	Yes	Yes	Yes	Available	10/15	Review Docs Compare
Google Gemini	Official APIs	Gemini, embedding models, multimodal models	Yes	Low-cost flash to premium tiers	Short to million-token-class options	Yes	Yes	Yes	Available	11/15	Review Docs Compare
Mistral AI	Official APIs	Mistral, Mixtral, Codestral, embeddings	Yes	Open and premium model tiers	Short to long, model based	Yes	No	Yes	Available	11/15	Review Docs
DeepSeek	Official APIs	DeepSeek-V4-Flash, DeepSeek-V4-Pro	Yes	Low-cost flash to discounted pro tiers	1M context, up to 384K output	Yes	No	Yes	Available	11/15	Review Docs Compare
xAI	Official APIs	Grok	Yes	Frontier-model pricing tiers	Mid to long, model based	Yes	Yes	Yes	Available	11/15	Review Docs
Cohere	Official APIs	Command, Embed, Rerank	No	Enterprise and task-specific tiers	Task and model based	Yes	No	Yes	Available	10/15	Review Docs
Together AI	Inference Providers	Llama, Qwen, DeepSeek-V4, Mistral	Yes	Often competitive for open models	Broad open-model range	No	Yes	Yes	Available	11/15	Review Docs Compare
Fireworks AI	Inference Providers	Llama, Qwen, DeepSeek-V4, Mistral	Yes	Competitive serverless tiers for open models	Broad open-model range, model specific	No	Yes	Yes	Available	11/15	Review Docs
Groq	Inference Providers	Llama, Mixtral, Gemma, Whisper-like speech models	Yes	Speed-oriented model tiers	Selected fast-serving model range, model specific	Yes	No	Yes	Available	11/15	Review Docs Compare
DeepInfra	Inference Providers	Llama, Qwen, DeepSeek-V4, Mistral	Yes	Often low for open models	Broad open-model range, model specific	No	Yes	Yes	Available	10/15	Review Docs Compare
Replicate	Inference Providers	open models, image models, audio models, video models	No	Runtime dependent	Model dependent	No	Yes	Yes	Available	10/15	Review Docs
Baseten	Inference Providers	custom models, open models	No	Deployment dependent	Model dependent	No	Yes	Yes	Available	10/15	Review Docs
OpenRouter	LLM API Aggregators	GPT, Claude, Gemini, DeepSeek-V4	Yes	Varies by model route	Model dependent across upstream routes	No	Yes	Yes	Available	11/15	Review Docs Compare
Portkey	LLM API Aggregators	GPT, Claude, Gemini, DeepSeek-V4	Yes	Plan dependent plus provider spend	Provider dependent	No	Yes	Yes	Available	11/15	Review Docs
LiteLLM Cloud	LLM API Aggregators	GPT, Claude, Gemini, Mistral	Yes	Plan dependent	Provider dependent	No	Yes	Yes	Unclear	10/15	Review Docs
Helicone	LLM API Aggregators	provider dependent	Yes	Plan dependent	Provider dependent	No	Yes	Yes	Available	11/15	Review Docs
Perplexity API	OpenAI-Compatible APIs	Sonar, online models	Yes	Varies by model	Model dependent	No	No	Yes	Available	10/15	Review Docs
Novita AI	OpenAI-Compatible APIs	Llama, Qwen, DeepSeek-V4, image models	Yes	Varies by model	Model dependent	No	Yes	Yes	Unclear	10/15	Review Docs
AI/ML API	OpenAI-Compatible APIs	GPT-style models, Claude-style access, Gemini-style access, open models	Yes	Varies by model	Model dependent	No	Yes	Yes	Unclear	9/15	Review Docs
Anyscale Endpoints	Inference Providers	open models, custom deployments	Yes	Unclear	Model dependent	No	No	Yes	Unclear	10/15	Review Docs
Voyage AI	Official APIs	embeddings, rerankers	No	Varies by model	Model dependent	No	No	No	Unclear	9/15	Review Docs

Most practical starting points

These are common providers when teams want one strong baseline in each decision direction.

Official APIs

OpenAI

Official API for GPT models, multimodal capabilities, embeddings, realtime use cases, and broad developer tooling.

general AI apps
Budget to premium GPT tiers
Trust 12/15

Review Estimate cost

Official APIs

Anthropic

Official Claude API with strong long-context, coding, writing, and agent-oriented use cases.

coding
Mid to premium Claude tiers
Trust 10/15

Review Estimate cost

Official APIs

DeepSeek

Official DeepSeek API for the DeepSeek-V4 family, with OpenAI and Anthropic compatible formats plus very large context windows.

cost-effective long-context apps
Low-cost flash to discounted pro tiers
Trust 11/15

Review Estimate cost

LLM API Aggregators

OpenRouter

Unified API for accessing many models and providers through a routing and marketplace-style interface.

model comparison
Varies by model route
Trust 11/15

Review Estimate cost

Inference Providers

DeepInfra

Serverless inference platform with a broad model catalog and OpenAI-compatible endpoints for many models.

low-cost open model inference
Often low for open models
Trust 10/15

Review Estimate cost

Compare these next

Once the table has helped you narrow the field, move into head-to-head decisions.

OpenAI vs AnthropicBest starting comparison for many production teams

OpenAI vs DeepSeekUseful if DeepSeek-V4 pricing and 1M context are part of the shortlist

OpenAI vs Google GeminiUseful if multimodal and ecosystem fit matter

Groq vs DeepInfraUseful if you are weighing speed against cheap open-model breadth