LLMEndpoint

LLM API Provider Directory

Browse official APIs, inference platforms, aggregators, and OpenAI-compatible providers by model support, pricing notes, capabilities, and transparency signals.

Short answer

Use the directory for broad market scanning, not final selection. Filter down to a small shortlist, then switch to compare pages, provider reviews, and cost modeling before choosing a production provider.

22 providers indexed across official APIs, inference platforms, aggregators, and OpenAI-compatible endpoints.

Use the category chips for a quick pass, then switch to detailed filters when you are narrowing a shortlist.

8Official APIs

Usually strongest for direct vendor trust

7Inference providers

Often useful for cost and speed research

17OpenAI-compatible

Useful for migration and fallback planning

Start from the path that sounds like your job

If the full table feels too broad, use one of these entry points first.

I want the safest production starting point

Start in Official APIs first, then compare OpenAI, Anthropic, and Gemini before exploring cheaper or more flexible routes.

I want lower-cost or faster open-model routes

Go to inference providers if your main goal is cheaper open-model serving, speed experiments, or broader model access.

I want OpenAI-compatible migration options

Use the compatibility category if the team wants a lower-friction migration path before rewriting app logic.

I want routing, fallback, or gateway control

Go to aggregators and gateways if your problem is multi-provider operations rather than picking one direct model vendor.

22 providers match the current filters.

Use category chips for a quick pass, then combine filters to narrow the shortlist.

ProviderCategorySupported modelsOpenAI-compatibleStarting priceContextTool callingVisionStreamingStatusTrustLinks
OpenAIOfficial APIsGPT, reasoning models, embeddings, imageYesBudget to premium GPT tiersShort to very long, model basedYesYesYesAvailable12/15
AnthropicOfficial APIsClaude, Claude Haiku, Claude Sonnet, Claude OpusNoMid to premium Claude tiersLong context optionsYesYesYesAvailable10/15
Google GeminiOfficial APIsGemini, embedding models, multimodal modelsYesLow-cost flash to premium tiersShort to million-token-class optionsYesYesYesAvailable11/15
Mistral AIOfficial APIsMistral, Mixtral, Codestral, embeddingsYesOpen and premium model tiersShort to long, model basedYesNoYesAvailable11/15
DeepSeekOfficial APIsDeepSeek-V4-Flash, DeepSeek-V4-ProYesLow-cost flash to discounted pro tiers1M context, up to 384K outputYesNoYesAvailable11/15
xAIOfficial APIsGrokYesFrontier-model pricing tiersMid to long, model basedYesYesYesAvailable11/15
CohereOfficial APIsCommand, Embed, RerankNoEnterprise and task-specific tiersTask and model basedYesNoYesAvailable10/15
Together AIInference ProvidersLlama, Qwen, DeepSeek-V4, MistralYesOften competitive for open modelsBroad open-model rangeNoYesYesAvailable11/15
Fireworks AIInference ProvidersLlama, Qwen, DeepSeek-V4, MistralYesCompetitive serverless tiers for open modelsBroad open-model range, model specificNoYesYesAvailable11/15
GroqInference ProvidersLlama, Mixtral, Gemma, Whisper-like speech modelsYesSpeed-oriented model tiersSelected fast-serving model range, model specificYesNoYesAvailable11/15
DeepInfraInference ProvidersLlama, Qwen, DeepSeek-V4, MistralYesOften low for open modelsBroad open-model range, model specificNoYesYesAvailable10/15
ReplicateInference Providersopen models, image models, audio models, video modelsNoRuntime dependentModel dependentNoYesYesAvailable10/15
BasetenInference Providerscustom models, open modelsNoDeployment dependentModel dependentNoYesYesAvailable10/15
OpenRouterLLM API AggregatorsGPT, Claude, Gemini, DeepSeek-V4YesVaries by model routeModel dependent across upstream routesNoYesYesAvailable11/15
PortkeyLLM API AggregatorsGPT, Claude, Gemini, DeepSeek-V4YesPlan dependent plus provider spendProvider dependentNoYesYesAvailable11/15
LiteLLM CloudLLM API AggregatorsGPT, Claude, Gemini, MistralYesPlan dependentProvider dependentNoYesYesUnclear10/15
HeliconeLLM API Aggregatorsprovider dependentYesPlan dependentProvider dependentNoYesYesAvailable11/15
Perplexity APIOpenAI-Compatible APIsSonar, online modelsYesVaries by modelModel dependentNoNoYesAvailable10/15
Novita AIOpenAI-Compatible APIsLlama, Qwen, DeepSeek-V4, image modelsYesVaries by modelModel dependentNoYesYesUnclear10/15
AI/ML APIOpenAI-Compatible APIsGPT-style models, Claude-style access, Gemini-style access, open modelsYesVaries by modelModel dependentNoYesYesUnclear9/15
Anyscale EndpointsInference Providersopen models, custom deploymentsYesUnclearModel dependentNoNoYesUnclear10/15
Voyage AIOfficial APIsembeddings, rerankersNoVaries by modelModel dependentNoNoNoUnclear9/15

Most practical starting points

These are common providers when teams want one strong baseline in each decision direction.

Official APIs

OpenAI

Official API for GPT models, multimodal capabilities, embeddings, realtime use cases, and broad developer tooling.

  • general AI apps
  • Budget to premium GPT tiers
  • Trust 12/15
Official APIs

Anthropic

Official Claude API with strong long-context, coding, writing, and agent-oriented use cases.

  • coding
  • Mid to premium Claude tiers
  • Trust 10/15
Official APIs

DeepSeek

Official DeepSeek API for the DeepSeek-V4 family, with OpenAI and Anthropic compatible formats plus very large context windows.

  • cost-effective long-context apps
  • Low-cost flash to discounted pro tiers
  • Trust 11/15
LLM API Aggregators

OpenRouter

Unified API for accessing many models and providers through a routing and marketplace-style interface.

  • model comparison
  • Varies by model route
  • Trust 11/15
Inference Providers

DeepInfra

Serverless inference platform with a broad model catalog and OpenAI-compatible endpoints for many models.

  • low-cost open model inference
  • Often low for open models
  • Trust 10/15

Compare these next

Once the table has helped you narrow the field, move into head-to-head decisions.