Inference ProvidersFast inference platform for open models with serverless APIs, fine-tuning, and deployment options.
Models: Llama, Qwen, DeepSeek-V4
low-latency open model appsCompetitive serverless tiers for open modelsBroad open-model range, model specific
Yes OpenAI-compatibleNo tool calling listedTrust 11/15
Inference ProvidersServerless inference platform with a broad model catalog and OpenAI-compatible endpoints for many models.
Models: Llama, Qwen, DeepSeek-V4
low-cost open model inferenceOften low for open modelsBroad open-model range, model specific
Yes OpenAI-compatibleNo tool calling listedTrust 10/15
Inference ProvidersInference provider known for very fast LPU-backed serving of selected open and partner models.
Models: Llama, Mixtral, Gemma
low-latency chatSpeed-oriented model tiersSelected fast-serving model range, model specific
Yes OpenAI-compatibleTool callingTrust 11/15
Inference ProvidersAPI platform for running community and commercial machine learning models, including text, image, audio, and video models.
Models: open models, image models, audio models
experimental ML appsRuntime dependentModel dependent
No OpenAI-compatibleNo tool calling listedTrust 10/15