Model library

Browse our library of open-source models that are ready to deploy behind an API endpoint in seconds.

🔥 Trending models

Large language models

Z AI
LLM

GLM-4.5V

4.5 - Vision
Meta logo
Model API
LLM

Llama 4 Scout

V4.0 - Instruct - vLLM - H100
Meta logo
Model API
LLM

Llama 4 Maverick

V4.0 - Instruct - vLLM - B200
DeepSeek Logo
Model API
LLM

DeepSeek V3 0324

V3 - 0324 - B200
DeepSeek Logo
Model API
LLM

DeepSeek R1 0528

R1 - 0528 - B200
Qwen Logo
Model API
LLM

Qwen3 235B 2507

2507

Text-to-speech models

Canopy Labs Logo
Text to speech

Orpheus 3B Websockets

TRT-LLM - H100 MIG 40GB
Canopy Labs Logo
Text to speech

Orpheus TTS

TRT-LLM - H100 MIG 40GB
Text to speech

MARS6

V6 - L4
Coqui
Text to speech

XTTS V2

T4
Text to speech

Kokoro

fp16 - T4

Transcription models

OpenAI logo
Transcription

Whisper Large V3 (best performance)

V3 - H100 MIG 40GB
OpenAI logo
Transcription

Whisper Streaming Large v3

H100 MIG 40GB
OpenAI logo
Transcription

Whisper Streaming Large v3 Turbo

H100 MIG 40GB
Fixie Logo
Transcription

Ultravox v0.6 70B

v0.6 - H100
Mistral AI logo
Transcription

Voxtral Small 24B

2507 - Small - H100
Mistral AI logo
Transcription

Voxtral Mini 3B

2507 - Mini - H100 MIG 40GB

Image generation models

Qwen Logo
Image generation

Qwen Image

Text-to-Image
Fotographer AI
Image generation

ZenCtrl

Custom Server - H100
ByteDance logo
Image generation

SDXL Lightning

1.0 - Lightning - A100
Stability AI logo
Image generation

Stable Diffusion 3 Medium

3 - A100
Stability AI logo
Image generation

Stable Diffusion XL

XL 1.0 - A10G
Fotographer AI
Image generation

ZenCtrl Pro

Custom Server - H100

Embedding models

Qwen Logo
Embedding

Qwen3 8B Reranker

BEI - H100 MIG 40GB
Qwen Logo
Embedding

Qwen3 8B Embedding

BEI - H100 MIG 40GB
Allen AI
Embedding

Tulu 3 8B Reward

V3 - Reward - BEI - H100 MIG 40GB
BAAI
Embedding

BGE Reranker M3

BEI - H100
BAAI
Embedding

BGE Embedding ICL

BEI - H100
Nomic AI logo
Embedding

Nomic Embed Code

BEI - H100 MIG 40GB

DeepSeek models

DeepSeek Logo
Model API
LLM

DeepSeek V3 0324

V3 - 0324 - B200
DeepSeek Logo
Model API
LLM

DeepSeek R1 0528

R1 - 0528 - B200
DeepSeek Logo
LLM

DeepSeek-R1 Llama 70B

R1 - Llama - TRT-LLM - H100
DeepSeek Logo
LLM

DeepSeek-R1 Qwen 32B

R1 - Qwen - TRT-LLM - H100
DeepSeek Logo
LLM

DeepSeek-R1 Qwen 7B

R1 - Qwen - TRT-LLM - H100 MIG 40GB
DeepSeek Logo
LLM

DeepSeek-R1 Zero

R1 - Zero - SGLang - H200

Qwen models

Qwen Logo
Model API
LLM

Qwen3 235B 2507

2507
Qwen Logo
Model API
LLM

Qwen3 Coder 480B

3 - Coder
Qwen Logo
LLM

Qwen3 Coder 30B

3 - Coder
Qwen Logo
Image generation

Qwen Image

Text-to-Image
Qwen Logo
Embedding

Qwen3 8B Reranker

BEI - H100 MIG 40GB
Qwen Logo
Embedding

Qwen3 8B Embedding

BEI - H100 MIG 40GB

Meta models

Meta logo
Model API
LLM

Llama 4 Scout

V4.0 - Instruct - vLLM - H100
Meta logo
Model API
LLM

Llama 4 Maverick

V4.0 - Instruct - vLLM - B200
Meta logo
LLM

Llama 3.3 70B Instruct

3.3 - TRT-LLM - H100
Meta logo
LLM

Llama 3.1 8B Instruct

3.1 - Instruct - TRT-LLM - H100
Meta logo
LLM

Llama 3.1 405B Instruct

3.1 - Instruct - H100
Meta logo
LLM

Llama 3.2 11B Vision Instruct

3.2 - Vision - A100