Model library

Browse our library of open source models that are ready to deploy behind an API endpoint in seconds.

28 Qwen models

LLM

Qwen3.5 9B Latency

V1 - Latency - vLLM - H100

LLM

Qwen3.5 35B-A3B Latency

V1 - Latency - vLLM - H100

LLM

Qwen3.5 122B-A10B Latency

V1 - Latency - vLLM - H100

Model API
LLM

Qwen3 Coder 480B

3 - Coder

LLM

Qwen 3 32B

V3 - TRT-LLM - H100

LLM

Qwen3 VL 235B

3 - Vision Language

LLM

Qwen3 Coder 30B

3 - Coder

LLM

Qwen 3 235B

V3 - SGLang - H100

LLM

Qwen 3 4B

V3 - TRT-LLM - H100

LLM

Qwen 2.5 14B Instruct

2.5 - TRT-LLM - H100

LLM

Qwen 2.5 32B Coder Instruct

2.5 - Coder - TRT-LLM - H100

LLM

Qwen 2.5 7B Math Instruct

2.5 - Math - TRT-LLM - H100 MIG 40GB

LLM

Qwen 2.5 32B QwQ

2.5 - QwQ - TRT-LLM - H100

LLM

Qwen3.5 4B Latency

V1 - Latency - vLLM - H100

Text to speech

Qwen3 TTS 12Hz Base Streaming 1.7B

TTS - 12Hz Base

Text to speech

Qwen3 TTS 12Hz Base Streaming 0.6B

TTS - 12Hz Base

Transcription

Qwen 3 ASR 1.7B

LLM

Qwen3 Omni Thinker

Omni - Thinker

LLM

Qwen3 Next 80B A3B Instruct

Qwen3 Next 80B A3B Instruct - Instruct - SGLang - H100

LLM

Qwen3 Next 80B A3B Thinking

Qwen3 Next 80B A3B Instruct - Instruct - SGLang - H100

LLM

Qwen 2.5 72B Instruct

2.5 - TRT-LLM - H100

LLM

Qwen 2.5 72B Math Instruct

2.5 - Math - TRT-LLM - H100

LLM

Qwen 2.5 14B Coder Instruct

2.5 - Coder - TRT-LLM - H100

LLM

Qwen 2.5 32B Instruct

2.5 - TRT-LLM - H100

LLM

Qwen 2.5 7B Coder Instruct

2.5 - Coder - TRT-LLM - H100 MIG 40GB

LLM

Qwen 2.5 7B Instruct

2.5 - TRT-LLM - H100 MIG 40GB

LLM

Qwen 2.5 3B Instruct

2.5 - TRT-LLM - A10G

🔥 Trending models