Model library

Browse our library of open-source models, each ready to deploy behind an API endpoint in seconds.

28 H100 models

LLM

Qwen 3 4B

V3 - TRT-LLM - H100

Image generation

ZenCtrl

Custom Server - H100

LLM

Qwen 3 32B

V3 - TRT-LLM - H100

LLM

Qwen 3 235B

V3 - SGLang - H100

LLM

Llama 4 Scout

V4.0 - Instruct - vLLM - H100

Embedding

BGE Reranker M3

BEI - H100

Embedding

BGE Embedding ICL

BEI - H100

LLM

Llama 3.3 Nemotron 49B Super - NVIDIA NIM

3.3 - Nemotron - H100

LLM

Mistral Small 3.1

3.1 - vLLM - H100

LLM

Gemma 3 27B IT

3 - Instruct - vLLM - H100

LLM

DeepSeek-R1 Llama 70B

R1 - Llama - TRT-LLM - H100

LLM

Llama 3.3 70B Instruct

3.3 - TRT-LLM - H100

LLM

DeepSeek-R1 Qwen 32B

R1 - Qwen - TRT-LLM - H100

LLM

Qwen 2.5 14B Instruct

2.5 - TRT-LLM - H100

LLM

Qwen 2.5 32B Coder Instruct

2.5 - Coder - TRT-LLM - H100

LLM

Llama 3.1 8B Instruct

3.1 - Instruct - TRT-LLM - H100

LLM

Qwen 2.5 32B QwQ

2.5 - QwQ - TRT-LLM - H100

LLM

Llama 3.1 405B Instruct

3.1 - Instruct - H100

LLM

Llama 3.1 Nemotron Ultra 253B

3.1 - Nemotron - TRT-LLM - H100

LLM

Pixtral 12B

Pixtral - vLLM - H100

LLM

Qwen 2.5 72B Instruct

2.5 - TRT-LLM - H100

LLM

Qwen 2.5 72B Math Instruct

2.5 - Math - TRT-LLM - H100

LLM

Qwen 2.5 14B Coder Instruct

2.5 - Coder - TRT-LLM - H100

LLM

Qwen 2.5 32B Instruct

2.5 - TRT-LLM - H100

LLM

Llama 3.1 70B Instruct

3.1 - Instruct - TRT-LLM - H100

LLM

Llama 3.2 90B Vision Instruct

3.2 - Vision - H100

LLM

Mixtral 8x7B Instruct

v1 - TRT-LLM - H100

LLM

Mixtral 8x22B

H100

🔥 Trending models

LLM

Qwen 3 235B

V3 - SGLang - H100

Text to speech

Orpheus TTS

vLLM - H100 MIG 40GB

LLM

DeepSeek-R1

R1 - SGLang - B200

LLM

Llama 4 Maverick

V4.0 - Instruct - vLLM - B200
