Platform
Platform
Resources
Resources
Pricing
Pricing
Docs
Docs
Log in
Get started
Model library
Browse our library of open source models that are ready to deploy behind an API endpoint in seconds.
Deploy your own model
Filter by
All
LLM
Transcription
Text to speech
Image generation
Embedding
28 H100 models
Model API
LLM
Llama 4 Scout
V4.0
-
Instruct
-
vLLM
-
H100
LLM
Qwen 3 235B
V3
-
SGLang
-
H100
LLM
Qwen 3 4B
V3
-
TRT-LLM
-
H100
Image generation
ZenCtrl
Custom Server
-
H100
LLM
Qwen 3 32B
V3
-
TRT-LLM
-
H100
Embedding
BGE Reranker M3
BEI
-
H100
Embedding
BGE Embedding ICL
BEI
-
H100
LLM
Llama 3.3 Nemotron 49B Super - NVIDIA NIM
3.3
-
Nemotron
-
H100
LLM
Mistral Small 3.1
3.1
-
vLLM
-
H100
LLM
Gemma 3 27B IT
3
-
Instruct
-
vLLM
-
H100
LLM
DeepSeek-R1 Llama 70B
R1
-
Llama
-
TRT-LLM
-
H100
LLM
Llama 3.3 70B Instruct
3.3
-
TRT-LLM
-
H100
LLM
DeepSeek-R1 Qwen 32B
R1
-
Qwen
-
TRT-LLM
-
H100
LLM
Qwen 2.5 14B Instruct
2.5
-
TRT-LLM
-
H100
LLM
Qwen 2.5 32B Coder Instruct
2.5
-
Coder
-
TRT-LLM
-
H100
LLM
Llama 3.1 8B Instruct
3.1
-
Instruct
-
TRT-LLM
-
H100
LLM
Qwen 2.5 32B QwQ
2.5
-
QwQ
-
TRT-LLM
-
H100
LLM
Llama 3.1 405B Instruct
3.1
-
Instruct
-
H100
LLM
Llama 3.1 Nemotron Ultra 253B
3.1
-
Nemotron
-
TRT-LLM
-
H100
LLM
Pixtral 12B
Pixtral
-
vLLM
-
H100
LLM
Qwen 2.5 72B Instruct
2.5
-
TRT-LLM
-
H100
LLM
Qwen 2.5 72B Math Instruct
2.5
-
Math
-
TRT-LLM
-
H100
LLM
Qwen 2.5 14B Coder Instruct
2.5
-
Coder
-
TRT-LLM
-
H100
LLM
Qwen 2.5 32B Instruct
2.5
-
TRT-LLM
-
H100
LLM
Llama 3.1 70B Instruct
3.1
-
Instruct
-
TRT-LLM
-
H100
LLM
Llama 3.2 90B Vision Instruct
3.2
-
Vision
-
H100
LLM
Mixtral 8x7B Instruct
v1
-
TRT-LLM
-
H100
LLM
Mixtral 8x22B
H100
🔥 Trending models
LLM
Qwen 3 235B
V3
-
SGLang
-
H100
Text to speech
Orpheus TTS
TRT-LLM
-
H100 MIG 40GB
Model API
LLM
DeepSeek-R1
R1
-
SGLang
-
B200
Model API
LLM
Llama 4 Maverick
V4.0
-
Instruct
-
vLLM
-
B200
Explore Baseten today
Start deploying
Talk to an engineer