Changelog

See our latest feature releases, product improvements and bug fixes

Jul 30, 2025

gRPC support

We now support calling models via gRPC! gRPC is type-safe, supports streaming, and is language interoperable, making it great for: Low-latency applications (e.g., video processing) Microservices Read...

Jul 24, 2025

Private Docker images for Training

Baseten Training now supports private Docker images for GCP and AWS! The integration is simple: Add your authentication details for your container registry into Baseten’s Secrets For GCP, this will...

Jul 22, 2025

WebSocket support for real-time model streaming

We’ve revamped the experience of using WebSockets to invoke your deployments. Here’s what’s new: WebSocket-specific metrics in the UI, like average connection duration Improved UX for managing and...

Jul 14, 2025

Workspace redesign

We’ve redesigned the workspace experience to make it easier to see what’s happening and get to what you need. Here’s what’s new: New overview page that gives you a high-level snapshot of everything...

Jul 7, 2025

SSO support now available

We’ve added support for Single Sign-On (SSO) with all major identity providers, including Okta, Google Workspace, Azure AD, and others. This allows you to authenticate through your existing IdP,...

May 21, 2025

Introducing two new products: Model APIs and Training

Today we're introducing two new products: Baseten Model APIs and Training. Model APIs are frontier models running on the Baseten Inference Stack, purpose-built for production. Today we’re launching...

May 19, 2025

Introducing our new brand

We're thrilled to introduce our new brand! We believe inference is the foundation of all AI going forward. That's what our new look is all about: Baseten is the building blocks of AI. Baseten is...

Apr 15, 2025

Early Access: Announcing B200s on Baseten

We're thrilled to announce early access to NVIDIA B200 GPUs on Baseten! From benchmarks on models like DeepSeek R1, Llama 4, and Qwen, we’re already seeing 5x higher throughput, over 2x better cost...

Apr 14, 2025

Flexible instance types per model deployment

Model deployments now support changing instance types, enabling you to experiment with different hardware configurations and use specific hardware for staging, development, and production...

Apr 10, 2025

Stream Baseten logs from the terminal

For users who love working in the terminal, we're excited to announce truss push --tail , which streams Baseten logs directly to your command line. You no longer need to switch context between your...