Changelog
See our latest feature releases, product improvements and bug fixes
gRPC support
We now support calling models via gRPC! gRPC is type-safe, supports streaming, and is language interoperable, making it great for: Low-latency applications (e.g., video processing) Microservices Read...
Jul 24, 2025Private Docker images for Training
Baseten Training now supports private Docker images for GCP and AWS! The integration is simple: Add your authentication details for your container registry into Baseten’s Secrets For GCP, this will...
Jul 22, 2025WebSocket support for real-time model streaming
We’ve revamped the experience of using WebSockets to invoke your deployments. Here’s what’s new: WebSocket-specific metrics in the UI, like average connection duration Improved UX for managing and...
Jul 14, 2025Workspace redesign
We’ve redesigned the workspace experience to make it easier to see what’s happening and get to what you need. Here’s what’s new: New overview page that gives you a high-level snapshot of everything...
Jul 7, 2025SSO support now available
We’ve added support for Single Sign-On (SSO) with all major identity providers, including Okta, Google Workspace, Azure AD, and others. This allows you to authenticate through your existing IdP,...
May 21, 2025Introducing two new products: Model APIs and Training
Today we're introducing two new products: Baseten Model APIs and Training. Model APIs are frontier models running on the Baseten Inference Stack, purpose-built for production. Today we’re launching...
May 19, 2025Introducing our new brand
We're thrilled to introduce our new brand! We believe inference is the foundation of all AI going forward. That's what our new look is all about: Baseten is the building blocks of AI. Baseten is...
Apr 15, 2025Early Access: Announcing B200s on Baseten
We're thrilled to announce early access to NVIDIA B200 GPUs on Baseten! From benchmarks on models like DeepSeek R1, Llama 4, and Qwen, we’re already seeing 5x higher throughput, over 2x better cost...
Apr 14, 2025Flexible instance types per model deployment
Model deployments now support changing instance types, enabling you to experiment with different hardware configurations and use specific hardware for staging, development, and production...
Apr 10, 2025Stream Baseten logs from the terminal
For users who love working in the terminal, we're excited to announce truss push --tail , which streams Baseten logs directly to your command line. You no longer need to switch context between your...