Baseten introduces rolling deployments for zero-downtime model updates
The feature updates models incrementally without doubling GPU spend and offers pause, resume, and rollback controls.
See the latest news and media coverage for Baseten. We track all announcements, press releases, and industry mentions in real time, all in one place.
AI model inference platform
baseten.coLast updated
In short: Baseten expanded its inference platform with frontier model support and technical optimizations while entering reported talks for a $1 billion raise.
The feature updates models incrementally without doubling GPU spend and offers pause, resume, and rollback controls.
Mercury 2 runs over 1000 tokens per second on NVIDIA GPUs, at half the cost of comparable models, with 90% cost reduction for Augment Code.
It uses Mamba layers to keep step time constant as context grows, achieving up to 5x faster inference and 30% lower cost.
The flagship reasoning model offers clean data lineage and customization options with fine-tuning via Baseten Loops.
Angel Au-Yeung / Wall Street Journal: AI inference startup Baseten is raising $1.5B in a dual-tiered deal, with some investors putting in money at an...
AI startup Baseten has recently been in talks with investors to raise $1 billion at an $11 billion valuation including the money, according to a...
Nvidia invested $150 million in Baseten, which helps companies deploy and run large AI models.
Kate Clark / Wall Street Journal: Sources: AI inference startup Baseten raised $300M led by IVP and CapitalG at a $5B valuation, more than doubling...
Track Baseten and your other target companies to get real-time alerts and weekly summaries delivered straight to your inbox.
Browse news for competitors to Baseten and other trending companies.