Back to companies

Baseten

🇺🇸

GrowthSan Francisco, California, United Stateswww.baseten.co/

Total funding$585M

ConfidenceHigh 24Medium 19Low 2

Company info

Full nameBaseten

Founded2019年

HeadquartersSan Francisco, California, United States

Websitewww.baseten.co/

Region🇺🇸 United States

StageGrowth

Employees51-200（LinkedIn company size

Report date2026-03-10

Overview

AI inference infrastructure platform for production deployment, optimization, and scaling of open and custom models.

Industry tags

AI InfrastructureModel ServingEnterprise AI PlatformDeveloper Tools

Key people

NameRole

Tuhin SrivastavaCEO, Co-Founder

Amir HaghighatCTO, Co-Founder

Phil HowesCo-Founder

Pankaj GuptaCo-Founder

Dannie HerzbergPresident (GTM and Operations)

Joey ZwickerHead of Forward Deployed Engineering

Jay SimonsBoard member (joined with Series D announcement)

Core products and services

7 products

Inference platform

Baseten Inference Stack

面向生产环境的核心推理栈，覆盖性能优化、推理基础设施与模型管理。

High confidence · 2 sources · 2+ independent authoritative sources

Deployment product

Dedicated Deployments

提供专属模型部署形态，支持运行时与扩缩容控制。

High confidence · 2 sources · 2+ independent authoritative sources

Managed model API

Model APIs

通过Baseten推理栈提供生产可用的前沿/开源模型API。

High confidence · 2 sources · 2+ independent authoritative sources

Training infrastructure

Training

提供多节点训练工作流并衔接部署。

High confidence · 2 sources · 2+ independent authoritative sources

Hybrid deployment

Baseten Hybrid

支持在客户VPC运行推理，并可溢出到Baseten Cloud。

High confidence · 2 sources · 2+ independent authoritative sources

Compound AI framework

Baseten Chains

面向低延迟复合AI系统与多模型工作流的框架。

High confidence · 2 sources · 2+ independent authoritative sources

Funding history

Total funding 约$585M+（按公开可验证轮次去重估算，不重复计入Series E中NVIDIA子份额）

Date	Round	Amount	Valuation	Investors	Confidence
2019年	Seed	未单独披露（并入Seed+Series A累计）	未披露	Greylock	Medium confidence · 1 sources · Single authoritative source
2023年	Series A	Seed+Series A累计略高于$20M（Series A单独金额未披露）	未披露	Greylock, South Park Commons, Lachy Groom, Caffeinated Capital	Medium confidence · 1 sources · Single authoritative source
2024年03月	Series B	$40M	未披露	云启资本	Medium confidence · 1 sources · Single authoritative source
2025年02月	Series C	$75M	$825M-$850M（媒体口径区间）	IVP, Spark, Greylock, Conviction, South Park Commons, 01 Advisors, Lachy Groom	High confidence · 2 sources · 2+ independent authoritative sources
2025年09月	Series D	$150M	$2.15B	Bond, CapitalG, Premji Invest, Scribble Ventures, Conviction, 01A, IVP, Spark, Greylock, BoxGroup	High confidence · 2 sources · 2+ independent authoritative sources
2026年01月	Series E	$300M	$5B	IVP, CapitalG（lead）及NVIDIA	High confidence · 2 sources · 2+ independent authoritative sources
2026年01月	Strategic Investment	$150M（NVIDIA作为交易一部分）	对应$5B轮次口径	NVIDIA	High confidence · 2 sources · 2+ independent authoritative sources
2026年02月	Series C（historical confirmation post）	$75M（历史轮次再次确认）	未披露	IVP, Spark, Greylock, Conviction, South Park Commons, Basecase, Lachy Groom, 01A	High confidence · 2 sources · 2+ independent authoritative sources

Product release timeline

2025年07月Medium confidence · 1 sources · Single authoritative source

Workspace Workspace redesign

工作区体验重做，统一部署/API/训练操作面板。

2025年05月High confidence · 2 sources · 2+ independent authoritative sources

Product line Model APIs + Training launch

新增Model APIs与Training两条核心产品线。

2025年04月Medium confidence · 1 sources · Single authoritative source

Documentation Docs refresh

文档体系与导航重构。

2025年03月Medium confidence · 1 sources · Single authoritative source

API Fully OpenAI compatible

官方宣布对OpenAI兼容API提供完整支持。

2025年02月Medium confidence · 1 sources · Single authoritative source

Developer experience Baseten Chains GA

Chains进入GA，面向低延迟compound AI系统。

2025年01月Medium confidence · 1 sources · Single authoritative source

Reliability Custom health checks

部署健康检查可自定义，强化故障前置监控。

2025年Medium confidence · 1 sources · Single authoritative source

Deployment Baseten Hybrid

发布Hybrid部署形态，支持自有VPC与Baseten云混合。

2024年12月Medium confidence · 1 sources · Single authoritative source

Observability New metrics dashboard customization

指标面板重构，支持统一视图与可定制布局。

2024年02月Medium confidence · 1 sources · Single authoritative source

Infrastructure H100 support

上线H100推理并披露价格/性能改进口径。

2024年01月Medium confidence · 1 sources · Single authoritative source

Infrastructure NVIDIA L4 GA

L4 GPU实例正式可用，扩展推理硬件选择。

2023年04月Medium confidence · 1 sources · Single authoritative source

Pricing Usage-based pricing

Startup plan转为纯按量计费并提供免费credits。

2022年12月Medium confidence · 1 sources · Single authoritative source

Model management Configure model resources

支持自定义CPU/GPU/副本与autoscaling参数。

Key events

2026

宣布acquihire Inferless，补强推理基础设施人才与技术。

完成$300M融资并达到$5B估值（Series E）。

NVIDIA在该轮中作为战略投资方参与；中文财经媒体披露其投资额$150M。

2025

收购Parsed，强化RL后训练与持续学习能力。

完成Series D（$150M）并达到$2.15B估值，Jay Simons加入董事会。

Joey Zwicker加入并担任Head of Forward Deployed Engineering。

Dannie Herzberg加入并担任President，负责GTM与运营。

完成Series C（$75M），媒体估值约$825M-$850M。

与AWS/NVIDIA生态合作案例公开，强调多云+GPU推理协同能力。

2024

完成Series B（$40M），进入更快商业化扩张阶段。

2023

对外披露Seed与Series A累计融资略超$20M。

2019

Baseten由四位联合创始人创立，聚焦推理基础设施。

Competitive landscape

AI inference platform (startup)

**Together AI (Together Inference / GPU cloud)** — 以开源模型运行与GPU云能力切入，偏开发者与模型团队的快速部署；其商业化以推理与算力消费为核心。相较之下，Baseten强调企业控制面、多云管理与SLO，面向更强合规与生产级稳定性诉求。[Source](https://www.baseten.co/compare/together-ai/)

AI model API / hosting

**Fireworks AI (Serverless model inference APIs)** — 走高性能推理API路线，通常以按调用/吞吐计费服务AI应用团队。与Baseten同赛道竞争点在于延迟、吞吐与成本效率；Baseten的差异是将Dedicated Deployments与Hybrid能力打包为企业交付方案。[Source1](https://www.baseten.co/compare/together-ai/) [Source2](https://www.eesel.ai/blog/baseten)

Serverless GPU platform

**Modal (Modal serverless compute)** — 以Python/serverless工作流和批处理见长，GTM偏工程团队自助；在全栈企业治理与复杂生产控制上通常需额外架构补齐。Baseten则更强调推理生产化、SLA与企业部署路径。[Source1](https://northflank.com/blog/modal-vs-baseten-vs-northflank) [Source2](https://introl.com/blog/serverless-gpu-platforms-runpod-modal-beam-comparison-guide-2025)

Serverless GPU platform

**Runpod (Runpod GPU cloud)** — 以GPU供给与性价比为核心卖点，适合成本敏感与快速实验团队。Baseten与其差异主要在企业级运维能力、可观测性与标准化生产交付。[Source](https://introl.com/blog/serverless-gpu-platforms-runpod-modal-beam-comparison-guide-2025)

Model hosting API

**Replicate (Replicate model APIs)** — 强调模型托管与API调用便捷性，开发者采用门槛低，适合快速集成。Baseten在企业场景中主打更深的推理优化、专属部署与混合云控制能力。[Source1](https://introl.com/blog/serverless-gpu-platforms-runpod-modal-beam-comparison-guide-2025) [Source2](https://www.eesel.ai/blog/baseten)

Hyperscaler AI platform

**AWS (SageMaker / Bedrock)** — 依托云平台全栈能力与渠道覆盖，常以平台绑定和企业采购体系推动落地。Baseten在此类竞争中以“专注推理性能 + 跨云灵活性”作为替代/补充定位。[Source1](https://www.eesel.ai/blog/baseten) [Source2](https://aws.amazon.com/partners/success/baseten-nvidia/)

Hyperscaler AI platform

**Google Cloud (Vertex AI)** — 与GCP生态和数据治理深度耦合，适合已在Google Cloud内部署的大型企业。Baseten差异在于更中立的多云策略与推理平台专精化能力。[Source](https://www.eesel.ai/blog/baseten)

Hyperscaler AI platform

**Microsoft (Azure AI)** — 依托微软企业客户覆盖、合规与生态分发能力，GTM偏平台整合。Baseten则通过专门化推理栈与交付团队争夺高性能AI应用客户。[Source](https://www.eesel.ai/blog/baseten)

Growth metrics

Revenue scaleseveral million dollars（Forbes表述）—2024年

Revenue growthrevenue grew more than 10x over prior 12 months10x+2025年09月

Platform growthgrown over 5x year-over-year5x+2025年02月

Reliability SLA signal99.999% uptime—2025年02月

Performance relative to peersaverage 60%+ better throughput/latency metrics vs competitors—2025年02月

EmployeesLinkedIn company size 51-200—2026年

Competitive narrative

Differentiators

多云容量管理（官方口径9+ clouds）与高可用性叙事（99.99%/99.999%口径）。

围绕推理性能的工程深耕（TensorRT-LLM、自研优化、speculative decoding）。

从Inference扩展到Model APIs与Training，形成更完整AI生产栈。

强调企业可控性：数据驻留、self-hosted/hybrid、可观测与部署控制。

Challenges and risks

与超大云厂商及资本化推理平台竞争，价格战与GPU供给波动风险高。

部分增长指标来自公司PR或单一报道，缺少审计口径ARR与利润率披露。

高速融资对应高估值，后续需持续证明单位经济性与客户留存质量。

并购与acquihire整合执行风险（Parsed、Inferless团队融合与产品路线收敛）。

Market position

Baseten位于“AI应用层推理基础设施”核心赛道，已从部署工具扩展为覆盖Dedicated Deployments、Model APIs、Training与Hybrid的多产品平台。2025-2026连续大额融资与估值跃升，反映资本市场对其作为推理底座供应商的高预期。竞争结构上，Baseten一端面对Together/Modal/Runpod/Replicate等专业平台，另一端与AWS/Google/Microsoft等超大云平台重叠。其持续领先的关键在于性能与可靠性的可验证兑现、企业级交付效率，以及在多模型时代的成本效率管理。

Sources

baseten.co — baseten.coHigh confidence · 2+ independent authoritative sources baseten.co — baseten.coHigh confidence · 2+ independent authoritative sources linkedin.com — linkedin.comHigh confidence · 2+ independent authoritative sources baseten.co — baseten.coHigh confidence · 2+ independent authoritative sources S1 — baseten.coHigh confidence · 2+ independent authoritative sources