Baseten

At Baseten we provide all the infrastructure you need to deploy and serve ML models performantly, scalably, and cost-efficiently. Get started in minutes, and avoid getting tangled in complex…

baseten.co

Baseten

At Baseten we provide all the infrastructure you need to deploy and serve ML models performantly, scalably, and cost-efficiently. Get started in minutes, and avoid getting tangled in complex…

baseten.co

HQUS

Team Size233

Open Jobs12

Total Funding$585M

Latest FundraiseUnknown

TL;DR

Sector: AI inference / ML infrastructure

Founded: 2019

Headquarters: San Francisco, California

Employee count (reported): 204

Notable recent funding: $300M Series E (company disclosure)

Company Overview

Problem Domain

Production inference and serving for machine-learning models (including LLMs) with emphasis on scalability, performance, and cost control.

Founded

2019

Industry

Software Development

Funding Track Record

Series B- March 2024

$40M

Series C- February 2025

$75M

Series D- September 2025

$150M

Series E

$300M

Company disclosure reporting $300M Series E at $5B valuation

Investor Signal

“Baseten has raised late-stage rounds with participation from investors including Bond, IVP, CapitalG, Spark Capital, NVIDIA, Greylock, Conviction, 01 Advisors, BoxGroup, and others.”

Founders

What we do

Join the Team

Product Manager - Infrastructure

HybridNew York, NY, US

Hybrid • New York, NY, US

Related Companies

Company	HQ	Industry	Total Funding
Weights & Biases	🇺🇸US	Software	$250M
Modular	🇺🇸US	Data and AnalyticsDeepTechInformation TechnologySoftware	$380M
SuperAnnotate	🇺🇸US	Data and AnalyticsDeepTechHardwareInformation TechnologySoftware	$18M
LangChain	🌍Remote	Data and AnalyticsDeepTechInformation TechnologySoftware	-
Hebbia	🇺🇸US	Data and AnalyticsDeepTechInformation TechnologySoftware	$160M

About Baseten

Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $150M Series D, backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products.

THE ROLE

As an Infrastructure Product Manager at Baseten, you’ll own the roadmap for our core inference and compute infrastructure, ensuring our platform delivers world-class reliability, scalability, and performance. You’ll work closely with engineering teams to define how we handle large-scale distributed systems, optimize GPU utilization, and provide enterprise-grade security and observability. This is a deeply technical role that bridges engineering excellence and customer impact, ensuring Baseten’s infrastructure is a foundation our users can depend on.

EXAMPLE INITIATIVES

You'll get to work on these types of projects as part of our Infrastructure team:

Multi-cloud capacity management
Inference on B200 GPUs
Multi-node inference
Fractional H100 GPUs for efficient model serving

Responsibilities

Define the product vision and roadmap for Baseten’s inference, serving, and orchestration infrastructure
Collaborate with engineering to improve the reliability, latency, and cost efficiency of model deployments
Partner with Forward Deployed Engineering and customer teams to translate performance needs into infrastructure investments
Drive internal platform scalability from multi-GPU support to hybrid cloud architecture
Establish metrics for uptime, latency, and cost, ensuring we deliver best-in-class performance and efficiency
Lead cross-functional initiatives around observability, deployment automation, and infrastructure security

Requirements

4+ years of Product Management experience in developer platforms, infrastructure, or ML systems
Engineering background (e.g., degree in Computer Science, Electrical Engineering, or related field; or equivalent hands-on experience as a software engineer)
Strong technical understanding of distributed systems, cloud computing, and GPU-based workloads
Proven track record of shipping technical products with measurable reliability or performance improvements
Excellent communication and prioritization skills with deeply technical teams

NICE TO HAVE

Experience with Kubernetes, autoscaling systems, or inference optimization
Understanding of LLM and multimodal model serving
Prior experience at a company building infrastructure for ML or developer tools

Benefits

Competitive compensation, including meaningful equity.
100% coverage of medical, dental, and vision insurance for employee and dependents
Generous PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)
Paid parental leave
Company-facilitated 401(k)
Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.

Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.

At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.

Startup jobs. A lot of them.

Your next opportunity is in here somewhere. Sign up to explore 52,000+ startups and their open roles. No spam. No gamification. Just jobs.

52,000+

Startups

65,000+

Open Roles

1,500+

New This Week

Technical Writer

ContractHaifa

Contract • Haifa

Backend Developer

ContractNovi Sad, RS

Contract • Novi Sad, RS

Backend Developer

Full-timeNew York, US

Full-time • New York, US

AI Researcher

ContractAustin, US

Contract • Austin, US

Data Scientist

Full-timeSan Francisco, US

Full-time • San Francisco, US

Product Designer

Part-timeCambridge, GB

Part-time • Cambridge, GB