At Baseten we provide all the infrastructure you need to deploy and serve ML models performantly, scalably, and cost-efficiently.
Get started in minutes, and avoid getting tangled in complex deployment processes. You can deploy best-in-class open-source models and take advantage of optimized serving for your own models.
We also utilize horizontally scalable services that take you from prototype to production, with light-speed inference on infra that autoscales with your traffic.
Best in class doesn't mean breaking the bank. Run your models on the best infrastructure without running up costs by taking advantage of our scale-to-zero feature.
Notable recent funding: $300M Series E (company disclosure)
Related Companies
Company | HQ | Industry | Total Funding
Weights & Biases | 🇺🇸 US | Software | $250M
Modular | 🇺🇸 US | Data and Analytics, DeepTech, Information Technology, Software | $380M
SuperAnnotate | 🇺🇸 US | Data and Analytics, DeepTech, Hardware, Information Technology, Software | $18M
LangChain | 🌍 Remote | Data and Analytics, DeepTech, Information Technology, Software | -
Reflection AI | 🇺🇸 US | DeepTech | -
Company Overview
Problem Domain: Production inference and serving for machine-learning models (including LLMs) with emphasis on scalability, performance, and cost control.
Founded: 2019
Industry: Software Development
Funding Track Record
Series B (March 2024): $40M
Series C (February 2025): $75M
Series D (September 2025): $150M
Series E: $300M at a $5B valuation, per company disclosure
Investor Signal
“Baseten has raised late-stage rounds with participation from investors including Bond, IVP, CapitalG, Spark Capital, NVIDIA, Greylock, Conviction, 01 Advisors, BoxGroup, and others.”
Join the Team
Tech Lead Manager
On-Site • San Francisco Bay Area, US
Who you are
Bachelor’s degree in Computer Science, Engineering, or related field, or equivalent experience
5+ years of experience in ML infrastructure, distributed systems, or ML platform engineering, including 2+ years in a tech lead or manager role
Strong expertise in distributed training frameworks and orchestration (FSDP, DDP, ZeRO, Ray, Kubernetes, Slurm, or similar)
Hands-on experience building or scaling training infrastructure for LLMs or other foundation models
Deep understanding of GPU/accelerator hardware utilization, mixed precision training, and scaling efficiency
Proven ability to lead and mentor technical teams while delivering complex infrastructure projects
Excellent communication skills, with the ability to bridge technical depth and business needs
Experience with multi-tenant, production-grade ML platforms
Familiarity with cluster management, GPU scheduling, or elastic resource scaling
Knowledge of advanced model adaptation techniques (LoRA, QLoRA, RLHF, DPO)
Contributions to open-source distributed training or ML infrastructure projects
Experience building developer-friendly APIs or SDKs for ML workflows
Remote-first work environment. The Baseten team is welcome to work from wherever they want: fully remote, in our San Francisco office, or a mix of both. Today, our team (including our founding team) is spread across the United States, Canada, and Armenia. We provide a $1,000 stipend for you to make your home office comfortable and productive
Regular in-person team summits. We get together as a team three times a year to plan, workshop, and most importantly, get to know each other better
Unlimited PTO. We ask that everyone take at least 4 weeks of vacation. And we have a company-wide break between Christmas and New Year's Day
Full healthcare coverage. Medical, dental and vision insurance for you and your family
Mobile Developer
Part-time • Amsterdam, NL
AI Researcher
Part-time • Rotterdam, NL
Technical Writer
Contract • Tel Aviv
Product Designer
Full-time • Haifa
Frontend Developer
Internship • Manchester, GB
Frontend Developer
Full-time • Austin, US
As a Tech Lead Manager of the Training team at Baseten, you’ll lead a team of engineers building the core systems that power large-scale training and fine-tuning of foundation models.
Your team will be responsible for designing scalable, reliable, and efficient infrastructure (covering distributed training frameworks, GPU scheduling, and training pipelines) that enables both Baseten and our customers to train and adapt models at scale.
You’ll balance hands-on technical contributions with people management, setting the technical direction while fostering the growth and success of your team.
You’ll also play a key role in defining Baseten’s platform roadmap by identifying common infrastructure needs and turning them into reusable, self-serve capabilities.
Lead, mentor, and grow a team of engineers building Baseten’s training infrastructure
Define and drive the technical strategy for large-scale training systems, with a focus on scalability, reliability, and efficiency
Architect and optimize distributed training pipelines across heterogeneous GPU/accelerator environments
Balance hands-on contributions (system design, code reviews, prototyping) with people leadership and career development
Establish best practices for training workflows, distributed systems design, and high-performance model evaluation
Collaborate with Product and Platform Engineering to translate customer and internal needs into reusable infrastructure and APIs
Develop processes that ensure consistent, reliable, and on-time delivery of high-quality systems
Stay ahead of the curve on advancements in training efficiency (FSDP, ZeRO, parameter-efficient training, hardware-aware scheduling) and bring them into production
Paid parental leave. 16 weeks of fully paid parental leave (adoptive and non-birth parents included) and flexibility with schedules while returning to work
Company-sponsored 401(k) for you to contribute to
Learning and development budget. We encourage you to take classes, attend conferences, and invest in your craft, and we’ll cover expenses to make it happen