
Supercharge Generative AI Inference
Efficient, fast, and reliable generative AI inference solution for production

What they do: High-performance generative AI inference tooling and managed platforms for deploying, scaling, and monitoring large language and multimodal models
Founded: 2021
HQ / hubs: Redwood City, California; hub in Seoul, Korea
Recent financing: $20M seed extension led by Capstone Partners (announced Aug 28, 2025)
Founder / CEO: Byung-Gon Chun
Focus: Generative AI inference infrastructure for production deployments of LLMs and multimodal models
Industry: Software Development
Other investors in the $20M round: Sierra Ventures, Alumni Ventures, KDB Investment, and KB Securities (announced by company)
Prior financing: $6M seed round reported in late 2021
Staff Engineer
Location: San Francisco, CA / San Mateo, CA / Seoul, KR
About the Job
FriendliAI is seeking a Staff Software Engineer to provide technical leadership in building the core systems behind our AI inference platforms. This role has broad technical scope and company-wide impact, shaping architectural decisions that affect multiple products, teams, and customer workloads. You will work at the intersection of distributed systems, AI model serving, agent execution platforms, and developer infrastructure, building foundational systems that power production-scale inference.
As a Staff Engineer, you will define long-term technical direction, lead complex multi-quarter initiatives, and influence engineering decisions well beyond your immediate team. This is a deeply hands-on role with significant architectural ownership and cross-functional impact.
Key Responsibilities
Qualifications
Preferred Experience
Benefits
About us
FriendliAI is building the next-generation AI inference platform that accelerates the deployment of large language and multimodal models with unmatched performance and efficiency. Our infrastructure powers high-throughput, low-latency workloads for global organizations and integrates directly with Hugging Face, providing instant access to over 480,000 open-source models. We are on a mission to deliver the world’s best platform for AI inference.