
Supercharge Generative AI Inference Efficient, fast, and reliable generative AI inference solution for production

Supercharge Generative AI Inference Efficient, fast, and reliable generative AI inference solution for production
What they do: Managed inference cloud for deploying and serving large language and multimodal models with performance optimizations
HQ: Redwood City, California
Founded: 2021
Recent funding: $20M seed extension (Aug 28, 2025)
CEO / Founder: Byung‑Gon (Gon) Chun
AI inference infrastructure for large language and multimodal models
2021
Software Development
$20M
Round announced to expand AI inference platform, go-to-market, and product development
“Capstone Partners led the $20M seed extension with participation from Sierra Ventures, Alumni Ventures, KDB, and KB Securities”
| Company |
|---|
About The Job We believe using large language and multimodal models should be as simple as calling an API. To achieve this in production, we need to serve enterprises across clouds, with authentication, billing, multi-tenant isolation, and zero tolerance for downtime.
We are looking for a Senior Backend Engineer who is excited by the full breadth of what it takes to run a platform in production. You will own the business logic layer that sits between our inference engine and every customer who relies on it. Your work spans API engineering, service development, and data architecture. If you like solving problems that only reveal themselves in the wild, this is your role: edge cases in multi-cloud orchestration, enterprise requirements that don’t fit neatly into a spec, performance bottlenecks that are hard to reproduce.
You will move across domains, make decisions under uncertainty, and build systems that work cleanly, reliably, and at scale. We are looking for people with a track record of owning complex systems in production and solving unique problems. A great candidate is a strong collaborator who enjoys solving complex architectural challenges, cares deeply about developer workflows, and is eager to help define the future of AI adoption.
Key Responsibilities
Qualifications
Preferred Experience
Benefits
About FriendliAI FriendliAI is building the world’s best AI inference platform that makes large language and multi-modal models fast, efficient, and deployable at scale. We power high-throughput, low-latency AI workloads for organizations worldwide and integrate directly with Hugging Face, giving developers instant access to over 500,000 open-source models.
We are a small, fast-moving team doing work that matters at one of the most exciting moments in the history of technology. With our world-class inference engine, we are building a platform that the AI industry can actually rely on.
Your next opportunity is in here somewhere. Sign up to explore 52,000+ startups and their open roles. No spam. No gamification. Just jobs.
52,000+
Startups
65,000+
Open Roles
1,300+
New This Week