
Supercharge Generative AI Inference Efficient, fast, and reliable generative AI inference solution for production

Supercharge Generative AI Inference Efficient, fast, and reliable generative AI inference solution for production
What they do: Managed inference cloud for deploying and serving large language and multimodal models with performance optimizations
HQ: Redwood City, California
Founded: 2021
Recent funding: $20M seed extension (Aug 28, 2025)
CEO / Founder: Byung‑Gon (Gon) Chun
AI inference infrastructure for large language and multimodal models
2021
Software Development
$20M
Round announced to expand AI inference platform, go-to-market, and product development
“Capstone Partners led the $20M seed extension with participation from Sierra Ventures, Alumni Ventures, KDB, and KB Securities”
| Company |
|---|
About The Job We’re seeking an Agent Engineer to design and build agentic features in our platform, including document understanding, advanced RAG, and customer support automation. In this role, you will develop not only the agent components themselves, but also the Friendli Agent API, which serves as the core developer interface for building and extending agent applications. You will also build agent applications as production-ready examples of how agents can solve real-world problems.
These applications will be primarily written in Python and will serve as reference implementations for our customers and community. We are looking for a hands-on engineer who is passionate about building agent systems and making AI easy for developers to adopt. The ideal candidate is comfortable creating agent applications that showcase what is possible, is curious about and experienced with open-source models, and enjoys turning them into reliable, high-impact features.
Key Responsibilities
Qualifications
Preferred Experience
Benefits
About FriendliAI FriendliAI is building the world’s best AI inference platform that makes large language and multi-modal models fast, efficient, and deployable at scale. We power high-throughput, low-latency AI workloads for organizations worldwide and integrate directly with Hugging Face, giving developers instant access to over 500,000 open-source models.
We are a small, fast-moving team doing work that matters at one of the most exciting moments in the history of technology. With our world-class inference engine, we are building a platform that the AI industry can actually rely on.
Your next opportunity is in here somewhere. Sign up to explore 52,000+ startups and their open roles. No spam. No gamification. Just jobs.
52,000+
Startups
65,000+
Open Roles
1,300+
New This Week