
GMI Cloud provides scalable, high-performance GPU infrastructure to run and deploy AI and machine learning models. The platform offers GPU instances with top-tier NVIDIA accelerators, a Cluster…

GMI Cloud provides scalable, high-performance GPU infrastructure to run and deploy AI and machine learning models. The platform offers GPU instances with top-tier NVIDIA accelerators, a Cluster…
Headquarters: San Jose
Founded: 2023
Product: AI-native GPU cloud for production inference (APIs, orchestration, dedicated NVIDIA GPU compute)
Recent funding: $82M Series A (Oct 2024) led by Headline Asia
Compliance & SLAs: Enterprise SLAs and compliance (SOC 2, ISO 27001)
Production AI inference infrastructure and GPU cloud compute
2023
IT System Data Services
$82M
Round includes equity and debt; participation from Banpu NEXT and Wistron
“Led by Headline Asia with participation from Banpu NEXT and Wistron”
| Company |
|---|
About US
GMI Cloud is a fast-growing AI infrastructure company backed by Headline VC and one of only six cloud providers worldwide to earn NVIDIA’s prestigious Reference Platform Cloud Partner designation . We operate 8 of our own GPU clusters across the U.S. and Asia, delivering a full spectrum of services from GPU compute service to AI model inference API solutions. As an NVIDIA Reference Platform Cloud Partner, our infrastructure meets the highest standards for performance, security, and scalability in AI deployments. We empower AI startups and enterprises to “build AI without limits,” providing everything they need to prototype, train, and deploy AI models quickly and reliably.
About this role
We are looking for a Technical Program Manager (TPM) who combines strong program ownership, production sense, and execution rigor with a solid technical foundation in AI infrastructure and distributed systems .
In this role, you will own and drive complex, cross-functional programs that span GPU infrastructure, Kubernetes platforms, inference/training systems, and customer-facing AI services. You will ensure that high-impact initiatives move from design → implementation → production → scale , on time and with high quality.
This is a hands-on TPM role for someone who can:
Key Responsibilities
Program Ownership & Delivery Excellence
Technical & Production Sense
Cross-Functional Leadership
Platform & Infrastructure Programs
Required Skills
Program Management: Proven ability to drive cross-functional technical programs from design to production with strong ownership and execution discipline.
Production Sense: Strong judgment around API reliability, latency, scalability, rollout quality, and operational readiness for production AI services.
AI / LLM Systems: Solid understanding of LLM and multimodal model inference workflows, including text, image, audio, or video APIs.
Inference & Serving: Familiarity with model serving concepts such as throughput, tail latency, batching, streaming, and cost-performance tradeoffs.
AI Infrastructure: General understanding of GPU-based inference systems and their impact on performance and scalability.
Communication: Clear, structured communication with engineering, product, and leadership stakeholders.
Preferred Qualifications
Your next opportunity is in here somewhere. Sign up to explore 70,000+ startups and their open roles. No spam. No gamification. Just jobs.
70,000+
Startups
81,000+
Open Roles
4,500+
New This Week