
GMI Cloud’s mission is to empower anyone to deploy and scale AI effortlessly. We deliver seamless access to top-tier GPUs and a streamlined ML/LLM software platform for integration, virtualization,…

GMI Cloud’s mission is to empower anyone to deploy and scale AI effortlessly. We deliver seamless access to top-tier GPUs and a streamlined ML/LLM software platform for integration, virtualization,…
What they do: GPU-optimized cloud infrastructure and software for training and deploying large AI models
Founded / HQ: 2023; San Jose / Mountain View area (California)
Scale / team: ~100 employees
Recent funding: $82M Series A (Oct 2024; $15M equity + $67M debt); total capital reported ~$93M
Key partners / investors: Headline Asia (lead), Banpu, Wistron
Infrastructure and platform support for large-scale AI model training and inference
2023
IT System Data Services
$82M (reported; structure: $15M equity + $67M debt)
Round included strategic participants such as Banpu and Wistron; debt component reported as $67M.
“Led by growth/regionally-focused lead investor (Headline Asia) with strategic corporate participants (Banpu, Wistron) and significant debt financing in the round”
| Company |
|---|
About US
GMI Cloud is a fast-growing AI infrastructure company backed by Headline VC and one of only six cloud providers worldwide to earn NVIDIA’s prestigious Reference Platform Cloud Partner designation . We operate 8 of our own GPU clusters across the U.S. and Asia, delivering a full spectrum of services from GPU compute service to AI model inference API solutions. As an NVIDIA Reference Platform Cloud Partner, our infrastructure meets the highest standards for performance, security, and scalability in AI deployments. We empower AI startups and enterprises to “build AI without limits,” providing everything they need to prototype, train, and deploy AI models quickly and reliably.
About this role
We are looking for a Technical Program Manager (TPM) who combines strong program ownership, production sense, and execution rigor with a solid technical foundation in AI infrastructure and distributed systems .
In this role, you will own and drive complex, cross-functional programs that span GPU infrastructure, Kubernetes platforms, inference/training systems, and customer-facing AI services. You will ensure that high-impact initiatives move from design → implementation → production → scale , on time and with high quality.
This is a hands-on TPM role for someone who can:
Key Responsibilities
Program Ownership & Delivery Excellence
Technical & Production Sense
Cross-Functional Leadership
Platform & Infrastructure Programs
Required Skills
Program Management: Proven ability to drive cross-functional technical programs from design to production with strong ownership and execution discipline.
Production Sense: Strong judgment around API reliability, latency, scalability, rollout quality, and operational readiness for production AI services.
AI / LLM Systems: Solid understanding of LLM and multimodal model inference workflows, including text, image, audio, or video APIs.
Inference & Serving: Familiarity with model serving concepts such as throughput, tail latency, batching, streaming, and cost-performance tradeoffs.
AI Infrastructure: General understanding of GPU-based inference systems and their impact on performance and scalability.
Communication: Clear, structured communication with engineering, product, and leadership stakeholders.
Preferred Qualifications
Your next opportunity is in here somewhere. Sign up to explore 52,000+ startups and their open roles. No spam. No gamification. Just jobs.
52,000+
Startups
65,000+
Open Roles
1,500+
New This Week