GMI Cloud

GMI Cloud’s mission is to empower anyone to deploy and scale AI effortlessly. We deliver seamless access to top-tier GPUs and a streamlined ML/LLM software platform for integration, virtualization,…

gmicloud.ai

HQ: US

Team Size: 114

Open Jobs: 4

Total Funding: -

Latest Fundraise: Unknown

TL;DR

Headquarters: San Jose

Founded: 2023

Product: AI-native GPU cloud for production inference (APIs, orchestration, dedicated NVIDIA GPU compute)

Recent funding: $82M Series A (Oct 2024) led by Headline Asia

Compliance & SLAs: Enterprise SLAs and compliance (SOC 2, ISO 27001)

Company Overview

Problem Domain

Production AI inference infrastructure and GPU cloud compute

Founded

2023

Industry

IT System Data Services

Tech Stack

NVIDIA H100

NVIDIA H200

NVIDIA Blackwell-class GPUs

Kubernetes

Serverless inference APIs

Funding Track Record

Series A - October 2024

$82M

Round includes equity and debt; participation from Banpu NEXT and Wistron

Investor Signal

“Led by Headline Asia with participation from Banpu NEXT and Wistron”

Founders

What we do

Join the Team

Solutions Architect

Hybrid • Neihu District, Taipei City

Related Companies

Company	HQ	Industry	Total Funding
GRUVE TECHNOLOGIES INDIA PRIVATE LIMITED	🌍 Remote	Data and Analytics, DeepTech, Information Technology, Software	-
Ema	🇺🇸 San Francisco, US	Administrative Services, Data and Analytics, DeepTech, Health, HR and Recruiting, Information Technology, Professional Services, Software	$61M
SpiNNcloud	🇩🇪 Dresden, DE	Biotechnology, DeepTech, Hardware, Information Technology, Manufacturing	$15M
Mistral AI	🇫🇷 FR	Data and Analytics, DeepTech, Information Technology, Software	$3B
Zime	🇺🇸 US	Data and Analytics, Information Technology, Sales and Marketing, Software	-

Overview

We are seeking a highly skilled Solutions Architect with strong expertise in GPU-based cloud infrastructure who can bridge technical architecture and business strategy. This role will design scalable GPU cloud solutions, work closely with customers and partners, and translate complex requirements into actionable architectures and business value.

Key Responsibilities

Technical Architecture

Design and architect GPU cloud platforms (including H100/H200/B200/L40S, GB200/GB300 clusters, and multi-rack setups).
Plan and optimize infrastructure topology, including network, storage, security, GPU scheduling, and virtualization/containerization (Kubernetes, Slurm, etc.).
Evaluate hardware options and set clear targets for performance benchmarks, TCO, and performance per watt.
Define best practices for MLOps / LLM training / inference stacks.
Provide reference architectures and solution playbooks for different customer use cases.

Pre-Sales & Business Enablement

Work with customers to understand business needs and translate them into technical solutions.
Prepare solution proposals, cost estimates, TCO analysis, and ROI models.
Present technical solutions to executives, VPs, CTOs, or procurement teams.
Support proof-of-concepts (POC), demo environments, and customer onboarding.
Communicate competitive advantages and differentiate services against AWS / Azure / other GPU providers.

Cross-Team Collaboration

Work with product, engineering, and operations teams to ensure solution feasibility.
Provide feedback for roadmap planning and service offerings.
Collaborate with data center teams on capacity planning, expansion strategy, and reliability.
Document solution standards, guidelines, and operational runbooks.

Customer Success & Long-Term Strategy

Act as a trusted technical advisor for key enterprise customers.
Propose scaling strategies, cost optimization, and continuous performance improvements.
Gather customer requirements to influence product direction & pricing strategy.
Build long-term architecture visions and solution frameworks for AI workloads.

Qualifications

Required

Bachelor’s/Master’s in Computer Science, Engineering, or related field.
5+ years of experience in cloud architecture / infrastructure / solution engineering.
Strong understanding of GPU workloads, parallel computing, AI/ML pipelines, and LLM training/inference.
Hands-on knowledge of:
o Kubernetes / Docker / Slurm / Ray
o Linux, HPC, networking fundamentals
o GPU resource management & scheduling
Experience in customer-facing technical roles (pre-sales, consulting, PoC, enterprise projects).
Proven ability to explain complex ideas to business stakeholders and non-technical audiences.

Preferred

Experience with data center operations or multi-rack GPU deployment.
Familiarity with cloud economics / TCO analysis / business modeling.
Strong presentation skills & ability to write proposals.
Understanding of security/compliance standards (ISO 27001, SOC 2, etc.).
Multi-language ability (English / Chinese / Japanese) is a plus.

Soft Skills

Solution-oriented and business-driven mindset.
Strong communication and client engagement skills.
Able to work independently under pressure.
Strategic thinker with hands-on execution ability.
Team player across departments (Product, Ops, Engineering, Sales).
