SemiAnalysis provides independent research and analysis on the semiconductor and AI industries to inform strategic decisions. They deliver retained advisory, bespoke projects, and data products across the supply chain, from fabrication to AI models and software. Key offerings include industry models such as the Accelerator Industry Model, AI Cloud TCO Model, Datacenter Industry Model, and Wafer Fab Model, plus data products like GPU Rental Pricing and Semiconductor Import-Export Statistics. Their services target hedge funds, asset managers, corporate strategy teams, and semiconductor/AI professionals. The company emphasizes a product-first approach and leverages technologies such as Python, JAX, PyTorch, vLLM, Kubernetes, and SLURM.
SemiAnalysis provides independent research and analysis on the semiconductor and AI industries to inform strategic decisions. They deliver retained advisory, bespoke projects, and data products across the supply chain, from fabrication to AI models and software. Key offerings include industry models such as the Accelerator Industry Model, AI Cloud TCO Model, Datacenter Industry Model, and Wafer Fab Model, plus data products like GPU Rental Pricing and Semiconductor Import-Export Statistics. Their services target hedge funds, asset managers, corporate strategy teams, and semiconductor/AI professionals. The company emphasizes a product-first approach and leverages technologies such as Python, JAX, PyTorch, vLLM, Kubernetes, and SLURM.
We are seeking a Member of Technical Staff to join our team working on ClusterMAX™, the industry standard GPU Cloud rating system. We are hiring at all experience levels with competitive compensation.
As part of the interview process, you will complete a paid coding challenge designed to reflect a day in the life on the SemiAnalysis ML Systems team.
Key Responsibilities
Teeming tracks opportunities at over 24,000 AI startups, then works with you to find (and land) the one you'll love.
AI Researcher
ContractUtrecht, NL
Contract • Utrecht, NL
Machine Learning Engineer
Part-timeBerlin, DE
Part-time • Berlin, DE
Machine Learning Engineer
Part-timeLondon, GB
Part-time • London, GB
Software Engineer
Part-timeNew York, US
Part-time • New York, US
Technical Writer
Part-timeRotterdam, NL
Part-time • Rotterdam, NL
Product Designer
InternshipLondon, GB
Internship • London, GB
Lead the development of next generation benchmarks and TCO analysis for publication in future versions of ClusterMAX™ and related projects.
Collaborate with executives and engineers from over 203 neoclouds, hyperscalers, marketplaces, and sovereign projects such as AWS, Azure, GCP, Oracle, CoreWeave, Nebius, Crusoe, Lambda, and Together
Establish and maintain partnership with AI Chip manufacturers, startups and OEMs such as NVIDIA, AMD, Intel, Google, Amazon, Cerebras, Groq, SuperMicro, Dell, HPE, Lenovo and Cisco
Build on existing relationships with leading AI labs, investors, startups, and community members to gauge their experience working with cloud providers and contribute expertise
Author detailed technical research reports analyzing benchmark results, reliability, and ease of use
Stay current on emerging trends and technologies by attending major conferences such as NeurIPS, MLSys, NVIDIA GTC, OCP, SC, HotChips and more. Travel is encouraged but not required
Qualifications
Experience with ML frameworks such as PyTorch or JAX
Experience with GPU or TPU clusters running kubernetes or slurm
Experience with filesystems such as Weka, VAST, Lustre, and GPFS
Experience with interconnects such as InfiniBand and RoCEv2
Experience with ML system benchmarking (GEMMs, nccl-tests, vllm, sglang, mlperf, STAC, HPL, FIO, torchtitan, megatron, etc.)
Experience working at a hyperscaler, neocloud, server OEM, chip manufacturer, or large scale user of these technologies (preferred)
Proactive and capable of working in a global, distributed team