
Leading-edge research, proprietary models and agents, and a forward-deployed implementation approach for tangible ROI

Headquarters / Origin: Founded in Paris
Core focus: Agentic "virtual humanoid" AI agents and proprietary foundation models
Flagship products: H Platform, Surfer agent framework, Holo model family (including Holo2)
Team size (approx.): ~329 employees; research lab of 70+ researchers
Notable funding event: $220M seed round announced May 2024
Automating enterprise workflows through agentic AI that can perform web/desktop/mobile computer-use.
Artificial intelligence / Enterprise automation
$220M
Headline $220M seed round announced May 2024; multiple investors reported across the round
Crunchbase lists a convertible note dated May 7, 2024
“Backed by a broad syndicate of venture and strategic investors (reported ~19 backers, including corporate and individual investors)”
About H: H exists to push the boundaries of superintelligence with agentic AI. By automating complex, multi-step tasks typically performed by humans, AI agents will help unlock full human potential.
H is hiring the world’s best AI talent, seeking those who are dedicated as much to building safely and responsibly as to advancing disruptive agentic capabilities. We promote a mindset of openness, learning, and collaboration, where everyone has something to contribute.
About the Team: The Inference team develops and enhances the inference stack for serving H-models that power our agent technology. The team focuses on optimizing hardware utilization to reach high throughput, low latency and cost efficiency in order to deliver a seamless user experience.
Key Responsibilities:
Requirements:
Location:
What we offer:
If you want to change the status quo in AI, join us.
Technical skills:
MS or PhD in Computer Science, Machine Learning or related fields
Proficient in at least one of the following programming languages: Python, Rust or C/C++
Experience with GPU programming (e.g., CUDA, OpenAI Triton, Metal)
Experience in model compression and quantization techniques
Soft skills:
Collaborative mindset, thriving in dynamic, multidisciplinary teams
Strong communication and presentation skills
Eager to explore new challenges
Bonuses:
Experience with LLM serving frameworks such as vLLM, TensorRT-LLM, SGLang, llama.cpp, etc.
Experience with CUDA kernel programming and NCCL
Experience with deep learning inference frameworks (PyTorch/ExecuTorch, ONNX Runtime, GGML, etc.)