
Pareto.AI is a talent-first human data collection platform for AI research, empowering the top 0.01% of expert labelers to deliver the highest-quality training data. Please note that all…

What they do: Talent-first remote data-labeling and AI training services (high-quality training data, data enrichment, B2B lead gen, outsourced ops)
Founded by: Phoebe Yao
Notable funding: Seed round led by MaC Venture Capital (company announced $4.5M seed)
Workforce: Distributed/remote workforce focused on upskilling (roles: AI Trainer / data labeler)
Focus: Training data quality and scalable human evaluation for AI models; outsourced data and operations support for businesses.
Industry: Software Development
Funding: $4.5M seed round, announced in a company-authored blog post.
Investors: MaC Venture Capital; Crunchbase lists additional investors including Fearless Fund and Slope Agency.
About Us
At Pareto.AI, we’re on a mission to enable top talent around the world to participate in the development of cutting-edge AI models.
In coming years, AI models will transform how we work and create thousands of new AI training jobs for skilled talent around the world. We’ve joined forces with top AI and crowd researchers at Anthropic, Character.AI, Imbue, Stanford, and University of Pennsylvania to build a fair and ethical platform for AI developers to collaborate with domain experts to train bespoke AI models.
Overview
We're looking for a Technical Partnerships Lead to own the end-to-end strategy and execution of data acquisition that powers frontier AI model training. This role sits at the intersection of AI research, partnerships, and growth: you'll work closely with our technical teams to understand what data we need, then figure out how to get it.
This is a zero-to-one role requiring technical fluency to understand what makes high-quality training data for frontier AI systems, combined with creative problem-solving to source it from everywhere: research labs, niche startups, enterprises, and specialist communities. The right person gets excited about the challenge of finding complex, realistic data in unconventional places—someone who can think strategically about where valuable data exists, build trust across wildly different industries, and execute relentlessly.
If you thrive at the intersection of technical depth, creative sourcing, and operational excellence, we'd love to hear from you.
You'll Be The Person Who
Owns data acquisition for frontier AI training
Finds data where others don't look
Closes deals across wildly different industries
Translates between technical and commercial
Builds what doesn't exist yet
Who Thrives Here
You likely have:
We'd Be Especially Excited If You Have
You Won't Thrive Here If
What Makes This Role Unique
The data challenge: You're not sourcing commodity datasets. You're finding complex, realistic, frontier training data that doesn't exist in standard marketplaces. This requires creative thinking about where data lives and how to access it.
The relationship challenge: In a single week, you might negotiate with a bankruptcy trustee, a university IRB committee, a Fortune 500 legal team, and a frontier AI researcher. Each requires completely different approaches.
The building challenge: You're creating this function from scratch. What works for sourcing medical data won't work for legal data. You'll need to experiment, learn fast, and build tailored approaches.
The impact: The data partnerships you build directly determine what our models can learn and how we differentiate in the market. High leverage, high visibility.
Compensation Range: $130K - $170K