
Protege is an AI training data platform that connects AI developers with data holders. For AI developers, Protege offers a vast collection of high-quality training data across numerous modalities and…

Protege is an AI training data platform that connects AI developers with data holders. For AI developers, Protege offers a vast collection of high-quality training data across numerous modalities and…
What they do: AI training-data platform that curates, licenses, and delivers real-world multimodal datasets for model development
Founded: 2024
Headquarters / HQ: New York (documented in profiles)
Recent funding: $30M Series A extension led by Andreessen Horowitz (Jan 2026); prior Series A $25M (Aug 2025); $10M seed (Sep 2024)
Founders / leadership: Bobby Samuels (CEO & co-founder); Travis May (Chairman & co-founder)
Bridging the gap between data holders and AI developers by enabling compliant, high-quality dataset sourcing and licensing for model training and evaluation.
2024
Data Infrastructure and Analytics
10000000
Seed round with participation from SV Angel, Liquid 2 Ventures, Bloomberg Beta, Flex Capital, Adam D'Angelo, Travis May, and others
25000000
30000000
Extension that expanded the August 2025 Series A, bringing cumulative funding to $65M since founding
“Includes participation from prominent investors such as CRV, Footwork, Andreessen Horowitz, Bloomberg Beta, Flex Capital, SV Angel, Liquid 2 Ventures, Adam D'Angelo, Travis May, and others”
| Company |
|---|
Company Overview: We are building Protege to solve the biggest unmet need in AI — getting access to the right training data. The process today is time intensive, incredibly expensive, and often ends in failure. The Protege platform facilitates the secure, efficient, and privacy-centric exchange of AI training data.
Solving AI’s data problem is a generational opportunity. We’re backed by world-class investors and already powering partnerships with some of the most ambitious teams in AI. The company that succeeds will be one of the largest in AI — and in tech.
We’re a lean, fast-moving, high-trust team of builders who are obsessed with velocity and impact. Our culture is built for people who thrive on ambiguity, own outcomes, and want to shape the future of data and AI.
Role Overview We’re hiring a Solutions Engineer for our media vertical to connect Protege’s media catalog with customer AI data needs. This is not a traditional modeling role. It is an applied data curation and delivery role for fast-moving, ambiguous environments where both speed and quality matter.
You will work with imperfect, evolving partner datasets and build strategies to normalize, validate, and operationalize them for downstream AI use cases. You’ll become an expert in Protege’s growing catalog of audio, video, and motion capture content — from longform assets with title-level metadata to clip-level content generated with TwelveLabs embeddings.
At a high level, you will understand what customers are building, identify the content that best fits their needs, and deliver datasets that meet both technical and conceptual requirements, often on tight timelines tied to active deals.
What You’ll Do Own data quality and curate media datasets
Be the catalog expert
Operate across product, data, and customer
Drive human-in-the-loop media search and curation
What Success Looks Like 30 days: Learn and get operational
60 days: Deliver and improve
90 days: Scale and influence
What You Bring
Bonus if you also have:
Working with Protege We move fast - thoughtfully. Speed matters in what we're building, but so does intention. We're biased toward action and always learning.
We're a lean, high-trust team. Everyone has real ownership. Clarity and autonomy drive our best work.
We take our work seriously, not ourselves. We solve hard problems with humility and celebrate wins - big and small.
We're kind, direct, and inclusive. We give feedback early and often, with the goal of helping one another grow.
We're builders at heart. Every person at Protege is hands-on, resourceful, and focused on creating momentum.
We grow fast - together. You'll be surrounded by people who care about impact, who challenge you to think bigger, and who are genuinely excited about what comes next.
Your next opportunity is in here somewhere. Sign up to explore 52,000+ startups and their open roles. No spam. No gamification. Just jobs.
52,000+
Startups
65,000+
Open Roles
1,500+
New This Week