
Protege is an AI training data platform that shortens procurement time for high-quality, multimodal training datasets while preserving governance and intellectual property controls. The platform…

Protege is an AI training data platform that shortens procurement time for high-quality, multimodal training datasets while preserving governance and intellectual property controls. The platform…
What they do: AI training-data platform that curates, licenses, and delivers real-world multimodal datasets for model development
Founded: 2024
Headquarters / HQ: New York (documented in profiles)
Recent funding: $30M Series A extension led by Andreessen Horowitz (Jan 2026); prior Series A $25M (Aug 2025); $10M seed (Sep 2024)
Founders / leadership: Bobby Samuels (CEO & co-founder); Travis May (Chairman & co-founder)
Bridging the gap between data holders and AI developers by enabling compliant, high-quality dataset sourcing and licensing for model training and evaluation.
2024
Data Infrastructure and Analytics
10000000
Seed round with participation from SV Angel, Liquid 2 Ventures, Bloomberg Beta, Flex Capital, Adam D'Angelo, Travis May, and others
25000000
30000000
Extension that expanded the August 2025 Series A, bringing cumulative funding to $65M since founding
“Includes participation from prominent investors such as CRV, Footwork, Andreessen Horowitz, Bloomberg Beta, Flex Capital, SV Angel, Liquid 2 Ventures, Adam D'Angelo, Travis May, and others”
| Company |
|---|
Company Overview: We are building Protege to solve the biggest unmet need in AI — getting access to the right training data. The process today is time intensive, incredibly expensive, and often ends in failure. The Protege platform facilitates the secure, efficient, and privacy-centric exchange of AI training data.
Solving AI’s data problem is a generational opportunity. We’re backed by world-class investors and already powering partnerships with some of the most ambitious teams in AI. The company that succeeds will be one of the largest in AI — and in tech.
We’re a lean, fast-moving, high-trust team of builders who are obsessed with velocity and impact. Our culture is built for people who thrive on ambiguity, own outcomes, and want to shape the future of data and AI.
Role Overview: Our Business Development Representative is responsible for building a high-quality healthcare pipeline through disciplined, insight-led outbound prospecting. Focusing on organizations that provide real-world data for model development, you’ll run targeted sequences, qualify opportunities rigorously, and be responsible for daily prospecting execution.
Key Responsibilities:
What Success Looks like:
30 days: Learn the Healthcare motion + start producing
Learn Protege’s healthcare positioning, core use cases, and target personas. Build a clean outbound workflow: target account list, sequences, call talk tracks, and daily activity targets. Begin consistent outbound execution and start booking initial qualified meetings
What you bring:
Working with Protege:
Why Protege:
Your next opportunity is in here somewhere. Sign up to explore 70,000+ startups and their open roles. No spam. No gamification. Just jobs.
70,000+
Startups
81,000+
Open Roles
4,500+
New This Week
60 days: Run repeatable outbound plays + tighten conversion
Operate as a quota-bearing Healthcare BDR with steady weekly output. Refine ICP targeting and messaging based on results and feedback from Sales/GM. Lock qualification criteria and handoff rules, and document 1–2 outbound plays (targeting + messaging + sequence + talk track) that can be reused
90 days: Scale what works + make pipeline predictable
Scale the highest-performing outbound plays while improving meeting quality (fewer bad-fit meetings). Improve conversion (meeting → opportunity) and show rates. Deliver a simple 90-day retro with what worked, what did not, and the next 1–2 experiments to run, plus any enablement/tooling gaps needed to hit pipeline goals