
Defined.ai operates a marketplace and subscription platform that supplies ethically sourced, annotated training data and custom data services to teams building AI models. It curates and sells…

What they do: Ethically sourced AI training data, marketplace, subscriptions, and custom data services
Founded: 2015 by Daniela Braga (Seattle, WA)
Funding (approx.): Series B of $50.5M disclosed (May 2020); total reported at roughly $78–80M+
Headquarters: Seattle, Washington, USA (R&D center in Lisbon, Portugal)
Industry: IT Services and IT Consulting
Investors: Strategic investors include Amazon Alexa Fund, Sony Innovation Fund, and corporate backers; Series A led by Evolution Equity Partners
Description
Defined.ai is a leading provider of high-quality, ethically sourced data for Artificial Intelligence (AI) and Machine Learning (ML) model training. We host the world's largest AI marketplace and offer end-to-end services to help companies accelerate their AI solutions. Backed by significant funding and recognized globally for our commitment to ethical AI, we operate in a fast-paced, innovative environment with offices in Seattle and Lisbon.
This is a hybrid position in Lisbon or a remote position outside Lisbon.
What will you do?
Pipeline Orchestration
Data Transformation
Data Ingestion & Python Development
Data Modeling & Analytics Enablement
Quality, Observability & Reliability
Collaboration & Product Thinking
Who are we looking for?
We’re looking for a proactive, product-minded data engineer who enjoys building reliable systems, improving developer experience, and turning raw data into trusted insights.
Hands-on experience with:
Solid understanding of modern data architectures:
Experience with at least one major cloud platform:
AWS (S3, Glue, Athena), GCP, or Azure
Familiarity with CI/CD pipelines (GitHub Actions, Azure DevOps, etc.)
Experience consuming and building REST APIs (e.g. FastAPI)
Strong problem-solving skills and a pragmatic engineering mindset
Professional proficiency in English (spoken and written)
Nice to have
Benefits
You spend a lot of your time at work, so it should be challenging, fun and interesting. At Defined.ai it will be all of those things and more. Here’s what we offer:
Privacy Notice: defined.ai/candidate-privacy-statement