
Defined.ai is the leading provider of ethical AI data, offering the world’s biggest ethical AI data marketplace alongside subscriptions for flexible data access and custom services. With deep expertise in artificial intelligence and machine learning, Defined.ai delivers high-quality, ethically sourced training data, enabling companies to accelerate their AI solutions with data that is secure, bias-free, and compliant with ethical and legal standards. Founded by Daniela Braga, PhD, in 2015, Defined.ai has earned recognition in top-tier outlets, including Forbes, Fortune, Gartner, CB Insights, and Inc., and has received numerous awards, with appearances on prestigious lists like Forbes AI 50, Deloitte Fast 100, and Inc. 500. The company has raised over $80 million in funding and is headquartered in Seattle, WA, USA, with additional offices in Lisbon, Portugal.

Core offering: Ethically sourced AI training data marketplace and custom data services
Founded: 2015
Founder & CEO: Dr. Daniela Braga
Headquarters: Seattle, WA, USA
Reported total funding: ~$80M (company reports $85M+ in some materials)
| Company | Defined.ai |
|---|---|
| Focus | AI training data quality, ethical sourcing, and dataset provisioning for machine learning/LLMs. |
| Founded | 2015 |
| Industry | IT Services and IT Consulting |
| Series B | $50.5M. Included participation from Semapa Next and Hermes GPE alongside existing investors. |
| Series A | $11.8M |
| Seed | $1.1M. Seed investors included Sony and Amazon Alexa Fund, among others. |
| Investors | Corporate strategic backers (Amazon Alexa Fund, Sony Innovation Fund, Mastercard) and growth/VC firms (Evolution Equity Partners, Kibo Ventures, Portugal Ventures, Semapa Next, Hermes GPE). |
Description
Defined.ai is a leading provider of high-quality, ethically sourced data for Artificial Intelligence (AI) and Machine Learning (ML) model training. We host the world's largest AI marketplace and offer end-to-end services to help companies accelerate their AI solutions. Backed by significant funding and recognized globally for our commitment to ethical AI, we operate in a fast-paced, innovative environment with offices in Seattle and Lisbon.
This is a hybrid position for candidates based in Lisbon, or a remote position for candidates outside Lisbon.
What will you do?
Pipeline Orchestration
Data Transformation
Data Ingestion & Python Development
Data Modeling & Analytics Enablement
Quality, Observability & Reliability
Collaboration & Product Thinking
Who are we looking for?
We’re looking for a proactive, product-minded data engineer who enjoys building reliable systems, improving developer experience, and turning raw data into trusted insights.
Hands-on experience with:
Solid understanding of modern data architectures:
Experience with at least one major cloud platform:
AWS (S3, Glue, Athena), GCP, or Azure
Familiarity with CI/CD pipelines (GitHub Actions, Azure DevOps, etc.)
Nice to have
Benefits
You spend a lot of your time at work, so it should be challenging, fun and interesting. At Defined.ai it will be all of those things and more. Here’s what we offer:
Privacy Notice: defined.ai/candidate-privacy-statement
Experience consuming and building REST APIs (e.g. FastAPI)
Strong problem-solving skills and a pragmatic engineering mindset
Professional proficiency in English (spoken and written)