
Zero distance innovation for GenAI creators and industries Expertly engineering platforms and curating multimodal, multilingual data, we empower the ‘Magnificent Seven’ and enterprise clients with safe, scalable AI deployment We a team of over 150 PhDs and data scientists, along with more than 4,000 AI practitioners and engineers. We bring platforms, partners and 1.8 million vertical domain experts to create high-quality pre-trained datasets, fine-tuned industry-specific LLMs, and RAG pipelines supported by vector databases. These innovations can reduce GenAI costs by up to 80% and bring GenAI solutions to market 50% faster in 230 locales.

Zero distance innovation for GenAI creators and industries Expertly engineering platforms and curating multimodal, multilingual data, we empower the ‘Magnificent Seven’ and enterprise clients with safe, scalable AI deployment We a team of over 150 PhDs and data scientists, along with more than 4,000 AI practitioners and engineers. We bring platforms, partners and 1.8 million vertical domain experts to create high-quality pre-trained datasets, fine-tuned industry-specific LLMs, and RAG pipelines supported by vector databases. These innovations can reduce GenAI costs by up to 80% and bring GenAI solutions to market 50% faster in 230 locales.
What they do: Global AI data and platform company providing dataset generation, RLHF/human-in-the-loop pipelines, and a marketplace to accelerate model-to-production
Founded / HQ: 2020; Redmond, Washington
Team size: Over 3,000 employees (company snapshot lists 3,126)
Recent funding: $60M Series A (announced June 24, 2025), lead investor Granite Asia
AI training data, model evaluation and deployment workflows for enterprises and large-scale models
2020
Software Development
$60 million
Company announcement of Series A to scale AI infrastructure and product roadmap
“Led by Granite Asia (Series A)”
| Company |
|---|
Role Overview
Centifics DAC Command builds AI- and data-driven automation that must be trustworthy, privacy-aware, and auditable. The Data & Governance Architect is responsible for defining data governance architecture—classification, access, lineage, retention, and privacy controls—across pipelines, products, and platforms.
You will partner with data engineering, AI teams, security/compliance, and product stakeholders to ensure governance rules are embedded by design and enforceable in production. This includes special focus on PII handling, retention/legal hold, and AI data governance (training vs. inference separation and leakage prevention).
This is a hands-on architecture role where you will define governance standards, implementable patterns, and reference designs that scale across multi-tenant platforms and multiple customer domains.
1. Data Classification & Metadata Architecture
2. Privacy Controls for PII and Regulated Data
3. Retention, Deletion & Legal Hold
4. Lineage, Traceability & Auditability
5. Policy Enforcement & Governance-by-Design
6. AI Data Governance (Training vs. Inference)
Required Experience & Skills
Core Experience
Data Platform & Governance Tooling
Privacy, Retention & Compliance
AI/LLM Data Governance
Soft Skills & Ways of Working
Nice-to-Have / Preferred