
Orakl Oncology is a pioneering precision oncology company founded in 2023 as a spin-off from the Gustave Roussy Institute, focused on accelerating oncology drug development through a first-in-class…

Orakl Oncology is a pioneering precision oncology company founded in 2023 as a spin-off from the Gustave Roussy Institute, focused on accelerating oncology drug development through a first-in-class…
Founded: 2023 (spin‑out from Gustave Roussy)
Headquarters: Villejuif / Paris region, France
Focus: AI-powered precision oncology using patient-derived tumor avatars (organoids) and multimodal data
Products: O-Predict and O-Validate (AI commercial products)
Known funding: ≈ €14–15M across 2023–2024 (seed / pre-seed rounds)
Preclinical and clinical de-risking for oncology drug discovery and development, with initial tumor-type focus on colorectal and pancreatic cancers.
2023
Biotechnology Research
€3,000,000
Reported pre-seed/seed round
€11,000,000
Seed round with participation from multiple investors
“Singular; Bpifrance; Speedinvest; Verve Ventures; HCVC; SistaFund; Amazon Web Services”
Orakl Oncology is pioneering a new paradigm in cancer drug development by building the world’s largest cohort of patient-derived organoid (PDO) avatars. Through our unique platform, we generate extensive multi-modal data from these avatars — combined with rich clinical data from hospital partners — to discover and validate new oncology therapeutics with real-world patient relevance.
We are seeking a Senior Data Scientist to own the end-to-end clinical data chain at Orakl: from the design of data collection protocols with hospital partners, to the delivery of clean, structured, AI-ready datasets to our data science teams. This is a foundational role that sits at the intersection of clinical domain knowledge, data engineering, and machine learning infrastructure. You will work hand-in-hand with clinicians, data scientists, and regulatory experts to build the clinical data backbone that powers our flagship predictive oncology platform.
Your next opportunity is in here somewhere. Sign up to explore 52,000+ startups and their open roles. No spam. No gamification. Just jobs.
52,000+
Startups
65,000+
Open Roles
1,600+
New This Week
| Company |
|---|
Design Clinical Data Collection Protocols: Working directly with hospital teams and Clinical Research Associates (CRAs) , you’ll define the data points to collect based on clinical domain knowledge and predictive power, and translate them into electronic Case Report Form (eCRF) and data collection protocols ready for deployment in real clinical environments.
Own the Clinical Data Model: You’ll evaluate and decide on the right clinical data standards for Orakl’s context (FHIR, OMOP, or other), then define and maintain a unified data model that accommodates heterogeneous sources across hospital partners and scales as our network grows.
Build End-to-End Clinical Data Pipelines: You’ll develop and operate robust end to end pipelines, from raw eCRF outputs and hospital exports to structured, validated, AI-ready datasets. You will ensure every table delivered to data scientists is clean, consistent, and immediately usable.
Develop Hospital Feedback Loops: You’ll implement data quality control processes that automatically flag errors, inconsistencies, and anomalies in data received from hospital partners, and turn them into actionable feedback loops that protect both data quality and the partnership.
Feature Extraction: You’ll build PoCs for non-standard data source extractions: IHC, free-text clinical notes, and beyond, unlocking clinical signals for our AI models.### Minimum Qualifications