Miraei AI speeds drug development by reducing knowledge work and accelerating clinical decisions. It provides an agentic intelligence layer for pharma workflows that reason, adapt, and keep pace with complex development. The platform combines automated reasoning with adaptive workflows for life sciences teams. As a B2B SaaS solution, it integrates into clinical data platforms and workflow systems. Miraei aims to scale across programs to shorten timelines, expand approvals, and improve patient reach.
BiopharmaBiotechClinical DevelopmentCompetitive IntelligenceGenerative AILarge Language ModelsLead GenerationOncologyPredictive AnalyticsStrategymiraei.ai
Miraei AI
Miraei AI speeds drug development by reducing knowledge work and accelerating clinical decisions. It provides an agentic intelligence layer for pharma workflows that reason, adapt, and keep pace with complex development. The platform combines automated reasoning with adaptive workflows for life sciences teams. As a B2B SaaS solution, it integrates into clinical data platforms and workflow systems. Miraei aims to scale across programs to shorten timelines, expand approvals, and improve patient reach.
BiopharmaBiotechClinical DevelopmentCompetitive IntelligenceGenerative AILarge Language ModelsLead GenerationOncologyPredictive AnalyticsStrategymiraei.ai
HQRemote
Team Size4
Open Jobs1
Total Funding-
Latest FundraiseUnknown
Join the Team
Founding Data Engineer
RemoteUS
Remote • US
Teeming tracks opportunities at over 24,000 AI startups, then works with you to find (and land) the one you'll love.
Technical Writer
InternshipUtrecht, NL
Internship • Utrecht, NL
Machine Learning Engineer
ContractHaifa
Contract • Haifa
Technical Writer
ContractHaifa
Contract • Haifa
DevOps Engineer
Full-timeNiš, RS
Full-time • Niš, RS
Product Designer
ContractTel Aviv
Contract • Tel Aviv
Product Designer
Full-timeNovi Sad, RS
Full-time • Novi Sad, RS
Founding Data Engineer, Clinical Trials and Oncology Data
Location:
Hybrid, San Francisco and/or Los Angeles
Experience:
3 to 7 years
Type:
Full-time
Stage:
Early, founding engineering hire
About Miraei
Miraei is building the deal engine for life sciences.
Business development in life sciences is still driven by fragmented data, manual research, and slow, relationship-heavy workflows. Miraei changes that by structuring and continuously tracking clinical trials and scientific data, then transforming it into actionable intelligence that powers how deals are identified, evaluated, and executed.
We start by helping vendors and diagnostics companies identify and engage the right biopharma partners around active and emerging clinical trials. Over time, Miraei becomes the platform where life sciences deals occur end to end, from vendors to biopharma, biopharma to biotechs, and cross-border partnerships such as biopharma seeking assets and collaborators internationally.
We are venture-backed and are generating revenue from enterprise customers.
The role
We are hiring a Founding Data Engineer to design and own the core data architecture, pipelines, and processes that powers Miraei. This role is responsible for building the canonical data models for clinical trial intelligence and ensuring our data pipelines are scalable and reliable as we ingest more sources, trials, and send out real-time updates.
This is a hands-on individual contributor role. You will write production code, make architectural decisions, and shape the long-term data foundation of the company.
What you will do
Design and implement core data schemas for clinical trial data and data sources related to clinical assets, including
What we’re looking for
Nice to have
Oncology domain expertise or familiarity
Experience with ontology, RAG/knowledge graph, vector databases or other information retrieval experience
Exposure to ML feature pipelines, context engineering, prompt engineering, and other AI-adjacent systems
Compensation and benefits
Base salary:
$150k to $180k, depending on experience
Equity:
0.75% to 1.5% fully diluted, 4-year vest with 1-year cliff
Benefits:
Full benefits included
Why this role matters
The data layer is the product. Decisions made here will define what Miraei can and cannot become. This is a foundational role with real ownership, autonomy, and long-term impact.
Trials, arms, cohorts, endpoints, biomarkers, sponsors, and timelines
Longitudinal versioning across abstracts, amendments, and readouts
Press releases, news, and publications
Build hierarchical taxonomies and ontologies for oncology and clinical research
Indications, modalities, mechanisms of action, biomarkers, endpoints
Architect and maintain data ingestion pipelines from
Conference abstracts
Clinical trial registries
Publications and structured internal outputs
Enable longitudinal tracking and alerting as trials evolve over time
Partner closely with product and ML to ensure the data model supports downstream reasoning and user workflows
Make pragmatic early-stage tradeoffs and evolve the system as the company scales
3 to 7 years of experience as a data engineer or analytics engineer
Prior experience working with clinical trial or life sciences data strongly preferred
Pharma, biotech, diagnostics, CRO, real-world data, or clinical informatics
Startup experience required
You have built systems in ambiguous, fast-moving environments
Strong fundamentals in:
Database design (OLTP/OLAP), data modeling, metadata management, and schema design
Skills in building reliable ETL/ELT pipelines, data integration, transformation, validation, and orchestration
SQL/Python/Bash scripting
Cloud-based data infrastructure (AWS/GCP)
Experience with modern software development tools, such as version control (git), automations/CI/CD (GitHub actions, Jenkins, etc), Docker containerization, etc
Comfortable owning systems end to end as a senior IC
Clear communicator who can explain tradeoffs and push back when needed
Must be authorized to work in the United States. Visa sponsorship is not available for this position.