
DataTecnica is a company focused on accelerating innovation and maximizing the potential of scientific communities within healthcare. They leverage expertise in data science, computer science, biology, and medicine to build software tools and platforms that expedite research for clients in federal health, biotech, and philanthropy. Their business model involves collaborative software development and deploying open-source toolkits for genomics and machine learning workflows, aiming to increase research efficiency and transparency through automation and standardization. Key products include DataTecnica.AI, a biomedical research assistant, and GenoML, an open-source toolkit for genomics and machine learning. They emphasize open science, making data and code accessible to the scientific community.

DataTecnica is a company focused on accelerating innovation and maximizing the potential of scientific communities within healthcare. They leverage expertise in data science, computer science, biology, and medicine to build software tools and platforms that expedite research for clients in federal health, biotech, and philanthropy. Their business model involves collaborative software development and deploying open-source toolkits for genomics and machine learning workflows, aiming to increase research efficiency and transparency through automation and standardization. Key products include DataTecnica.AI, a biomedical research assistant, and GenoML, an open-source toolkit for genomics and machine learning. They emphasize open science, making data and code accessible to the scientific community.
Genetics Scientist (Whole Genome Sequencing)
DataTecnica | Remote/Flexible | Full-time (2,000 hours per year contracting)
About DataTecnica
DataTecnica is a biomedical AI company specializing in analytics for neurological and other diseases. We work at the intersection of large-scale genomics, clinical data, and translational research—partnering with major initiatives from academia and non-profits to tech including large scale collaborative efforts like the Global Parkinson's Genetics Program (GP2), and the NIH’s Center for Alzheimer’s and Related Dementias (CARD). Our team has contributed to over 700 peer-reviewed publications in neurogenetics and neurodegenerative disease research.
About the Role
We're seeking a Genetics Scientist (Whole Genome Sequencing) to assist with WGS data processing and variant discovery for precision medicine programs targeting Parkinson’s disease risk and progression for GP2. You'll drive large-scale joint calling pipelines and execute sophisticated variant analyses including short tandem repeats (STRs), structural variants (SVs), and rare variant burden tests at biobank scale.
What You'll Do
Primary Responsibilities (70%)
Secondary Responsibilities (30%)
Required Qualifications
Preferred Qualifications
Technical Environment
You'll work in cloud-native genomics environments including Terra, Verily Workbench, and the UK Biobank RAP. Familiarity with WDL/Cromwell or Nextflow workflows is strongly preferred. We leverage both CPU and GPU compute for large-scale analyses in the cloud.
To Apply
Submit CV and cover letter with links to 2-3 representative publications demonstrating WGS processing and analysis expertise as well as related GitHub content.
Compensation
$100,000 to $200,000 / year total compensation commensurate with experience.