
Since 2017, the number of Python users has been increasing by millions annually. The vast majority of these people use Python as a tool to solve problems at work. Our mission is to make them autonomous when they create and use data in their organizations. To this end, we are building an open-source Python library called data load tool (dlt). Our users call dlt from their Python scripts to turn messy, unstructured data into regularly updated datasets. It empowers them to create data pipelines that are highly scalable, easy to maintain, and straightforward to deploy, without having to wait for help from a data engineer. We are dedicated to keeping dlt an open-source project surrounded by a vibrant, engaged community. To make this sustainable, dltHub stewards dlt while also offering additional software and services that generate revenue (similar to what GitHub does with Git). dltHub is based in Berlin and New York City. It was founded by data and machine learning veterans. We are backed by Dig Ventures and many technical founders from companies such as Hugging Face, Instana, Matillion, Miro, and Rasa.

Product: Open-source Python library 'dlt' for building EL/ELT pipelines; platform product 'dltHub' planned
Headquarters: Berlin and New York City
Founders: Four co-founders including Matthaus Krzykowski (CEO) and Marcin Rudolf (CTO)
Funding: $1.5M pre-seed (Jul 2023), lead investor Dig Ventures
Team size: 38 employees (stated)
Data engineering / EL/ELT for Python-first teams; pipeline automation, data ingestion, and governance.
Data infrastructure / Data engineering
$1.5M
Round announced July 20, 2023
“Backed by Dig Ventures and other venture/backer mentions including Foundation Capital and technical founders”
About dltHub
At dltHub, we're building on the success of dlt, the most popular open-source library for data loading in Python, with about 4 million monthly downloads.
After proving its value in the OSS community, we’re now building dltHub to make data engineering accessible to all Python developers, whether they’re launching their first analytics workflow or powering full ML-driven organizations.
Our mission is to make data engineering as accessible, collaborative, and frictionless as writing Python itself. dltHub is a workspace and runtime that empowers any Python developer to build, run, and maintain data pipelines, transformations, and notebooks, without needing a data platform or infrastructure team.
Our team is based in Berlin, now growing into other locations, especially across the US. Founded by experienced data and ML engineers, we're backed by Foundation Capital, Dig Ventures, and technical founders from Hugging Face, Instana, Matillion, Miro, MotherDuck, Datadog, Mode, and Rasa.
Our team is driven by clear values: we speak with courage, build what matters, automate relentlessly, show up with energy, deliver on commitments, and win together.
🚀 About the Role
We are building the next generation of AI-native data tooling: an AI-first data platform on top of our popular open-source data ingestion core.
We are looking for an entrepreneurial software engineer based in Berlin who thrives in a fast-paced environment with a high degree of autonomy and ownership. You will work on solving data platform challenges for our customers: ingestion, storage, performance, reliability, observability and AI-assisted developer experience. If you enjoy working with an ambitious and driven team and care deeply about code quality, data systems, and building tools that engineers enjoy using, this role is for you.
🗂️ Key Responsibilities
What We’re Looking For
✅ Qualifications
🎯 Nice to Have
In our work culture, we value each other’s autonomy and efficiency. We have set hours for communication and deep work. We like automation, so we automate our work before we automate the work of others.