
Reducto turns unstructured documents—like PDFs, images, and spreadsheets—into clean, structured data ready for any workflow. Our multi-pass parsing system combines OCR and vision-language models to…

Product: API-first document intelligence: parse, split, extract, edit for 30+ formats and 100+ languages
Tech: Multi-pass pipeline combining layout-aware computer vision, vision-language models, and Agentic OCR
Founding: Founded out of MIT by Adit Abraham and Raunak Chowdhuri
Security & Deployment: Enterprise features: SOC 2, HIPAA-capable, on-prem/VPC, zero data retention, BAA options
Recent Funding: Raised a $24.5M Series A (Apr 25, 2025); prior Seed $8.4M and Pre-Seed $0.5M
Document intelligence / data extraction from unstructured documents for AI workflows
Software Development
Backed by First Round Capital, Benchmark, Y Combinator, BoxGroup, and a16z (announced Series B led by a16z; company statements note $108M in total funding)
About Reducto
Reducto helps AI teams ingest real-world enterprise data with state-of-the-art accuracy.
Most enterprise data, from financial statements to health records, is locked in unstructured file formats like PDFs and spreadsheets. We train vision models to read those documents the way a human would, enabling teams to build products, train models, and automate processes at scale.
We’ve grown rapidly, increasing revenue 7x year over year and partnering with hundreds of companies, from leading AI teams like Harvey, Vanta, and Scale, to enterprise customers across FAANG and top trading firms.
Reducto has raised over $100M from world-class investors including a16z, Benchmark, and First Round Capital.
The Opportunity
As an ML Eval Engineer, you’ll play a key role in building the evaluation systems and benchmarks that make Reducto’s models better over time. You’ll collaborate closely with our ML, platform, and GTM teams to identify model weaknesses, design strong benchmarks, and create metrics and tooling that surface new failure modes as we scale. This is a high-impact role where you’ll help define how model quality is measured at Reducto and shape the systems we use to improve it.
What You’ll Do
You’ll Thrive Here If You:
Bonus Points If You:
This is an in-person role at our office in SF. We’re an early-stage company, which means that the role requires working hard and moving quickly. Please only apply if that excites you.
About Reducto
Nearly 80% of enterprise data is in unstructured formats like PDFs
PDFs are the status quo for enterprise knowledge in nearly every industry. Insurance claims, financial statements, invoices, and health records are all stored in a structure that’s simply impractical for use in digital workflows. This isn’t merely an inconvenience; it’s a critical bottleneck that leads to dozens of wasted hours every week.
Traditional approaches fail at reliably extracting information in complex PDFs
OCR and even more sophisticated ML approaches work for simple text documents but are unreliable for anything more complex. Text from different columns is jumbled together, figures are ignored, and tables are a nightmare to get right. Overcoming this usually requires a large engineering effort dedicated to building specialized pipelines for every document type you work with.
Reducto breaks document layouts into subsections and then contextually parses each depending on the type of content. This is made possible by a combination of vision models, LLMs, and a suite of heuristics we built over time. Put simply, we can help you:
Benefits at Reducto
At Reducto, we’re invested in the well-being and growth of our team. Here’s what we currently offer:
Reducto is an Equal Opportunity Employer committed to diversity and inclusion in the workplace. All qualified applicants will receive consideration for employment without regard to sex, race, color, age, national origin, religion, physical and mental disability, genetic information, marital status, sexual orientation, gender identity/assignment, citizenship, pregnancy or maternity, protected veteran status, or any other status prohibited by applicable national, federal, state or local law.