Lexsi Labs is building the next generation of
Tabular Foundation Models (TFMs)
to make structured prediction as reusable and scalable as foundation models made language and vision.
We are already deep in this space with concrete systems and research artifacts:
- We built
TabTune
, a unified library for inference, benchmarking, evaluation (including calibration and fairness), and fine-tuning across multiple leading TFMs through a single API.
- We developed frontier architectures like
Orion-MSP
, introducing hierarchical multi-scale sparse attention and memory mechanisms designed to scale tabular in-context learning to wide, high-dimensional tables.
- We are pushing the ecosystem toward
Institutional Tabular Foundation Models (ITFMs)
, arguing for table-native predictors that capture institutional knowledge directly, without routing tables through text, and emphasizing robustness to drift, schema evolution, governance, and auditability.
Our ambition is simple and disruptive:
replace fragmented task-by-task modeling with foundation models for structured decision system
s, and fundamentally change how data science is practiced in real organizations
**.
The Ro**
leAs a
n AI Research Scientist (TFM
s), you will design ne
w tabular foundation model architectures, training methods, and evaluation syste
ms that push beyond today’s best baselines and unlock transfer, scale, and reliability for structured dat
a.This is not “apply transformers to tables.” This role is about building what comes next
: new modeling primitives for structured da
ta, new pretraining regimes, and models that can survive real institutional complexit
**y.
Responsibilit**
- iesInvent and prototy
pe new TFM architectu
res that improve generalization, scalability (wide tables, many features), and transfer across tasks and schem
- as.Develop learning paradigms for structured data includi
ng in-context learning, pretraining on synthetic/real mixtures, and efficient adaptation mechani
s
**ng.
Ideal Qualificat**
- ionsPhD (or equivalent research track record) in ML/AI/CS/Math/Stats or a related quantitative field, with demonstrated ability to execute independent resea
- rch.Strong foundation in deep learning and representation learning, with experience designing and analyzing architectures (transformers, attention variants, memory mechanisms, sparse modeling, mixture-of-experts, or relat
- ed).Proven ability to run rigorous empirical research: dataset curation, ablations, benchmarking, reproducibility discipline, and meaningful evaluation beyond single-metric w
- ins.Strong engineering ability in Python and modern ML stacks (PyTorch or JAX), with comfort building research codebases that others can ext
- end.Depth or strong interest in structured/tabular learning, including challenges like heterogeneity, missingness, feature interactions, schema variation, drift, and real-world decision constrai
**cts.
Nice to**
- HaveExperience with tabular deep learning, tabular ICL, or foundation model pretraining regimes for structured
- data.Familiarity with institutional ML constraints: drift monitoring, auditability, leakage prevention, calibration, fairness, and governance requirem
- ents.Experience building research infrastructure or libraries used by others (evaluation harnesses, benchmarking tools, training framewo
**rks).
What Success Look**
- s LikeYou
ship new archite
ctures that move the TFM frontier forward, especially in wide-table and cross-task transfer re
- gimes.You help evolve TFMs
into institutional-grade pred
ictors, with principled handling of drift, schema evolution, leakage resistance, and gover
- nance.You contribute to a research + systems loop where new ideas quickly b
ecome reproducible experiments, open tooling, and deployable model fa
m
ilies.