Fastino Labs

The data, models, and platform to fine-tune edge-optimized SLMs.

fastino.ai

Fastino Labs

The data, models, and platform to fine-tune edge-optimized SLMs.

fastino.ai

HQRemote

Team Size23

Open JobsUnknown

Total Funding-

Latest FundraiseUnknown

TL;DR

Product: Task-specific language models (TLMs/GLiNER) and a Personalization API

Performance focus: CPU/edge-first, low-latency inference (claims of <50ms for some models)

Headquarters: Palo Alto, California

Team size: ~21 employees

Funding: ~$24–25M across Nov 2024 pre-seed and May 2025 seed

Company Overview

Problem Domain

Deploying efficient, task-optimized language models for low-latency inference on CPUs and edge devices for enterprise extraction, classification, and personalization tasks.

Industry

Artificial intelligence / Machine learning

Tech Stack

Task-specific language models (TLMs/SLMs/GLiNER)

Fine-tuning and dataset generation tooling

Personalization API

CPU/edge inference optimization

Funding Track Record

Pre-seed- Nov 2024

$7.0M

Seed- May 7, 2025

$17.5M

Investor Signal

“Backed by institutional investors including Khosla Ventures, Insight Partners, and M12; participation from additional angels and funds”

Founders

What we do

Join the Team

AI Platform Engineer

HybridSan Francisco Bay Area, US

Hybrid • San Francisco Bay Area, US

Related Companies

Company	HQ	Industry	Total Funding
Luma	🇺🇸US	Data and AnalyticsDeepTechGamingHardwareInformation TechnologySoftware	-
Stability AI	🇬🇧GB	Software	$231M
Mistral AI	🇫🇷FR	Data and AnalyticsDeepTechInformation TechnologySoftware	$3B
Predictive Horizons	🌍Remote	Data and AnalyticsDeepTechInformation Technology	$4M
Stealth Startup	🇺🇸US	Lending and InvestmentsSoftware	-

Full-time | Hybrid or Remote Introduction:

Join us at Fastino as we build the next generation of LLMs. Our team, boasting alumni from Google Research, Apple, Stanford, and Cambridge is on a mission to develop specialized, efficient AI.
Fastino's GLiNER family of open source models has been downloaded more than 5 million times and is used by companies such as NVIDIA, Meta, and Airbnb
Fastino has raised $25M (as featured in TechCrunch) through our seed round and is backed by leading investors including Microsoft, Khosla Ventures, Insight Partners, Github CEO Thomas Dohmke, Docker CEO Scott Johnston, and others.

The Role: We are looking for a systems-level engineer to own Fastino’s model platform end-to-end.

This is not a feature role.

You will design and build:

Training pipelines
Fine-tuning workflows
RL infrastructure
Data ingestion and curation systems
Inference services
Scalability and backend architecture

You will own the platform that turns models into production systems.

What You’ll Work On:

Architect distributed fine-tuning pipelines for small encoder and decoder models
Implement LoRA, adapters, distillation, and compression workflows
Design experiment tracking, reproducibility, and dataset versioning systems

Strong candidates will have:

Deep experience with PyTorch and transformer architectures
Experience building production ML systems end-to-end
Experience with distributed training and inference
Experience optimizing GPU workloads
Strong backend and systems engineering fundamentals
Experience with containerization and orchestration
Cloud infrastructure experience (AWS/GCP/Modal/Together.ai etc)

Bonus:

Experience with RL or RLHF
Experience with distillation and compression
Experience building internal ML platforms

Startup jobs. A lot of them.

Your next opportunity is in here somewhere. Sign up to explore 52,000+ startups and their open roles. No spam. No gamification. Just jobs.

52,000+

Startups

65,000+

Open Roles

1,300+

New This Week

Frontend Developer

InternshipTel Aviv

Internship • Tel Aviv

Product Designer

ContractJerusalem

Contract • Jerusalem

Data Scientist

Part-timeHaifa

Part-time • Haifa

Machine Learning Engineer

Part-timeManchester, GB

Part-time • Manchester, GB

Mobile Developer

Full-timeCambridge, GB

Full-time • Cambridge, GB

AI Researcher

Full-timeManchester, GB

Full-time • Manchester, GB