Fastino Labs

The data, models, and platform to fine-tune edge-optimized SLMs.

fastino.ai

Fastino Labs

The data, models, and platform to fine-tune edge-optimized SLMs.

fastino.ai

HQRemote

Team Size22

Open Jobs9

Total Funding-

Latest FundraiseUnknown

TL;DR

Product: Task-specific language models (TLMs/GLiNER) and a Personalization API

Performance focus: CPU/edge-first, low-latency inference (claims of <50ms for some models)

Headquarters: Palo Alto, California

Team size: ~21 employees

Funding: ~$24–25M across Nov 2024 pre-seed and May 2025 seed

Company Overview

Problem Domain

Deploying efficient, task-optimized language models for low-latency inference on CPUs and edge devices for enterprise extraction, classification, and personalization tasks.

Industry

Artificial intelligence / Machine learning

Tech Stack

Task-specific language models (TLMs/SLMs/GLiNER)

Fine-tuning and dataset generation tooling

Personalization API

CPU/edge inference optimization

Funding Track Record

Pre-seed- Nov 2024

$7.0M

Seed- May 7, 2025

$17.5M

Investor Signal

“Backed by institutional investors including Khosla Ventures, Insight Partners, and M12; participation from additional angels and funds”

Founders

What we do

Join the Team

Software Engineer - Large Language Models

RemoteGB

Remote • GB

Related Companies

Company	HQ	Industry	Total Funding
Stability AI	🇬🇧GB	Software	$231M
Mistral Ai	🇫🇷FR	Data and AnalyticsDeepTechInformation TechnologySoftware	-
Stealth Startup	🇺🇸US	Lending and InvestmentsSoftware	-
Oumi	🌍Remote	Data and AnalyticsDeepTechInformation TechnologySoftware	-
Luma AI	🇺🇸US	Data and AnalyticsDeepTechGamingHardwareInformation TechnologySoftware	$87M

Full-time | Remote with trips to Silicon Valley office | Reports to Founders

Introduction:

Join us at Fastino as we build the next generation of LLMs. Our team, boasting alumni from Google Research, Apple, Stanford, and Cambridge is on a mission to develop specialized, efficient AI.
Fastino's GLiNER family of open source models has been downloaded more than 5 million times and is used by companies such as NVIDIA, Meta, and Airbnb
Fastino has raised $25M (as featured in TechCrunch) through our seed round and is backed by leading investors including Microsoft, Khosla Ventures, Insight Partners, Github CEO Thomas Dohmke, Docker CEO Scott Johnston, and others.

What You’ll Work On:

Experiment with novel language model architectures, helping drive and execute Fastino's research roadmap
Optimize Fastino’s multimodal models to improve response quality, instruction adherence, and overall performance metrics
Architect data processing pipelines, implementing filtering, balancing, and captioning systems to ensure training data quality across diverse content categories
Implement reinforcement learning techniques including Direct Preference Optimization and Generalized Reward Preference Optimization to align model outputs with human preferences and quality standards
Build robust and real-world motivated evaluations
Partner with Fastino engineering team to ship model updates directly to customers
Establish best practices for code health and documentation on the team, to facilitate collaboration and reliable development

What We’re Looking For:

Required - Great velocity for building and shipping agents / AI products.
Optional - Advanced degree (Master's or PhD) in Computer Science, Artificial Intelligence, Machine Learning, or related technical discipline with concentrated study in deep learning and computer vision methodologies
Optional - Demonstrated ability to do independent research in Academic or Industry settings
Optional - Substantial industry experience in large-scale deep learning model training, with demonstrated expertise in at least one of Large Language Models, Vision-Language Models, Diffusion Models, or comparable generative AI architectures
Optional - Comprehensive technical proficiency and practical experience with leading deep learning frameworks, including advanced competency in one of PyTorch, JAX, TensorFlow, or equivalent platforms for model development and optimization

Startup jobs. A lot of them.

Your next opportunity is in here somewhere. Sign up to explore 49,000+ startups and their open roles. No spam. No gamification. Just jobs.

49,000+

Startups

44,000+

Open Roles

New This Week

Backend Developer

Part-timeMunich, DE

Part-time • Munich, DE

Machine Learning Engineer

Part-timeLondon, GB

Part-time • London, GB

Product Designer

Part-timeBerlin, DE

Part-time • Berlin, DE

Mobile Developer

InternshipJerusalem

Internship • Jerusalem

Product Designer

ContractMunich, DE

Contract • Munich, DE

Data Scientist

Full-timeBerlin, DE

Full-time • Berlin, DE