
The data, models, and platform to fine-tune edge-optimized SLMs.
Product: Task-specific language models (TLMs/GLiNER) and a Personalization API
Performance focus: CPU/edge-first, low-latency inference (claims of <50ms for some models)
Headquarters: Palo Alto, California
Team size: ~21 employees
Funding: ~$24–25M across Nov 2024 pre-seed and May 2025 seed
Deploying efficient, task-optimized language models for low-latency inference on CPUs and edge devices for enterprise extraction, classification, and personalization tasks.
Artificial intelligence / Machine learning
$7.0M
$17.5M
“Backed by institutional investors including Khosla Ventures, Insight Partners, and M12; participation from additional angels and funds”
| Company |
|---|
Full-time | Hybrid or Remote Introduction:
The Role: We are looking for a systems-level engineer to own Fastino’s model platform end-to-end.
This is not a feature role.
You will design and build:
You will own the platform that turns models into production systems.
What You’ll Work On:
Strong candidates will have:
Bonus:
Your next opportunity is in here somewhere. Sign up to explore 49,000+ startups and their open roles. No spam. No gamification. Just jobs.
49,000+
Startups
44,000+
Open Roles
0+
New This Week