Persimmons, Inc. is a semiconductor company specializing in advanced AI inference hardware and software systems. Their flagship product is the PL1000 chiplet architecture, which offers hyper-scalable, modular AI inference solutions with high compute density, low power consumption, and large model support. The PL1000 enables AI adaptability from edge devices to large data centers, improving cost efficiency and reducing operational risks. Founded by industry veterans with experience from Microsoft, Amazon, Apple, Intel, and leading AI research institutions, Persimmons aims to accelerate AI innovation and deployment with a focus on Generative AI and next-generation inference computing technology.
AI InferenceData CentersEdge ComputingGenerative AIHigh Compute DensityLow PowerModular DesignSemiconductorspersimmons.ai
Persimmons
Persimmons, Inc. is a semiconductor company specializing in advanced AI inference hardware and software systems. Their flagship product is the PL1000 chiplet architecture, which offers hyper-scalable, modular AI inference solutions with high compute density, low power consumption, and large model support. The PL1000 enables AI adaptability from edge devices to large data centers, improving cost efficiency and reducing operational risks. Founded by industry veterans with experience from Microsoft, Amazon, Apple, Intel, and leading AI research institutions, Persimmons aims to accelerate AI innovation and deployment with a focus on Generative AI and next-generation inference computing technology.
AI InferenceData CentersEdge ComputingGenerative AIHigh Compute DensityLow PowerModular DesignSemiconductorspersimmons.ai
HQSanta Clara, US
Team Size17
Open Jobs2
Total Funding-
Latest FundraiseUnknown
Join the Team
Compiler Engineer (Frontend)
On-SiteSan Jose, US
On-Site • San Jose, US
Teeming tracks opportunities at over 24,000 AI startups, then works with you to find (and land) the one you'll love.
Product Designer
Part-timeTel Aviv
Part-time • Tel Aviv
Technical Writer
Part-timeLondon, GB
Part-time • London, GB
Technical Writer
InternshipTel Aviv
Internship • Tel Aviv
Technical Writer
Part-timeBelgrade, RS
Part-time • Belgrade, RS
Frontend Developer
Full-timeNovi Sad, RS
Full-time • Novi Sad, RS
Mobile Developer
InternshipCambridge, GB
Internship • Cambridge, GB
Who we are:
Persimmons is building the infrastructure that will power the next decade of AI. Founded in 2023 by veteran technologists from the worlds of semiconductors, AI systems, and software innovation, We’re on a mission to enable smarter devices, more sustainable data centers, and entirely new applications the world hasn’t imagined yet.
Why join us:
We’re growing fast and looking for bold thinkers, builders, and curious problem-solvers who want to push the limits of AI hardware and software. If you're ready to join a world-class team and play a critical role in making a global impact - we want to talk to you.
What you’ll do:
As a leader on our team, you have the opportunity of working on optimizing our Persimmons Compiler.
Design and build the compiler that converts AI models from popular ML frameworks into assembly code that runs on our accelerator hardware.
Develop and implement novel scheduling algorithms that push the boundaries of technology.
Collaborate with cross-functional teams to design, test, and optimize our hardware and software solutions.
Analyze and improve the efficiency, scalability, and performance of our systems.
Stay abreast of industry trends and advancements to ensure our solutions remain competitive and innovative.
Provide technical leadership across the compiler team, mentoring engineers in advanced compiler techniques, and help scale the team as the company grows.
Requirements
What You Bring To The Table:
Benefits
Competitive salary and benefits package.
Flexible PTO
401k
Please note: Our organization does not accept unsolicited candidate submissions from external recruiters or agencies. Any such submissions, regardless of form (including but not limited to email, direct messaging, or social media), shall be deemed voluntary and shall not create any express or implied obligation on the part of the organization to pay any fees, commissions, or other compensation. Direct contact of employees, officers, or board members regarding employment opportunities is strictly prohibited and will not receive a response.
6+ years of experience in compiler development, and deep knowledge of modern compiler frameworks (LLVM, MLIR, TVM, XLA, IREE)
Proven track record of leading compiler or runtime systems projects from design through deployment.
Experience processing models from popular frameworks (e.g. PyTorch, TensorFlow, JAX), and familiarity with model architecture and workloads (transformers, diffusion models, etc.)
Familiarity with auto-scheduling and program synthesis techniques for high-performance ML kernels (e.g., TVM, Halide, or Ansor), or experience with other hardware-aware scheduling techniques.
Familiarity with hardware architectures and their optimization implications, including memory hierarchies, systolic arrays, DMA engines, and GPU-style parallelism.
If you can do the above, you already have strong C++ and python skills
BS/MS/PhD degree in Computer Science, Computer Engineering, or related field (or equivalent experience)
Strong interpersonal, verbal and written communications skills
Capability to achieve objectives under tight deadlines
Experience executing tasks while managing competing priorities
Practical knowledge working with large code bases
Experience writing and debugging multithreaded programs
Deep understanding of technology and passion for what you do
Strong teamwork, specifically a proven ability to effectively guide and influence within a dynamic matrix environment
Excellent problem-solving skills and the ability to work in a dynamic, fast-paced environment.