Vectara offers an enterprise-ready platform that embeds reliable generative AI and conversational search into applications while minimizing hallucinations and preserving provenance. It provides aโฆ
AI AgentsAI AssistantsGenerative AIHybrid RetrievalMultilingualSOC-2vectara.com
Vectara
Vectara offers an enterprise-ready platform that embeds reliable generative AI and conversational search into applications while minimizing hallucinations and preserving provenance. It provides aโฆ
AI AgentsAI AssistantsGenerative AIHybrid RetrievalMultilingualSOC-2vectara.com
HQCupertino, US
Team Size61
Open JobsUnknown
Total Funding$54M
Latest Fundraise2 years ago
TL;DR
Founded: 2020
Headquarters: Palo Alto, California
Product: Serverless RAG / conversational search API for grounded, explainable generative AI
Total funding: USD 53,500,000
Notable investors: Race Capital; participation from Emad Mostaque
Seed round reported June 13, 2023; participation noted from Emad Mostaque.
- 2024-07-16
USD 24,000,000
Aggregate funding details show total funding of USD 53,500,000 with a last funding date of 2024-07-16.
Investor Signal
โBacked by venture investors including Race Capital and a cohort of individual and institutional investors (examples listed on company materials).โ
Founders
What we do
Join the Team
Platform Engineer
RemoteUnited States, US
Remote โข United States, US
Related Companies
Company
HQ
Industry
Total Funding
webAI
๐บ๐ธUS
Blockchain and CryptoData and AnalyticsDeepTechInformation TechnologySoftware
$77M
A.Team
๐บ๐ธNew York City, US
Community and LifestyleData and AnalyticsDeepTechInformation TechnologyProfessional ServicesSoftware
$60M
Brightbeam
๐ฎ๐ชWaterford, IE
Data and AnalyticsDeepTechInformation TechnologySoftware
-
Qdrant
๐ฉ๐ชBerlin, DE
Data and AnalyticsDeepTechInformation TechnologyInternet ServicesSoftware
$88M
KAVIA AI
๐บ๐ธSan Francisco, US
Data and AnalyticsDeepTechInformation TechnologyInternet ServicesSoftware
$3M
Who you are
2+ years in platform engineering, DevOps, SRE, or backend infrastructure roles
Strong Kubernetes experience (deployment, debugging, scaling โ not just kubectl apply)
Hands-on with infrastructure-as-code: Terraform, Helm, or Pulumi
Experience with at least one major cloud provider (AWS preferred; GCP or Azure also valued)
Proficiency in one or more of: Go, Python, Java. Comfortable reading and contributing to backend codebases
Working knowledge of CI/CD systems (GitHub Actions, Bazel, ArgoCD, or similar)
Solid fundamentals in Linux, networking, and distributed systems
Experience deploying or operating ML inference workloads (model serving, GPU scheduling, vLLM, TensorFlow Serving, or similar)
Familiarity with streaming/messaging systems (Kafka, Pulsar) and data stores (MariaDB/PostgreSQL, Aerospike, ClickHouse, OpenSearch)
Experience with GitOps workflows (ArgoCD, Flux)
Exposure to air-gapped or on-premises Kubernetes deployments
Background in observability tooling (Prometheus, Grafana, OpenTelemetry, Datadog)
Experience providing technical support or working directly with enterprise customers on infrastructure issues
Comfort with AI-assisted development workflows and managing AI coding agents
What the job involves
You'll own the infrastructure that runs our deploy anywhere platform โ from Kubernetes clusters serving ML inference at scale to the CI/CD pipelines, IaC, and observability stack that keep it all reliable
Benefits
Comprehensive medical insurance
Life & disability insurance
Employee assistance program
Free gym access at HQ
Annual global summit
Free snacks and beverages
Bereavement leave
Startup jobs. A lot of them.
Your next opportunity is in here somewhere. Sign up to explore 70,000+ startups and their open roles. No spam. No gamification. Just jobs.
70,000+
Startups
83,000+
Open Roles
4,800+
New This Week
Mobile Developer
InternshipTel Aviv
Internship โข Tel Aviv
Backend Developer
Full-timeUtrecht, NL
Full-time โข Utrecht, NL
Software Engineer
ContractManchester, GB
Contract โข Manchester, GB
Machine Learning Engineer
InternshipMunich, DE
Internship โข Munich, DE
Mobile Developer
Full-timeManchester, GB
Full-time โข Manchester, GB
Machine Learning Engineer
InternshipNew York, US
Internship โข New York, US
This is a hands-on role: you'll write Helm charts and Terraform one day, debug a Kafka consumer lag issue the next, and ship a backend service feature the day after
You'll deploy across AWS, GCP, and on-premises (including air-gapped environments), and you'll participate in an on-call rotation supporting enterprise customers
Build and maintain infrastructure-as-code (Terraform, Helm) for our AWS EKS and GCP GKE clusters, plus on-premises deployments (including Tanzu and air-gapped environments)
Own CI/CD pipelines (GitHub Actions, Bazel, ArgoCD) and drive GitOps adoption
Deploy, scale, and optimize ML/NLP inference workloads (vLLM, PyTorch, GPU scheduling with various Kubernetes scalers)
Build and improve observability: Prometheus, Grafana, Datadog,, and OpenTelemetry
Collaborate with Field Engineering to support PoCs and platform deployments in customer cloud VPCs and on-prem environments
Contribute to backend services (Java 21, Python, gRPC) and platform features
Improve system reliability, scalability, and developer experience across the engineering org