### Who you are

- Bachelor’s or Master’s degree in Computer Science, Software Engineering, or a related field
- 3+ years in software development for Linux-based server infrastructure
- Experience managing cloud infrastructure (preferably AWS GovCloud)
- Strong understanding of networking (TCP/IP, VLANs, firewalls) and systems architecture
- Track record with monitoring, incident response, and root cause analysis for cloud infrastructure
- Experience with reproducible system configuration (Nix/NixOS a big plus)
- Prolonged periods sitting at a desk and working on a computer
- Ability to operate a computer and other office productivity machinery (e.g., keyboard, mouse, and printer)
- Visual acuity to read and write documentation, code, and email correspondence on a computer screen for extended periods
- Manual dexterity sufficient to use a keyboard, mouse, and touchscreen
- Ability to communicate effectively, including speaking and hearing over video/phone calls and in person
- May be required to occasionally lift up to 15 pounds (e.g., laptops, monitors, or office equipment)
- Capable of working in an office or remote setting, with occasional travel if specified

### What the job involves

- We're seeking an Infrastructure Software Engineer to design, manage, and scale our cloud and server infrastructure while improving operational efficiency
- You'll join our Software team to ensure reliable systems that support our autonomous fleet operations
- This role balances hands-on infrastructure development, system administration, automation, and developer support initiatives
- Operate and optimize build systems across on-premise and GovCloud environments, including provisioning, patching, monitoring, backups, and lifecycle automation
- Manage GovCloud infrastructure using Infrastructure-as-Code (Terraform)
- Optimize resource usage and cost in hybrid and multi-account environments
- Maintain secure and reliable cloud-to-ground data synchronization
- Enable efficient developer workflows in cloud applications
- Architect and enforce disaster recovery mechanisms in conjunction with the enterprise IT team
- Maintain uptime SLAs and respond to infrastructure incidents
- Configure and administer switches, firewalls, VPNs, and VLANs
- Maintain core services (DNS, DHCP, authentication, certificate management)
- Operate high-performance storage systems
- Support key databases used in our autonomy software platform
- Implement and audit system security policies affecting developer workflows, RBAC, and network segmentation
- Manage user identity, authentication, and secrets across hybrid environments
- Conduct patching and vulnerability remediation
- Ensure compliance with regulatory and audit frameworks (e.g., ITAR, FedRAMP)
- Automate server configuration and deployment with Terraform and scripting
- Build internal tools and self-service workflows to improve developer productivity
- Maintain accurate documentation and operational runbooks
- Support CI/CD pipelines, build systems, and development environments
- Partner with engineering teams to support system integration, telemetry, and edge deployments
- Plan for scaling infrastructure as the fleet and engineering organization grow