About xAI
xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All employees are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.
ABOUT THE ROLE:
You will be the person who turns a hardware listing and a software bundle into a running AI inference platform — from bare metal to serving production traffic. This is a hands-on role at the intersection of physical datacenter infrastructure and platform engineering. You will rack GPU servers, cable network fabrics, provision bare metal via PXE, deploy Kubernetes clusters, stand up monitoring and network telemetry stacks, and validate end-to-end inference pipelines — all in air-gapped, classified environments with no internet access.
You are the high side. Everything the platform engineering team builds on the unclassified side — deployment tooling, signed software bundles, switch configurations, OS images — you execute on classified infrastructure. You own the full stack from physical hardware through running GPU workloads, including the cross-domain solution (CDS) receive pipeline that automates software delivery into the classified environment. When something breaks on-site, you fix it. When an update arrives through the data diode or on physical media, you apply it. You are the bridge between xAI's engineering organization and the classified compute facilities where our infrastructure operates.
Validate network fabric correctness using LLDP verification, BGP peering checks, and InfiniBand fabric topology validation after initial deployment and hardware changes. Serve as the keyboard operator for network troubleshooting directed by the Network Architect — you execute commands on classified network devices while the architect directs the session on-site or via approved channels.
Execute compliance and security validation: run STIG scans (OpenSCAP) against deployed systems, verify FIPS 140-3 mode on all nodes, validate AV agent status, and execute pre-admission security checklists before nodes are allowed to serve classified workloads. Document and report compliance status for ATO packages.
Interface with customer IT, security, and facility teams. Participate in change control board (CCB) processes for classified system modifications. Train customer operations teams on monitoring dashboards, alert response procedures, and basic operational runbooks during deployment handoff.
COMPENSATION AND BENEFITS:
$180,000 - $440,000 USD
Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short- and long-term disability insurance, life insurance, and various other discounts and perks. xAI is an equal opportunity employer. For details on data processing, view our Recruitment Privacy Notice.