2. Senior Site Reliability Engineer (×3)
Our SRE team is the operational backbone of Latitude. We run bare metal infrastructure in 23 locations — real hardware, real networks, real consequences.
What you'll do
Own reliability for our global bare metal fleet — monitoring, alerting, incident response, post-mortems
Build and maintain internal tooling: Netbox (infra source of truth), Python/Go services
Drive automation for hardware lifecycle: provisioning, decommissioning, firmware updates, network changes
Collaborate with platform engineers on the provisioning stack
Participate in on-call rotation
4+ years SRE or infrastructure engineering
Strong Linux fundamentals — kernel, network, and hardware layers
Network automation experience (BGP, VLANs, IPAM) is a significant plus
Proficiency in Python or Go for internal tooling
Experience with Netbox, Prometheus, or similar tools
You thrive owning things end-to-end
Nice to have: Bare metal ops experience · Tinkerbell/PXE systems · network engineering background (CCNA/CCNP or equivalent)