SRE

São Paulo Remote

2. Senior Site Reliability Engineer (×3)

Our SRE team is the operational backbone of the company. We run bare metal infrastructure in 23 locations — real hardware, real networks, real consequences.

What you'll do

Own reliability for our global bare metal fleet — monitoring, alerting, incident response, post-mortems

Build and maintain internal tooling: Netbox (infra source of truth), Python/Go services

Drive automation for hardware lifecycle: provisioning, decommissioning, firmware updates, network changes

Collaborate with platform engineers on the provisioning stack

Participate in on-call rotation

Requirements

4+ years SRE or infrastructure engineering

Strong Linux fundamentals — kernel, network, and hardware layers

Network automation experience (BGP, VLANs, IPAM) is a significant plus

Proficiency in Python or Go for internal tooling

Experience with Netbox, Prometheus, or similar tools

You thrive owning things end-to-end

Nice to have: Bare metal ops experience · Tinkerbell/PXE systems · network engineering background (CCNA/CCNP or equivalent)

Apply

by Quickin

Português | English | Español