Site reliability engineer

Insight Global

Full-time Other-General
Apply Now
Location
cotia, maranhรฃo, Brazil
Posted
June 08, 2026

Job Description

Job Title: Site Reliability Engineer (SRE) / Infrastructure Operations MID LEVEL

Role Overview

Responsible for managing day-to-day infrastructure operations, including monitoring, alerting, and driving stability improvements across the environment.

Key Responsibilities Monitor overall infrastructure health and system performance Track key performance metrics such as CPU, memory, and disk utilization Tune alerts to improve signal-to-noise ratio and reduce alert fatigue Support disaster recovery (DR) rehearsals and readiness activities Maintain and update runbooks, documentation, and operational reports

Required Experience 4โ€“6 years of experience in Site Reliability Engineering (SRE) or infrastructure operations Hands-on experience with VMware environments Experience with monitoring tools such as PRTG, Datadog, or similar platforms Strong incident management experience, including response and resolution processes

Core Skills & Competencie...