Location
Taguig, National Capital Region, Philippines
Posted
June 11, 2026
Job Description
We are looking for a Production Support Engineer to monitor and maintain the health of applications, infrastructure, APIs, and services in a 24/7 production environment. The role involves real-time monitoring, incident management, troubleshooting, and collaboration with engineering teams to ensure system reliability and performance.
Key Responsibilities
- Monitor system health across applications, infrastructure, APIs, and services using observability and monitoring tools.
- Review and respond to alerts, dashboards, and metrics in real time.
- Create, expand, and maintain dashboards and alerting to improve observability coverage.
- Perform initial triage using logs, traces, and metrics; identify symptoms and potential root causes.
- Execute runbooks/SOPs for common production issues (e.g., restarts, validation checks, health checks).
- Create incidents, document findings, and collaborate with engineering teams ...