What exactly you will do here:
- Team Leadership - Lead and manage a team of 3-5 Application Reliability Engineers, providing guidance, mentorship, and support to ensure the team’s success
- Responsible for System Reliability - Ensuring the availability, latency, performance and efficiency of our systems
- Collaboration - Work closely with development and operations teams to balance the need for new features with the need for system stability
- Incident Management - Handle incidents, conduct root causes analysis and work on improving system resilience
- Automation - Lead proactive automation initiatives to enhance operational efficiency