Site Reliability Engineering (SRE) Manager

القاهرة الجديدة
دائم
دوام كامل

قبل 2 أشهر

The Site Reliability Engineering (SRE) Manager is responsible for defining and implementing the SRE framework and best practices across the organization, ensuring the reliability, performance, and scalability for business-critical digital systems. This role demands a proactive, technically hands-on, and process-oriented leader with the seniority to dotted-line lead SREs responsible for different platforms. Collaborate closely with platform teams and service providers to foster a strong culture of collaboration and operational excellence, while driving value realization and continuous improvement.YOUR NEW KEY RESPONSIBILITIES:1. Framework Creation & Best Practices:

Define, implement, and maintain the SRE framework in collaboration with Platform SREs

Ensure the framework aligns with organizational goals around performance, availability, and operational excellence

Promote standardized best practices to drive consistent execution across diverse technology platforms

Drive value realization by ensuring the framework leads to tangible improvements in reliability, efficiency, and customer satisfaction

2. SRE Community Building:

Build and nurture an engaged SRE community of practice:

- Design and manage a structured onboarding process for new SREs- Provide continuous guidance, mentoring, and support to Platform SREs- Ensure all teams are fully aligned with the SRE framework, practices, and operational standards- Foster a learning culture that promotes operational excellence and continuous improvement

Establish regular routines for Platform SREs:

- Share challenges and resolve dependencies- Exchange best practices- Review KPIs and performance metrics- Collaborate on solutions that improve Performance, Availability, Incident Reduction, and MTTR (Mean Time to Recovery)3. Operational Management:

Oversee troubleshooting efforts for complex problems and root cause analysis in collaboration with L3/L4 support vendors. Lead the technical resolution of major IT disruptions with required teams (not limited to regular working hours).

Drive proactive improvements by defining appropriate Service Level Indicators (SLIs), analyzing incident trends to identify root causes, and implementing permanent fixes to prevent recurrence

Optimizing system performance and ensuring scalability to meet new demands. Align platform-specific SRE objectives with overall reliability goals.

Develop and implement automation strategies across all platforms to enhance efficiency, reduce manual interventions, and improve system reliability.

Steer automation initiatives across platforms to boost operational efficiency, minimize manual tasks, and strengthen system reliability.

Promote the utilization of observability and automation tools (Dynatrace, Azure Monitor, Terraform, Ansible, etc.) and ensure a unified approach to monitoring, performance tuning, and improvements across platforms.

4. Key Metrics:

Digital Products Performance & Availability

Mean Time to Recovery (MTTR)

Reduction in User-Facing Incidents

Number of Automations

Adoption of SRE Practices Across Platforms

ARE THESE YOUR SECRET INGREDIENTS?

Bachelor's or Master's degree in Computer Science, Engineering or a related field.

5+ years of experience in Site Reliability Engineering, DevOps, or a similar role.

3+ years leading team in IT Operations or Development

Proven track record in administering full-stack technology environments in enterprise landscape, including but not limited to:

- Cloud Platforms: Microsoft Azure or similar (Google Cloud, AWS).- Operating Systems: Linux and Windows Server platforms.- Familiarity with enterprise-grade hardware from IBM, Dell, HPE, and Cisco.- Databases: SQL, MongoDB, etc.- Storage Solutions: experience with enterprise storage technologies including SAN and NAS, supporting high-availability and backup strategies.- Networking: Cisco and FortiGate network devices, firewalls, load balancers, and VPNs.

Exposure to business-critical applications and platforms such as SAP (S4Hana, MDG), MS Dynamics or similar enterprise systems

Hands-on expertise in system monitoring and observability (e.g. Dynatrace, Datadog, Splunk), automation (Terraform, Ansible, etc.) and performance tuning utilizing industry standard tools.

Leadership & Influence: Strong ability to lead distributed teams, manage priorities, and influence stakeholders to achieve adoption and value delivery. Comfortable leading teams through change (new processes, tooling, and cultural differences)

Collaboration & Communication: Excellent communication and collaboration skills, capable of working effectively across different hierarchical levels. Able to articulate complex concepts clearly to both technical and non-technical audiences.

Problem Solving & Critical Thinking: Superior troubleshooting skills and proactive approaches to managing complex issues. Being able to resolve conflicts diplomatically.

Technical Acumen: Strong technical foundation to engage deeply with engineering teams.

Operational Mindset: Focused on delivering reliability, stability, and operational excellence.

ABOUT YOUR NEW TEAM:We are Coca-Cola Hellenic, a growth-focused consumer goods business and strategic bottling partner of the Coca-Cola Company. We bottle, distribute and sell an unrivalled range of products in 29 markets in Europe, Africa and Eurasia. As we do, we create value for all stakeholders, support socio-economic growth and build a more positive environmental impact.We bring together more than 30,000 people from over 70 nationalities, coming from five continents. The diversity of our markets, from mature to emerging economies, provides a wide range of attractive opportunities for growth.We nurture our talents. We give opportunities to people across all functions and levels, as well as different geographies, backgrounds and education. We are willing to take a risk on the people we believe in, even if they don't have the perfect experience. We have faith in what every person can be.And although we have so much to be proud of, we always stay humble. We believe the real magic happens - for us and for you - when we OPEN UP.AT COCA-COLA HBC, DIVERSITY HELPS US THRIVEAt Coca-Cola HBC, we are an inclusive employer that thrives on diversity. This means our environment provides equal opportunities for all, regardless of race, color, religion, age, disability, sexual orientation, or gender identity. Join us in nurturing a culture where everyone belongs and contributes to our collective success.BenefitsBonus incentivesCoaching and mentoring programsMedical InsuranceWork with iconic brandsBrands

Coca-Cola HBC

تقدم الآن