- Lead and manage the resolution of complex technical issues involving Zafin’s products and Azure cloud environment.
- Design and implement strategic, operational enhancements to improve resiliency and system reliability.
- Conduct in-depth Root Cause Analysis (RCA) for high-severity incidents and drive initiatives to reduce error recurrence.
- Represent the organisation in external client escalation calls, providing expert guidance and solutions.
- Architect and optimise cloud infrastructure for high performance, scalability, and cost-effectiveness.
- Provide thought leadership in managing and scaling container orchestration platforms such as AKS and OpenShift.
- Oversee the implementation of advanced monitoring solutions and integrate predictive analytics for proactive issue resolution.
- Develop and execute automation strategies to streamline operational workflows and incident responses.
- Create and maintain comprehensive documentation of cloud architectures, processes, and incident management strategies.
- Mentor and coach junior engineers, fostering a culture of continuous learning and innovation.
- Drive strategic initiatives, collaborating with cross-functional teams to achieve organisational objectives.