Overview
This role bridges legacy middleware stability and cloud-native modernization, applying SRE practices, automation engineering, and DevSecOps to keep enterprise platforms reliable and to drive continuous operational improvement.
Key Responsibilities
- Implement and operationalize SRE practices including SLO/SLI definition, error budget management, observability, and incident response.
- Manage and support legacy middleware technologies (JBoss, Apache, WebSphere, IIS) and drive modernization toward cloud-native architectures.
- Design and maintain CI/CD pipelines, Infrastructure-as-Code (IaC) frameworks using Terraform and GitOps, and automation workflows.
- Build self-service and self-healing solutions, auto-remediation playbooks, chaos engineering exercises, and proactive alerting.
- Monitor platform health through observability tooling (Dynatrace, Splunk) and drive continuous improvement against defined SLOs.
- Lead knowledge transfer, documentation, and runbook development.
- Contribute to the Platform-as-a-Product strategy by embedding SRE principles.
- Ensure compliance with ITSM processes, security standards, and business continuity requirements.
Required Experience
- Master's degree with a minimum of 5 years of relevant experience, or a Bachelor's degree with a minimum of 7 years of relevant experience, or an equivalent combination of education and experience.
- Hands-on SRE experience.
- Proven experience with legacy middleware platforms (JBoss, Apache, WebSphere, IIS) alongside modern application stacks (.NET, Java, Node.js, Angular).
- Strong proficiency in DevOps/DevSecOps tooling: Terraform, Kubernetes/AKS, Docker, GitOps, CI/CD pipelines, and GitHub/GitLab/Azure Repos.
- Hands-on experience with multi-cloud platforms (Azure and AWS).
- Solid scripting and automation skills in Python, PowerShell, or Bash.
- Experience with monitoring and observability platforms including Dynatrace, Splunk, Prometheus, and Grafana.
- Working experience in Agile and SAFe delivery environments.
- Strong database skills across relational (PostgreSQL, MySQL) and NoSQL platforms, with experience managing PaaS and COTS solutions.