Job Description
Frasers Group is a global retail organisation delivering digital innovation and distinctive store experiences across sports, premium, and luxury brands. The organisation operates a large digital ecosystem supporting multiple high-traffic websites and mobile applications for well-known retail brands in the UK and beyond.
Location: Weighbridge Rd, Shirebrook, Mansfield NG20, United Kingdom
Work Mode: Hybrid (Remote work available)
Employment Type: Full-time
Category: Technology
Key Responsibilities
- Lead a DevOps/Platform engineering squad responsible for infrastructure-as-code, CI/CD, observability, and cloud operations.
- Design and implement scalable, secure, and highly available cloud environments (primarily Azure) using Terraform and modern DevOps tooling.
- Contribute hands-on (approximately 30% of time) to automation scripts, IaC, monitoring configurations, and critical platform improvements.
- Drive effective incident management, including rapid resolution, root cause analysis, and reliability improvements.
- Champion SRE practices such as SLAs, SLOs, SLIs, and error budgets to improve platform reliability.
- Coach and mentor engineers through code reviews, pairing sessions, and architectural guidance.
- Collaborate with engineering and product stakeholders to define platform roadmaps, priorities, and delivery plans.
- Continuously improve CI/CD pipelines, GitHub workflows, release strategies, and operational playbooks.
- Promote a strong DevOps culture focused on automation, security, observability, and accountability.
- Support effective Agile delivery practices for operations-focused engineering teams.
Requirements
- Proven experience leading DevOps or SRE teams in production environments.
- Strong expertise in cloud platforms, with preference for Microsoft Azure (identity, networking, cost management, and resource provisioning).
- Hands-on experience with Terraform and Infrastructure-as-Code best practices.
- Practical experience with containerisation and orchestration tools such as Docker and Kubernetes (AKS).
- Strong experience with CI/CD systems and GitHub Actions or similar tools.
- Experience designing and operating highly available, secure, and observable distributed systems.
- Working knowledge of monitoring and alerting tools such as Prometheus, Grafana, Azure Monitor, Datadog, or Honeycomb.
- Solid understanding of RESTful APIs, event-driven systems, and caching technologies such as Redis.
- Strong communication and stakeholder engagement skills across technical and non-technical teams.
Desirable Skills
- Familiarity with SRE frameworks including SLAs, SLOs, and error budgets.
- Experience with incident management tools such as PagerDuty or Opsgenie.
- Knowledge of DevSecOps practices including secrets management and zero-trust architecture.
- Exposure to MACH or composable architecture.
- Experience with cloud cost optimisation and FinOps practices.
- Understanding of CDN configuration, edge delivery networks, and performance optimisation.
- Experience with service mesh, API gateways, or policy management tools.

