DevOps Tech Lead

Job Description

Frasers Group is a global retail organisation delivering digital innovation and distinctive store experiences across sports, premium, and luxury brands. The organisation operates a large digital ecosystem supporting multiple high-traffic websites and mobile applications for well-known retail brands in the UK and beyond.

Location: Weighbridge Rd, Shirebrook, Mansfield NG20, United Kingdom
Work Mode: Hybrid (Remote work available)
Employment Type: Full-time
Category: Technology

Key Responsibilities

  • Lead a DevOps/Platform engineering squad responsible for infrastructure-as-code, CI/CD, observability, and cloud operations.
  • Design and implement scalable, secure, and highly available cloud environments (primarily Azure) using Terraform and modern DevOps tooling.
  • Contribute hands-on (approximately 30% of time) to automation scripts, IaC, monitoring configurations, and critical platform improvements.
  • Drive effective incident management, including rapid resolution, root cause analysis, and reliability improvements.
  • Champion SRE practices such as SLAs, SLOs, SLIs, and error budgets to improve platform reliability.
  • Coach and mentor engineers through code reviews, pairing sessions, and architectural guidance.
  • Collaborate with engineering and product stakeholders to define platform roadmaps, priorities, and delivery plans.
  • Continuously improve CI/CD pipelines, GitHub workflows, release strategies, and operational playbooks.
  • Promote a strong DevOps culture focused on automation, security, observability, and accountability.
  • Support effective Agile delivery practices for operations-focused engineering teams.

Requirements

  • Proven experience leading DevOps or SRE teams in production environments.
  • Strong expertise in cloud platforms, with preference for Microsoft Azure (identity, networking, cost management, and resource provisioning).
  • Hands-on experience with Terraform and Infrastructure-as-Code best practices.
  • Practical experience with containerisation and orchestration tools such as Docker and Kubernetes (AKS).
  • Strong experience with CI/CD systems and GitHub Actions or similar tools.
  • Experience designing and operating highly available, secure, and observable distributed systems.
  • Working knowledge of monitoring and alerting tools such as Prometheus, Grafana, Azure Monitor, Datadog, or Honeycomb.
  • Solid understanding of RESTful APIs, event-driven systems, and caching technologies such as Redis.
  • Strong communication and stakeholder engagement skills across technical and non-technical teams.

Desirable Skills

  • Familiarity with SRE frameworks including SLAs, SLOs, and error budgets.
  • Experience with incident management tools such as PagerDuty or Opsgenie.
  • Knowledge of DevSecOps practices including secrets management and zero-trust architecture.
  • Exposure to MACH or composable architecture.
  • Experience with cloud cost optimisation and FinOps practices.
  • Understanding of CDN configuration, edge delivery networks, and performance optimisation.
  • Experience with service mesh, API gateways, or policy management tools.