Ifs1
Principal DevOps Engineer
Job description
About the role
We are looking for a Principal DevOps Engineer to own and evolve the core infrastructure that underpins our cloud‑native, AI‑enabled SaaS platforms.
This role is about building platforms that scale, not firefighting. You will design, operate, and continuously improve a secure, highly available Kubernetes‑based platform that enables product engineering teams to deploy, operate, and evolve services safely and independently.
You will work closely with software engineers, product teams, and security stakeholders to embed best‑in‑class DevOps and platform engineering practices across the organisation.
Mission
- Build and operate secure, scalable, highly available cloud infrastructure
- Enable product teams through automation, self‑service, and clear standards
- Raise the bar on reliability, security, observability, and deployment quality
- Act as a technical leader across platform and infrastructure initiatives
What success looks like
You will be accountable for outcomes such as:
Highly available, fault‑tolerant platforms
- All containerised services are deployed with appropriate replication, resilience, and resource limits
- Workloads are designed for multi‑zone availability and safe failure modes
Zero‑downtime, high‑quality delivery
- CI/CD pipelines support safe deployment patterns (e.g. rolling, canary, fast rollback)
- Deployment‑related incidents are eliminated or rapidly mitigated
Empowered engineering teams
- Engineers can diagnose and resolve the majority of platform‑related issues independently
- Clear standards, tooling, and automation reduce cognitive load and friction
Strong security posture
- Infrastructure and workloads follow security‑by‑default principles
- Vulnerabilities are proactively identified, prioritised, and remediated
- Platform security tooling is continuously maintained and improved
Comprehensive observability
- All critical services are monitored with meaningful alerts and dashboards
- Teams have access to self‑service monitoring and alerting capabilities
Key responsibilities
- Design, build, and operate cloud infrastructure using Infrastructure as Code
- Own and evolve Kubernetes platforms, including workload standards and deployment models
- Develop and maintain CI/CD pipelines and GitOps workflows
- Embed security best practices across infrastructure, pipelines, and runtime environments
- Improve platform reliability, monitoring, and incident response workflows
- Act as a technical leader and mentor for engineers using the platform
- Partner with product and engineering teams to anticipate future platform needs
Why join us?
- Own and shape a modern platform engineering capability
- Work on real production systems supporting AI‑enabled SaaS products
- High‑trust, high‑autonomy engineering culture
- Opportunity to influence platform strategy as the organisation scales
Essential skills and experience
- Proven experience building and operating cloud‑native platforms at scale
- Strong hands‑on experience with:
  - Kubernetes & containerised workloads
  - Infrastructure as Code (e.g. Terraform)
  - CI/CD pipelines and GitOps‑style delivery
- Deep understanding of:
  - High availability, fault tolerance, and scaling strategies
  - Secure infrastructure design and operational security practices
- Experience running production platforms on public cloud (GCP preferred; AWS acceptable)
- Strong troubleshooting skills across distributed systems
- Ability to explain complex technical concepts to non‑specialist audiences
- Exposure to AI/ML or LLM‑based workloads in production environments
Technologies you’ll work with
- Google Cloud Platform (GCP)
- Kubernetes (GKE), Docker
- Terraform
- Git‑based CI/CD pipelines
- GitOps tooling (e.g. Argo CD)
- Observability tooling (metrics, logging, alerting)
- Modern AI‑enabled workloads and services
Nice to have
- Experience with service mesh technologies (e.g. Istio)
- Experience with Kubernetes Gateway API or modern ingress patterns
- Familiarity with Redis, PostgreSQL, or managed cloud data services
We embrace flexibility and hybrid work opportunities to support diverse needs and lifestyles, while also valuing inclusive workplace experiences. By fostering a sense of community, we drive innovation, strengthen connections, and nurture belonging. Our commitment ensures you can work in a way that suits you best, while also engaging with colleagues to share ideas and build meaningful relationships.