TMS
SRE/ DevOps Engineer
Salary
Job description
SRE/DevOps Engineer
Toronto – Hybrid
Role Summary
We are seeking a highly skilled Site Reliability Engineer (SRE) / DevOps Engineer to support enterprise cloud and DevOps transformation initiatives. The ideal candidate should have strong expertise in implementing DevOps solutions, cloud-native infrastructure, CI/CD automation, Kubernetes platform engineering, GitOps deployment models, security integration, and observability practices across AWS and Azure environments.
The role requires hands-on experience in modern DevOps practices with strong focus on:
- CI/CD implementation using GitHub and GitHub Actions
- DevSecOps integration using JFrog and Veracode
- Infrastructure as Code using Terraform
- Kubernetes platform engineering on EKS
- GitOps deployment using ArgoCD
- AWS cloud-native services and automation
Key Responsibilities
DevOps & CI/CD Engineering
- Design, implement, and maintain enterprise-grade CI/CD pipelines using GitHub and GitHub Actions.
- Automate build, deployment, testing, and release management processes.
- Implement branching strategies, pull request governance, and release approvals.
- Integrate DevSecOps controls into CI/CD pipelines.
- Manage source code repositories and deployment automation workflows.
DevSecOps & Security Integration
- Integrate security tools such as JFrog Xray and Veracode into CI/CD pipelines.
- Perform artifact and container image vulnerability scanning.
- Implement security best practices for cloud and container platforms.
- Ensure compliance with enterprise security standards.
Cloud & Infrastructure Engineering
- Deploy and manage applications across AWS cloud platforms.
- Provision and manage infrastructure using Terraform and CloudFormation.
- Support cloud-native and serverless deployments.
- Manage infrastructure scalability, resiliency, and availability.
Kubernetes & GitOps Platform Engineering
- Deploy and manage containerized applications on Amazon EKS.
- Implement GitOps deployment practices using ArgoCD.
- Support Kubernetes cluster administration, scaling, and monitoring.
- Implement service mesh technologies such as Istio, Envoy, or Gloo.
AWS Cloud Services Management
- Work extensively with AWS services including:
- EKS
- EC2
- Lambda
- IAM
- RDS
- ElastiCache
- S3
- Route53
- ALB/NLB
- AWS Batch
- Secrets Manager
- SSM Parameter Store
- KMS
- Configure secure networking, DNS, encryption, and access management.
Monitoring & Reliability Engineering
- Implement monitoring, logging, and observability solutions using Datadog and Sumo Logic.
- Monitor application metrics, logs, APM, and infrastructure health.
- Configure proactive alerting and incident management workflows.
- Perform root cause analysis and production support activities.
Agile Delivery & Collaboration
- Work within Agile/Scrum teams using Jira and Confluence.
- Participate in sprint planning, backlog grooming, and release management.
- Collaborate with development, QA, cloud, and security teams.
Required Technical Skills
High-Level Skills
- DevOps & CI/CD
- GitHub & GitHub Actions
- JFrog & Veracode
- Terraform
- Amazon EKS
- ArgoCD
- AWS Cloud Services
- Kubernetes
- GitOps
- DevSecOps
- Infrastructure as Code (IaC)
CI/CD & Source Control
- GitHub
- GitHub Actions
- Bamboo
- Maven
- Gradle
- NPM
- MSBuild
Security & Artifact Management
- JFrog
- Nexus
- Veracode
- SonarQube
Cloud Platforms
- AWS
- Azure (Good to Have)
Infrastructure as Code
- Terraform
- CloudFormation
- Sceptre
Containerization & Orchestration
- Kubernetes
- EKS
- Docker
- ArgoCD
- Istio
- Envoy/Gloo
Monitoring & Observability
- Datadog
- Sumo Logic
Preferred Qualifications
- Experience supporting enterprise-scale DevOps and cloud transformation programs.
- Strong understanding of SRE principles and operational excellence.
- Experience with microservices architecture and distributed systems.
- Exposure to AI-enabled DevOps tools and automation.
- Multi-cloud deployment experience is an added advantage.
All your information will be kept confidential according to EEO guidelines.


