Clearwater Analytics

Staff Cloud Engineer

Role

Staff Cloud Engineer

Location

India

Job type

Full time

Posted

13 hours ago

Salary

Not disclosed by employer

Job description

Cloud Architecture & Infrastructure

  • Design, architect, and implement scalable, secure, and highly available cloud infrastructure on AWS across multi-account, multi-region environments.
  • Define and enforce cloud architecture standards, best practices, and governance policies using AWS Organizations, Control Tower, and SCPs.
  • Build and maintain Infrastructure as Code (IaC) using Terraform and AWS CloudFormation, writing reusable modules consumed across all product teams.
  • Optimize cloud environments for cost, performance, and reliability, owning FinOps practices including Savings Plans, Spot strategy, and Graviton adoption.
  • Collaborate with engineering, data, and security teams to build resilient distributed systems.
  • Drive innovation and continuous improvement initiatives across the platform.
  • Design, deploy, and manage production EKS clusters at multi-tenant financial-services scale.
  • Plan and execute cluster upgrades, patching, and Kubernetes version lifecycle management with zero customer impact.
  • Build and maintain internal Helm chart libraries and GitOps-driven cluster configuration using ArgoCD or Flux.
  • Implement zero-trust network principles and enforce IAM least-privilege across all AWS accounts.
  • Drive SRE practices: define and enforce SLOs for EKS, API Gateway, and other platform services.
  • Lead incident response, postmortem analysis, and blameless RCA processes for platform-level outages.
  • Build chaos engineering exercises and disaster recovery testing across availability zones and regions.
  • Partner with software engineering teams to deliver end-to-end solutions from design through production.
  • Evaluate new AWS services and open-source tooling to continuously improve infrastructure capabilities.

Required Qualifications

  • Strong, hands-on experience with AWS cloud services: EC2, VPC, IAM, EKS, S3, CloudWatch, API Gateway, Route 53, and more.
  • Proven experience operating Amazon EKS in production: cluster lifecycle, RBAC, IRSA, node groups, and autoscaling.
  • Proficiency in Infrastructure as Code with Terraform and AWS CloudFormation.
  • Solid understanding of containerization: Docker, Kubernetes architecture, and container lifecycle management.
  • Experience with monitoring and logging tools: Prometheus, Grafana, Dynatrace, OpenSearch, ELK/Loki.
  • Strong Linux/Unix systems administration and scripting in Bash, Python, or similar.
  • Deep knowledge of cloud security best practices: IAM, RBAC, secrets management, and network security.
  • Solid networking fundamentals: VPCs, subnets, load balancing, DNS, and Kubernetes ingress controllers.
  • Ability to troubleshoot distributed systems and debug complex production issues at scale.
  • Strong problem-solving skills with the ability to drive technical decisions across teams in a fast-paced environment.

Preferred Skills

  • AWS Certifications: Solutions Architect Professional or DevOps Engineer Professional.
  • Kubernetes Certifications: CKA or CKAD.
  • Experience with Helm and GitOps tools (ArgoCD, Flux).
  • Experience with Rancher/ArgoCD or similar tools for EKS node provisioning.
  • Exposure to microservices architecture and distributed systems at scale.
  • Experience with AWS API Gateway and Lambda Authorizers for JWT/OIDC-based auth flows.
  • Background in cost optimization and performance tuning (Graviton, Spot, Savings Plans).
  • Familiarity with CIAM/identity federation: OIDC, OAuth2, SAML, Auth0 integration.
  • Understanding of AI/ML infrastructure: model training pipelines, deployment on EKS, and model monitoring.