Version 1

AI Infrastructure Engineer / AI Infrastructure Consultant

Job type

Full-time

Salary

Not disclosed by employer

Job description

About the Role

We are building a production-grade AI platform designed to support advanced machine learning systems at scale. We are seeking a senior infrastructure engineer or AI infrastructure consultant who combines deep platform expertise with hands-on experience using modern AI coding agents to accelerate development.

This role is ideal for someone who has architected and operated enterprise AI systems, understands modern MLOps and cloud-native infrastructure, and actively leverages AI-assisted development tools to move faster without sacrificing reliability, security, or performance.

You will help design, harden, and scale the infrastructure backbone supporting AI workloads across training, inference, and application layers.

What You Will Do

  • Architect and implement scalable infrastructure for AI and ML workloads (training, evaluation, inference).
  • Design and operate Kubernetes-based platforms for multi-tenant, production AI systems.
  • Build and refine MLOps pipelines covering model versioning, experiment tracking, CI/CD, deployment, monitoring, and rollback.
  • Establish DevOps best practices across infrastructure, application, and ML layers.
  • Lead security-first infrastructure design (access control, secrets management, isolation, observability, auditability).
  • Deploy and operate enterprise-grade production systems with strong uptime and reliability standards.
  • Leverage modern AI coding agents and developer copilots to accelerate engineering workflows.
  • Partner with ML engineers and application teams to translate research and product requirements into scalable infrastructure capabilities.

What We Are Looking For

  • 8-12+ years of experience in infrastructure, platform engineering, or distributed systems.
  • Proven experience building and operating enterprise-grade production systems.
  • Deep hands-on expertise with Kubernetes in production (autoscaling, networking, upgrades, reliability patterns).
  • Strong background in MLOps and ML platform lifecycle management.
  • Experience with cloud platforms (AWS, GCP, or Azure) and Infrastructure-as-Code (Terraform, Pulumi, etc.).
  • Practical, hands-on use of AI coding agents / AI-assisted development tools.
  • Strong programming ability in Go, Python, or similar infrastructure-oriented languages.

Nice to Have

  • Experience supporting GPU workloads and large-scale training/inference.
  • Familiarity with enterprise security standards (SOC2, ISO, zero-trust architectures).
  • Experience building internal developer platforms serving multiple teams.
  • Background supporting AI systems in regulated or high-reliability environments.

Why Version 1? 

 At Version 1, we believe in providing our employees with a comprehensive benefits package that prioritises their wellbeing, professional growth, and financial stability. 

  • Share in our success with our Quarterly Performance-Related Profit Share Scheme, where employees collectively benefit from a share of our company's profits 
  • Strong Career Progression & mentorship coaching through our Strength in Balance & Leadership schemes with a dedicated quarterly Pathways Career Development programme 
  • Flexible/remote working: Version 1 is tremendously understanding of life events and people’s individual circumstances and offers flexibility to help achieve a healthy work-life balance
  • Financial Wellbeing initiatives including Pension, Private Healthcare Cover, Life Assurance, Financial advice and an Employee Discount scheme
  • Employee Wellbeing schemes including Gym Discounts, Bike to Work, Fitness classes, Mindfulness Workshops, Employee Assistance Programme and much more
  • Generous holiday allowance, enhanced maternity/paternity leave, marriage/civil partnership leave and special leave policies
  • Educational assistance, incentivised certifications, and accreditations, including AWS, Microsoft, Oracle, and Red Hat 
  • Reward schemes including Version 1’s Annual Excellence Awards & ‘Call-Out’ platform. 
  • Environment, Social and Community First initiatives allow you to get involved in local fundraising and development opportunities as part of fostering our diversity, inclusion and belonging schemes. 

And many more exciting benefits… drop us a note to find out more.    

Version 1 is an equal opportunities employer. 

We are committed to building a diverse, inclusive and respectful workplace where everyone feels valued and able to thrive. We welcome applications from people of all backgrounds, identities and lived experiences, and we value the different perspectives people bring, including those shaped by disability and neurodiversity.

We want every candidate to have a positive and accessible recruitment experience. If you need reasonable adjustments at any stage of the process, please contact your recruiter at Version 1. We will consider all requests carefully, respectfully and confidentially. 

Video links: https://www.youtube.com/watch?v=F_d3ELTH5zo
