Senior AI / ML Platform Engineer

team.blue Global

Worcester, England, GB

1 day ago

Job type

Full-time

Job description

The most trusted digital enabler

team.blue is a leading digital enabler for companies and entrepreneurs. It serves over 3.3 million customers in Europe and has more than 3,000 experts to support them. Its goal is to shape technology and to empower businesses with innovative digital services.

Company

team.blue is an ecosystem of 60+ successful brands working together across 22 European countries to provide its 3.5 million SMB customers with everything they need to succeed online by offering best-in-class expertise and services.

team.blue's brands are a mix of traditional hosting businesses, which offer services ranging from domain names, email, and shared hosting to e-commerce and server hosting solutions, and specialist SaaS providers, which offer adjacent products such as compliance, marketing tools, and team collaboration products. This broad product offering makes it a one-stop partner for online businesses and entrepreneurs across Europe.

Position

We are looking for an experienced Senior AI / ML Platform Engineer to design, build, and maintain our machine learning and AI infrastructure platform. This role is critical to enabling our data science and AI teams to deploy, scale, and manage ML models efficiently across multi-GPU environments. You'll be responsible for creating robust, scalable platforms that support the full ML lifecycle from model training to inference, with a particular focus on LLM deployment and management.

Key Responsibilities

Platform Development & Management

Design and implement scalable ML / AI platforms supporting model deployment across multi-GPU nodes
Build and maintain infrastructure for LLM inference serving, including optimization for latency and throughput
Develop automated deployment pipelines for machine learning models using containerization and orchestration technologies
Create self-service tools and APIs that enable data scientists to deploy models independently

Infrastructure & Operations

Manage and optimize GPU cluster resources, ensuring efficient utilization and cost management

Implement monitoring, logging, and alerting systems for ML workloads and model performance

Design disaster recovery and backup strategies for critical ML infrastructure

Maintain high availability and reliability standards for production ML services

DevOps & Automation

Build CI/CD pipelines specifically tailored for ML model deployment and updates

Automate infrastructure provisioning using Infrastructure as Code (IaC) principles

Implement model versioning, rollback capabilities, and A/B testing frameworks

Develop automated scaling solutions for varying inference workloads

Collaboration & Support

Work closely with data science teams to understand requirements and optimize deployment workflows

Provide technical guidance on best practices for model deployment and infrastructure usage

Collaborate with security teams to implement secure ML model serving practices

Document platform capabilities, procedures, and troubleshooting guides

Profile

Professional Experience

4+ years of experience in platform engineering, DevOps, or infrastructure roles

2+ years of experience specifically with ML / AI infrastructure or platforms

Technical Skills

Cloud Platforms: 4+ years of experience with AWS, Azure, or GCP, particularly GPU-enabled services

Containerization: Proficiency with Docker and Kubernetes, including GPU scheduling and resource management

Infrastructure as Code: Experience with Terraform, CloudFormation, or similar tools

Programming: Strong skills in Python and at least one additional language (Go, Java, or Rust)

ML Frameworks: Familiarity with PyTorch, TensorFlow, and model serving frameworks (TorchServe, TensorFlow Serving, etc.)

Platform & Operations Experience

Experience building and maintaining production ML platforms or similar infrastructure (Kubeflow, MLflow, SageMaker, etc.)

Knowledge of GPU computing, CUDA, and multi-GPU distributed computing

Understanding of ML model lifecycle management and MLOps practices

Experience with monitoring tools (Prometheus, Grafana, ELK stack)

Experience with streaming data processing (Kafka, Kinesis, Pulsar)

Familiarity with service mesh technologies and API gateways

AI / ML Knowledge

Understanding of large language models (LLMs) and inference optimization techniques

Knowledge of model quantization, pruning, and other optimization methods

Experience with distributed training and inference across multiple GPUs / nodes

Familiarity with vector databases and embedding storage solutions

Right to Work

At any stage, please be prepared to provide proof of eligibility to work in the country you’re applying in. Unfortunately, we are unable to support relocation packages or visa sponsorship.

ESG

“At team.blue, our commitment to caring for the environment and each other is at the heart of everything we do. Our latest impact report showcases our ongoing ESG efforts and ambitious sustainability goals.”

"Come as you are"

Everyone is welcome here. Diversity & Inclusion are at our core. Far above any technical competence, we value respect, openness, and trusted collaboration. We do not tolerate intolerance.
