Site Reliability Engineer

Orgvue Limited, London, England, United Kingdom

30+ days ago

Job type

Full-time

Job description

Orgvue is an organisational design and planning platform that empowers your business to transform its workforce by understanding the work people do and the skills they have. Our platform connects strategy to structure, providing clarity of vision, so you can build a more adaptable, better performing organisation that thrives in a constantly changing world of work.

The world’s largest and best-known enterprises and consulting firms use Orgvue to visualise and model current and future states of the organisation and make faster, more informed decisions. The company is headquartered in London, with offices in Philadelphia, The Hague, Toronto, and Sydney.

Role: Principal Site Reliability Engineer

You will be a senior technical leader focused on scaling and hardening our AWS- and Kubernetes-based infrastructure. You will collaborate across product, platform, and operations teams to ensure our systems are reliable, observable, and resilient, even at scale.

This role combines hands-on technical skills with strategic vision, helping us build a world-class reliability culture and a robust engineering foundation for growth. We seek someone with technical expertise, excellent communication skills, and a collaborative spirit.

Responsibilities:

Define and enforce SLOs, SLIs, and error budgets across critical services
Develop and implement cloud infrastructure and tooling strategies
Enhance SRE practices across the organisation
Implement robust observability (metrics, logs, and traces) using our observability tools
Guide the team in building automated, self-healing systems
Own and evolve incident response processes, including on-call practices and post-mortem culture
Mentor engineers on reliability, operational readiness, and scalable infrastructure best practices
Drive Infrastructure as Code (IaC) initiatives using Terraform, Kubernetes, CloudFormation, and GitOps practices
Collaborate with security, DevOps, and software teams to ensure compliance and operational excellence
Evaluate and adopt tools and practices to improve platform performance and reliability

Desired Skills & Experience:

Experience leading SRE transformations

Hands-on expertise with Kubernetes (EKS preferred) in production

Strong experience with AWS core services (EC2, EKS, RDS, S3, ALB/NLB, IAM, CloudWatch, etc.)

Proficiency in Infrastructure as Code using Terraform and knowledge of GitOps workflows

Strong background in observability: metrics, visualization, logging, tracing

Understanding of automation, CI/CD pipelines, deployment automation, and release strategies

Experience with incident management, disaster recovery, root cause analysis, and post-incident reviews

Additional Benefits:

Hybrid working: 1+ days a week in London office

Wellbeing initiatives: coaching, fitness sessions, webinars, Wellbeing day

Subsidised gym membership

Private medical insurance, dental, vision, and life assurance

25 days holiday (increasing to 30)

Summer Fridays (half-days in July and August)

Employer pension contribution of 5% (if you contribute at least 3%)

Season ticket loan

Cycle to Work Scheme

Annual discretionary bonus

Here at Orgvue, we promote individualism and a diverse workforce to build our future success.
