Talent.com
SRE Observability Lead Engineer - Senior Vice President
SRE Observability Lead Engineer - Senior Vice President11037 Citibank, N.A. United Kingdom • London United Kingdom
No longer accepting applications
SRE Observability Lead Engineer - Senior Vice President

SRE Observability Lead Engineer - Senior Vice President

11037 Citibank, N.A. United Kingdom • London United Kingdom
30+ days ago
Job type
  • Full-time
Job description

The SRE Observability Lead Engineer is a hands-on leader responsible for shaping and delivering the future of Observability across Services Technology. This role reports into the Head of SRE Services and sits within a small central enablement team. You will define the long-term vision, build and scale modern observability capabilities across business lines, and lead a small team of SREs delivering reusable observability services.

This is a blended leadership and engineering role – the ideal candidate pairs strategic vision with the technical depth to resolve real-world telemetry challenges across on-prem, cloud, and container-based environments (ECS, Kubernetes, etc.). You’ll work closely with architecture & other engineering functions to not only resolve common challenges affecting SREs aligned to LoBs, but will ensure observability is embedded as a non-functional requirement (NFR) for all new services going live. You will collaborate with platform and infrastructure teams to ensure enterprise-scale, not siloed solutions. You will also be responsible for managing a small, high-impact team of SREs based in your region.

This role requires a comprehensive understanding of observability challenges across Services (Payments, Securities Services, Trade, Digital & Data) and the ability to influence outcomes at the enterprise level. Strong commercial awareness, technical credibility, and excellent communication skills are essential to negotiate internally, influence peers, and drive change. Some external communication may be necessary.

Responsibilities:

  • Define and own the strategic vision and multi-year roadmap for Observability across Services Technology, aligned with enterprise reliability and production goals.

  • Translate strategy into an actionable delivery plan in partnership with Services Architecture & Engineering function, delivering incremental, high-value milestones toward a unified, scalable observability architecture.

  • Lead and mentor SREs across Services, fostering a technical growth and SRE mindset.

  • Build and offer a suite of central observability services across LoBs – including standardized telemetry libraries, onboarding templates, dashboard packs, and alerting standards.

  • Drive reusability and efficiency by creating common patterns and golden paths for observability adoption across critical client flows and platforms.

  • Partner with infrastructure, CTO and other SMBF tooling teams, to ensure observability tooling is scalable, resilient, and avoids duplication (“cottage industries”).

  • Work hands-on to troubleshoot telemetry and instrumentation issues across on-prem, cloud (AWS, GCP, etc.), and ECS/Kubernetes-based environments.

  • Collaborate closely with the architecture function to support implementation of observability NFRs in the SDLC, ensuring new apps go live with sufficient coverage and insight.

  • Support SRE Communities of Practice (CoP) and foster strong relationships with SREs, developers, and platform leads across Services and beyond to accelerate adoption & promote SRE best practices like SLO adoption, Capacity Planning.

  • Use Jira/Agile workflows to track and report on observability maturity across Services LoBs – coverage, adoption, and contribution to improved client experience.

  • Remove inefficiencies and provide solutions to enable unified views of consolidated SLOs for critical E2E client journeys for Payments & other Services critical user journeys.

  • Influence and align senior stakeholders across functions (applications, infrastructure, controls, and audit) to drive observability investment for critical client flows across Services.

  • Represent Services in working groups to influence enterprise observability standards, ensuring feedback from Services is reflected.

  • Lead people management responsibilities for your direct team, including management of headcount, goal setting, performance evaluation, compensation, and hiring.

  • Appropriately assess risk when business decisions are made, demonstrating particular consideration for the firm's reputation and safeguarding Citigroup, its clients and assets, by driving compliance with applicable laws, rules and regulations, adhering to Policy, applying sound ethical judgment regarding personal behaviour, conduct and business practices, and escalating, managing and reporting control issues with transparency, as well as effectively supervise the activity of others and create accountability with those who fail to maintain these standards.

Qualifications:

  • Relevant experience in Observability, SRE, Infrastructure Engineering, or Platform Architecture, including several years in senior leadership roles.

  • Deep expertise in observability tools and stacks such as Grafana, Prometheus, OpenTelemetry, ELK, Splunk, and similar platforms.

  • Strong hands-on experience across hybrid infrastructure, including on-prem, cloud (AWS, GCP, Azure), and container platforms (ECS, Kubernetes).

  • Proven ability to design scalable telemetry and instrumentation strategies, resolve production observability gaps, and integrate them into large-scale systems.

  • Experience leading teams and managing people across geographically distributed locations.

  • Strong ability to influence platform, cloud, and engineering leaders to ensure observability tooling is built for reuse and scale.

  • Deep understanding of SRE fundamentals, including SLIs, SLOs, error budgets, and telemetry-driven operations.

  • Strong collaboration skills and experience working across federated teams, building consensus and delivering change.

  • Ability to stay up to date with industry trends and apply them to improve internal tooling and design decisions.

  • Excellent written and verbal communication skills; able to influence and articulate complex concepts to technical and non-technical audiences.

Education: Bachelor’s or Master’s degree in Computer Science, Engineering, Information Systems, or a related technical field.

What we’ll provide you:

By joining Citi, you will not only be part of a business casual workplace with a hybrid working model (up to 2 days working at home per week), but also receive a competitive base salary (which is annually reviewed), and enjoy a whole host of additional benefits such as:

  • 27 days annual leave (plus bank holidays)

  • A discretional annual performance related bonus

  • Private Medical Care & Life Insurance

  • Employee Assistance Program

  • Pension Plan

  • Paid Parental Leave

  • Special discounts for employees, family, and friends

  • Access to an array of learning and development resources

Alongside these benefits Citi is committed to ensuring our workplace is where everyone feels comfortable coming to work as their whole self, every day. We want the best talent around the world to be energized to join us, motivated to stay and empowered to thrive.

------------------------------------------------------

Job Family Group:

Technology

------------------------------------------------------

Job Family:

Applications Support

------------------------------------------------------

Time Type:

Full time

------------------------------------------------------

Most Relevant Skills

Please see the requirements listed above.

------------------------------------------------------

Other Relevant Skills

For complementary skills, please see above and/or contact the recruiter.

------------------------------------------------------

Create a job alert for this search

SRE Observability Lead Engineer - Senior Vice President • London United Kingdom

Similar jobs
KDB+ Senior Lead Software Engineer - Vice President

KDB+ Senior Lead Software Engineer - Vice President

JPMorganChase • Greater London, England, United Kingdom
Full-time
We have an opportunity to impact your career and provide an adventure where you can push the limits of what's possible.As a Lead Software Engineer at JPMorgan Chase within the Commercial & Investme...Show more
Last updated: 23 days ago • Promoted
VP Engineering

VP Engineering

Publicis Media • Greater London, England, United Kingdom
Full-time
As the VP of Engineering, you will be a key technical leader responsible for implementing engineering frameworks, processes, and AI-driven solutions that enable our performance products to scale gl...Show more
Last updated: 2 days ago • Promoted
Senior SRE Leader: Scale, Observability & IaC

Senior SRE Leader: Scale, Observability & IaC

Orgvue Limited • Greater London, England, United Kingdom
Full-time
A leading software platform in London is seeking a Principal Site Reliability Engineer to focus on scaling and hardening their AWS- and Kubernetes-based infrastructure.The successful candidate will...Show more
Last updated: 23 days ago • Promoted
Lead GenAI Lead Engineer, Innovation Labs – SVP

Lead GenAI Lead Engineer, Innovation Labs – SVP

Citigroup Inc. • Greater London, England, GB
Full-time
We're on the hunt for a highly skilled and experienced senior engineer to lead the design and development of the various AI services as part of the Citi Innovation Labs.The ideal candidate has an e...Show more
Last updated: 7 days ago • Promoted
VP of Site Reliability & Observability Engineering

VP of Site Reliability & Observability Engineering

Blackstone • Greater London, England, United Kingdom
Full-time
A leading global investment firm in London is looking for a Site Reliability Engineer to enhance system reliability and operational efficiency.The candidate will implement observability tools, coll...Show more
Last updated: 3 days ago • Promoted
Senior Vice President, Full-Stack Engineer Opportunities

Senior Vice President, Full-Stack Engineer Opportunities

BNY • Greater London, England, United Kingdom
Full-time
Senior Vice President, Full‑Stack Engineer Opportunities.At BNY, our culture allows us to run our company better and enables employees’ growth and success.As a leading global financial services com...Show more
Last updated: 23 days ago • Promoted
Senior SRE Engineer: Azure Reliability & Observability Lead

Senior SRE Engineer: Azure Reliability & Observability Lead

Prism Digital • Greater London, England, United Kingdom
Full-time
A leading technology company in the UK is seeking a Senior SRE Engineer to establish SRE practices on its Azure-based platform.The role involves enhancing reliability, observability, incident manag...Show more
Last updated: 11 days ago • Promoted
Senior SRE: Reliability, CI/CD & Observability Lead

Senior SRE: Reliability, CI/CD & Observability Lead

Sphere Digital Recruitment • Greater London, England, United Kingdom
Full-time
My client is looking for a skilled Senior Site Reliability Engineer to play a key role in improving the reliability, scalability, and operational performance of their production systems.This role w...Show more
Last updated: 7 days ago • Promoted
Vice President, Platform Engineering - Windows SRE

Vice President, Platform Engineering - Windows SRE

MUFG • London, England, United Kingdom
Full-time
Do you want your voice heard and your actions to count? Discover your opportunity with Mitsubishi UFJ Financial Group (MUFG), one of the world’s leading financial groups.Across the globe, we’re 150...Show more
Last updated: 10 days ago • Promoted
VP, Inkind Transfer Product & Strategy

VP, Inkind Transfer Product & Strategy

State Street • London, England, United Kingdom
Full-time
A leading financial services firm in Greater London seeks a Vice President for their Alpha Service product team.This role involves developing product features and improving client experiences in th...Show more
Last updated: 10 days ago • Promoted
VP, R&D — Lead Breakthrough Home Innovations

VP, R&D — Lead Breakthrough Home Innovations

Ninjakitchen • Greater London, England, United Kingdom
Full-time
A leading global innovation company is seeking a Vice President of Research & Development to shape the future of product experience.This role involves setting the vision and strategy for consumer-f...Show more
Last updated: 12 days ago • Promoted
Senior SRE - Global-Scale Reliability & Deployments

Senior SRE - Global-Scale Reliability & Deployments

Harnham • Greater London, England, United Kingdom
Full-time
A leading entertainment brand seeks a Senior Site Reliability Engineer to enhance the reliability and operational excellence of its digital commerce platform.You will lead incident responses, defin...Show more
Last updated: 9 days ago • Promoted
Senior SRE: Scale, Observability & Resilience (Contract)

Senior SRE: Scale, Observability & Resilience (Contract)

CBSbutler • Greater London, England, United Kingdom
Temporary
A global digital platform services company in the UK is seeking a Senior Site Reliability Engineer (SRE) for a 12-month contract.The role involves improving the reliability, scalability, and perfor...Show more
Last updated: 12 days ago • Promoted
Vice President, Platform Engineering - Windows SRE

Vice President, Platform Engineering - Windows SRE

MUFG Bank, Ltd • Greater London, England, United Kingdom
Full-time
Platform Engineering - Windows SME page is loaded## Platform Engineering - Windows SMElocations: Londontime type: Full timeposted on: Posted Todayjob requisition id: 10074184-WDDiscover your opport...Show more
Last updated: 30+ days ago • Promoted
Vice President, Platform Engineering - Windows SRE

Vice President, Platform Engineering - Windows SRE

Jobleads-UK • Greater London, England, United Kingdom
Full-time
Platform Engineering - Windows SME page is loaded## Platform Engineering - Windows SMElocations: Londontime type: Full timeposted on: Posted Todayjob requisition id: 10074184-WDDiscover your opport...Show more
Last updated: 2 days ago • Promoted
Senior Vice President, EMEA AI Hyperscale

Senior Vice President, EMEA AI Hyperscale

WNTD • Greater London, England, United Kingdom
Full-time
Greater London, England, United Kingdom.Senior Vice President, EMEA – AI-native hyperscale cloud group.This is a senior regional leadership role with broad scope and significant autonomy.The succes...Show more
Last updated: 23 days ago • Promoted
GenAI Strategy & Enablement Lead - VP

GenAI Strategy & Enablement Lead - VP

JPMorgan Chase & Co. • Greater London, England, United Kingdom
Full-time
A global financial services firm is seeking a GenAI Enablement Lead within the Commercial & Investment Bank (CIB).This Vice President-level role involves developing GenAI strategies and deploying G...Show more
Last updated: 30+ days ago • Promoted
Vice President, Platform Engineering - Windows SRE

Vice President, Platform Engineering - Windows SRE

MUFG Americas • Greater London, England, United Kingdom
Full-time
Do you want your voice heard and your actions to count?.Discover your opportunity with Mitsubishi UFJ Financial Group (MUFG), one of the world’s leading financial groups.Across the globe, we’re 150...Show more
Last updated: 4 days ago • Promoted