Inference Engine Development - Member of Technical Staff
Callosum • London, England
No longer accepting applications
8 days ago
Job type
  • Full-time
Job description
About Us

Artificial intelligence scaled on a bet - that bigger models, more identical chips, and more data would keep delivering. As problems grow more complex and the requirements of intelligence more diverse, that bet is breaking down. The next era belongs to heterogeneous intelligence: diverse models on diverse chips, each with distinct strengths, co-evolving into systems with capabilities unreachable by any single model or accelerator.

Callosum is the Intelligent Systems company. We built the infrastructure to make that possible. Our co-evolution engine optimises simultaneously across workflows, agents, and silicon. We launched in early 2026, showing orders-of-magnitude improvements in performance and a shift in the cost-performance frontier that no single chip or model provider can match. We believe intelligence comes from the system, not the model.

We are scientists and engineers solving what others consider impossible. If you thrive on hard problems and are energised by the scale of the challenge, we'd love to hear from you.

About the Role

Callosum believes that orders-of-magnitude improvements in AI systems will come through application-aware orchestration across heterogeneous hardware. We are building that vision: infrastructure that treats the full landscape of compute, GPUs and beyond, as a unified, co-evolving system.

Inference engines were designed for single-model inference on homogeneous GPU clusters - this role takes them beyond that. Working directly on systems like vLLM and SGLang, you will adapt and extend them for heterogeneous resources, making them hardware-aware, with deeper optimisation around scheduling, memory, and execution. The execution strategies you design - parallelism, disaggregation, caching - will define what heterogeneous inference looks like at production scale.

Your work ensures that the capabilities exposed by the lower layers of the stack translate into real, measurable gains, setting the new standard for how inference runs on diverse hardware.

What You'll Build
  • Contribute upstream to SGLang and vLLM, and maintain internal forks where our requirements diverge
  • Improve hardware-awareness within inference engines so that scheduling, memory management, and execution adapt to the capabilities of the underlying accelerator
  • Design and implement bespoke parallelism and disaggregation strategies that go beyond default configurations to better exploit heterogeneous hardware
  • Work closely with an Accelerator Systems Software engineer to ensure engine-level abstractions map cleanly onto diverse hardware capabilities

What You Bring
  • Deep familiarity with the internals of SGLang, vLLM, or comparable inference serving frameworks - scheduler design, memory management, and execution pipelines
  • Strong background in high-performance Python and C++/CUDA systems, particularly in the context of ML inference
  • Experience designing or implementing parallelism strategies for large model serving
  • Understanding of disaggregated serving architectures and the tradeoffs involved in separating modules of a workflow
  • Demonstrable record of working effectively in fast-moving open source codebases with evolving APIs and design conventions