Fractile Logo

Fractile

ML Runtime Engineer (Mid-Level and Senior)

Reposted 10 Days Ago
Be an Early Applicant
In-Office
Bristol, England, GBR
Senior level
In-Office
Bristol, England, GBR
Senior level
As a Senior ML Runtime Engineer at Fractile, you will integrate AI acceleration hardware with ML software projects, develop high-performance runtimes, and collaborate on co-design methodologies.
The summary above was generated by AI
ML Runtime Engineer (Mid-Level and Senior)

Fractile  ·  London or Bristol  ·  Full-time  ·  Hybrid

About Fractile

We’re taking a revolutionary approach to computing — building AI acceleration hardware that runs the world’s largest language models 100× faster than existing systems. Our team works at the cutting edge of both hardware and software AI development, and we’re growing fast.

If you want your work to have a direct, meaningful impact on how AI runs at scale, you’ll fit right in.

The Role

We’re looking for a Senior ML Runtime Engineer to help us integrate Fractile’s AI accelerators with the latest inference frameworks and build the runtime stack that makes them fly. You’ll work on genuinely hard problems — KV cache management, scalable multi-user inference, and the internals of transformer model execution — alongside a collaborative team that values curiosity and rigour equally.

This is a hybrid role, with offices in London and Bristol — your choice of base.

What You’ll Do
  • Integrate Fractile’s AI acceleration hardware with leading inference engines including vLLM and SGLang
  • Research KV cache management technologies (including paged attention) and build proof-of-concept implementations tailored to our hardware
  • Work closely with the runtime team to design and build a scalable, bare-bones reference inference engine
  • Focus primarily on the transformer ML architecture
  • Share your expertise to help shape the direction of our runtime stack
What We’re Looking For

We care most about depth of knowledge and a genuine interest in the problem space. You’ll be a strong fit if you have:

  • Solid experience with ML inference at scale, including multi-user serving
  • A deep understanding of paged attention and inference engines such as vLLM
  • Familiarity with key components of the ML software ecosystem
  • Strong software engineering skills and an instinct for clean, maintainable systems

Bonus Points

These aren’t requirements, but they’d make you stand out:

  • Experience with Rust
  • Having built your own inference engine from scratch
  • A degree in Computer Science or a related field
Why Fractile
  • Work on one of the most technically ambitious projects in AI infrastructure
  • A small, expert team where your contributions are visible and valued
  • Hybrid working — split your time between home and our London or Bristol office
  • Competitive salary and equity
  • A culture that values learning, directness, and collaboration

Fractile is committed to building a diverse and inclusive team. We welcome applications from people of all backgrounds and actively encourage candidates from underrepresented groups to apply.


Similar Jobs

10 Days Ago
In-Office
Senior level
Senior level
Semiconductor
The Senior ML Runtime Engineer will integrate AI acceleration hardware with open source projects, develop high-performance Rust runtime, and collaborate on hardware-software design.
Top Skills: MlPythonPyTorchRustSglangVllm
An Hour Ago
Easy Apply
Hybrid
Easy Apply
Mid level
Mid level
AdTech • Artificial Intelligence • Machine Learning • Marketing Tech • Software • Sports • Big Data Analytics
Join a multidisciplinary Agile team to design, build and maintain highly distributed, real-time, cloud-native sportsbook and risk-management systems. Contribute across the stack (front-end, back-end, infrastructure), apply good software design, TDD, CI/CD and scalable microservice patterns, and collaborate with DevOps, Data Science and QA to deliver high-quality, observable products.
Top Skills: Agentic AiAWSC#C++CachingCi/CdDatabasesDockerGitGoInfrastructure As CodeJavaJavaScriptKotlinKubernetesMessagingMicroservicesPHPPulsarPythonRabbitMQReactShadcnTdd
An Hour Ago
Hybrid
Senior level
Senior level
Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
Monitor and interpret AML, sanctions, and consumer protection regulations; advise on compliance implications for products and markets; maintain regulatory logs; support policy drafting, audits, QA, training, reporting, and stakeholder remediation tracking.

What you need to know about the Bristol Tech Scene

Along with Gloucester, Swindon and Bath, Bristol is part of the "Silicon Gorge" tech hub, a region in the U.K. renowned for its high-tech and research-driven industries, with a particular emphasis on sustainability and reducing environmental impact. As the European Green Capital, Bristol is home to 25,000 cleantech companies, including Baker Hughes and unicorn Ovo Energy. The city has committed to achieving net-zero emissions within the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account