NVIDIA Logo

NVIDIA

Senior Machine Learning Applications and Compiler Engineer, LPX

Reposted 8 Days Ago
Be an Early Applicant
In-Office or Remote
Hiring Remotely in Cambridge, Cambridgeshire, England
Senior level
In-Office or Remote
Hiring Remotely in Cambridge, Cambridgeshire, England
Senior level
Design and implement compiler and runtime optimizations for large-scale inference, map neural network workloads onto NVIDIA hardware, benchmark and profile performance, prototype new compilation/runtime techniques, and collaborate with hardware and software teams to influence architecture and tools.
The summary above was generated by AI

NVIDIA is seeking engineers to develop algorithms and optimizations for our LPX inference and compiler stack. You will work at the intersection of large-scale systems, compilers, and deep learning, crafting how neural network workloads map onto future NVIDIA platforms. This is your chance to be part of something outstandingly innovative!

 

What you’ll be doing:

  • Build, develop, and maintain high-performance runtime and compiler components, focusing on end-to-end inference optimization.

  • Define and implement mappings of large-scale inference workloads onto NVIDIA’s systems.

  • Extend and integrate with NVIDIA’s SW ecosystem, contributing to libraries, tooling, and interfaces that enable seamless deployment of models across platforms.

  • Benchmark, profile, and monitor key performance and efficiency metrics to ensure the compiler generates efficient mappings of neural network graphs to our inference hardware.

  • Collaborate closely with hardware architects and design teams to feedback software observations, influence future architectures, and codesign features that unlock new performance and efficiency points.

  • Prototype and evaluate new compilation and runtime techniques, including graph transformations, scheduling strategies, and memory/layout optimizations tailored to spatial processors.

  • Publish and present technical work on novel compilation approaches for inference and related spatial accelerators at top tier ML, compiler, and computer architecture venues.

 

What we need to see:

  • MS or PhD in Computer Science, Electrical/Computer Engineering, or related field, or equivalent experience, with 6 years of relevant experience.

  • Strong software engineering background with proficiency in systems level programming (e.g., C/C++ and/or Rust) and solid CS fundamentals in data structures, algorithms, and concurrency.

  • Hands on experience with compiler or runtime development, including IR design, optimization passes, or code generation.

  • Experience with LLVM and/or MLIR, including building custom passes, dialects, or integrations.

  • Familiarity with deep learning frameworks such as TensorFlow and PyTorch, and experience working with portable graph formats such as ONNX.

  • Solid understanding of parallel and heterogeneous compute architectures, such as GPUs, spatial accelerators, or other domain specific processors.

  • Strong analytical and debugging skills, with experience using profiling, tracing, and benchmarking tools to drive performance improvements.

  • Excellent communication and collaboration skills, with the ability to work across hardware, systems, and software teams.

  • Ideal candidates will have direct experience with MLIR based compilers or other multilevel IR stacks, especially in the context of graph based deep learning workloads.

 

Ways to stand out from the crowd:

  • Prior work on spatial or dataflow architectures, including static scheduling, pipeline parallelism, or tensor parallelism at scale.

  • Contributions to opensource ML frameworks, compilers, or runtime systems, particularly in areas related to performance or scalability.

  • Demonstrated research impact, such as publications or presentations at conferences like PLDI, CGO, ASPLOS, ISCA, MICRO, MLSys, NeurIPS, or similar.

  • Experience with large-scale AI distributed inference or training systems, including performance modeling and capacity planning for multi rack deployments.

 

#LI-Hybrid

NVIDIA Bristol, England Office

Romborne, 160 Aztec W, Almondsbury, Bristol, United Kingdom, BS32 4TU

Similar Jobs

13 Hours Ago
Remote or Hybrid
Senior level
Senior level
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
As a Senior Associate, you will implement Oracle HCM solutions, analyze problems, mentor junior staff, manage client relationships, and ensure quality deliverables.
Top Skills: Cc&BEbsHyperionOracle FusionOracle HcmPeoplesoftSiebel
15 Hours Ago
Remote
UK
Senior level
Senior level
Information Technology
As a Senior Data Scientist, you will lead high-complexity projects, develop ML and NLP solutions, and collaborate across teams to drive business impact through statistical modeling and data analysis.
Top Skills: BigQueryClickhouseDruidLlmsMachine LearningNatural Language ProcessingPower BIPythonRedshiftSQLTableau
16 Hours Ago
Remote or Hybrid
Mid level
Mid level
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
The role involves managing client needs through technology solutions, mentoring team members, analyzing complex problems, and using AI/GenAI to enhance productivity and client relationships.
Top Skills: Advanced LearningAWSAzureGitGCPLlm Development FrameworksMachine LearningPython

What you need to know about the Bristol Tech Scene

Along with Gloucester, Swindon and Bath, Bristol is part of the "Silicon Gorge" tech hub, a region in the U.K. renowned for its high-tech and research-driven industries, with a particular emphasis on sustainability and reducing environmental impact. As the European Green Capital, Bristol is home to 25,000 cleantech companies, including Baker Hughes and unicorn Ovo Energy. The city has committed to achieving net-zero emissions within the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account