About Graphcore
How often do you get the chance to build a technology that transforms the future of humanity? Graphcore products have set the standard in made-for-AI compute hardware and software, gaining global attention and industry acclaim. Now we are developing the next generation of artificial intelligence compute with systems that will allow AI researchers to develop more advanced models, help scientists unlock exciting new discoveries, and power companies around the world as they put AI at the heart of their business. Graphcore recently joined SoftBank Group – bringing large and ongoing investment from one of the world’s leading backers of innovative AI companies.
Job Summary
As a Senior Cloud Software Engineer, you will lead the effort to enable new AI accelerator hardware within Kubernetes environments. You will be responsible for the design, development, and maintenance of plugins in Go, ensuring seamless integration of the new AI accelerator with existing Kubernetes clusters and providing a native Kubernetes end-user experience. This role requires extensive experience in software development, container orchestration technologies, and cloud computing.
Responsibilities and Duties
- Lead the design and development of plugins in Go for the new AI accelerator integration in Kubernetes.
- Ensure seamless integration of the new hardware with existing Kubernetes clusters.
- Mentor and guide junior engineers, fostering a culture of continuous learning and improvement.
- Collaborate with cross-functional teams to design, implement, and test new features.
- Conduct thorough code reviews and provide constructive feedback to team members.
- Troubleshoot and resolve complex technical issues.
- If necessary, engage with the Kubernetes community, contributing to discussions, forums, and open-source projects.
- Write and maintain comprehensive documentation for your code and the overall project.
- Stay up to date with the latest trends and technologies in Kubernetes and cloud computing.
Skills and Experience
- Bachelor’s degree in Computer Science, Engineering, or a related field.
- At least 10 years of experience in software development, ideally with a focus on cloud environments.
- Proficiency in Go or Python programming.
- Extensive experience with Kubernetes; Certified Kubernetes Administrator (CKA) and Certified Kubernetes Security Specialist (CKS) certifications are preferred.
- Familiarity with machine learning-related technologies within the Kubernetes ecosystem (e.g. Kubeflow, KubeVirt, Kata Containers, Volcano) is highly desirable.
- Strong understanding of container orchestration and cloud-native development.
- Familiarity with other workload managers, such as Ray and SLURM, is considered an asset.
- Proven track record of achieving goals while implementing complex technical solutions.
- Knowledge of RDMA networks is considered an asset.
- Knowledge of cloud computing platforms such as Azure, GCP, and AWS, and of their services.
- Experience with CI/CD pipelines and DevOps tools e.g. GitHub/GitLab.
- Leadership and mentoring skills.
- Excellent problem-solving skills and attention to detail.
- Strong communication and collaboration skills.
- English: C1 level.
Benefits
In addition to a competitive salary, Graphcore offers flexible working, an annual leave policy, medical and dental health plans, a gym card, medical assessments, and an employee pension (matched up to 4%). We also have an employee assistance programme (which includes health, mental wellbeing, and bereavement support). We review our benefits on a yearly basis to ensure we offer a valuable and rewarding benefits programme to our employees.
We welcome people of different backgrounds and experiences; we’re committed to building an inclusive work environment that makes Graphcore a great home for everyone. We offer an equal opportunity process and understand that there are visible and invisible differences in all of us. We can provide a flexible approach to interviews and encourage you to chat to us if you require any reasonable adjustments.
What We Do
Graphcore has created a new processor, the Intelligence Processing Unit (IPU), specifically designed for artificial intelligence. The IPU’s unique architecture means developers can run current machine learning models orders of magnitude faster. More importantly, it lets AI researchers undertake entirely new types of work, not possible using current technologies, to drive the next great breakthroughs in general machine intelligence.
Our next-generation 3D Wafer-on-Wafer Bow IPU systems are helping AI innovators worldwide to build better, more innovative AI solutions, whether their focus is on language and vision, exploring graph neural networks and LSTMs, or creating something entirely new.
We believe our IPU technology will become the worldwide standard for artificial intelligence compute. The performance of Graphcore’s IPU is going to be transformative across all industries and sectors, whether you are a medical researcher, a roboticist, or building autonomous cars.
Our team is at the forefront of the artificial intelligence revolution, enabling innovators from all industries and sectors to expand human potential with technology. What we do really makes a difference.
We're always interested in hearing from exceptional people who want to join our team.