Octopus Deploy

Senior Site Reliability Engineer

Posted 9 Days Ago

Be an Early Applicant

Remote

Hiring Remotely in NZ

Senior level

Remote

Hiring Remotely in NZ

Senior level

The Senior Site Reliability Engineer ensures the reliability and availability of build systems, leads implementations, and promotes safety culture while collaborating across teams to improve reliability practices.

The summary above was generated by AI

Octopus Deploy sets the standard for Continuous Delivery, empowering software teams to deliver value in an agile way. Over 4,000 organizations globally – including Ubisoft, Xero, Stack Overflow, NASA, and Disney – rely on our Continuous Delivery, GitOps, and release orchestration solutions.

We’re a profitable scale-up of 300+ people, growing steadily. We’ve built a high-trust, remote-first, and value-driven culture where people are given space to do their best work.

The Builds team at Octopus Deploy is looking for a Senior Site Reliability Engineer (SRE) to:

Share SRE expertise with teams across the company.
Keep our build systems running with high reliability and availability.
Improve and iterate on our existing reliability practices.
Bring fresh ideas and practices to increase reliability and reduce toil.
Lead the implementation of new capabilities.

You’ll be a great fit if you:

Naturally work in line with our Senior SRE expectations.
You collaborate effectively, even across wide organisational distances, to solve problems, combining passion, pragmatism, and empathy.
Thrive in an environment focused on availability, reliability, and observability.
Are a strong systems engineer and may have deeper expertise in particular domains.
See value in applying safety culture lessons from other industries to software and operations.
Are comfortable leading postmortems and designing deployment and monitoring pipelines.
Care deeply about automation across builds, tests, deployments, infrastructure, and operational tasks.
Embrace a “you build it, you run it” culture, with a strong commitment to quality and system availability, and are happy to participate in a humane on-call program.
Are self-motivated, work independently with high-quality output, and proactively seek help or new work when needed.
Are results-oriented, adapt quickly when business direction changes, and encourage the same in others.
Welcome candid feedback, enjoy solving complex problems, and like helping other engineers succeed while working on genuinely valuable projects.

Our tech stack

You don’t need to know all of this – it’s here to give you a feel for our environment.

Octopus Server

Our primary focus and flagship product.
Written in .NET and backed by a SQL database.
Experience with the C# application SDLC (e.g. building, testing) is highly regarded.

CI/CD

TeamCity is our primary build system for Octopus Server.
GitHub Actions is used for some internal tools.
Continuous delivery is powered by Octopus Deploy.

Workloads

A mix of internally developed applications and third-party software (e.g. TeamCity).
Run in Azure using App Services, AKS clusters, and Azure Functions.
Container workloads run on AKS, with Docker Hub and Artifactory as container registries.

Infrastructure as Code (IaC)

Terraform is our primary IaC tool.
IaC workloads run mostly in Octopus Deploy, with some running via GitHub Actions.

Observability

Our team operates a multiregion OpenTelemetry processing system for the rest of R&D.
We’ve adopted OpenTelemetry across many of our Builds systems.
We help other teams adopt OpenTelemetry for more use cases company-wide.
We use Sumo Logic and Honeycomb for analysis and troubleshooting.

A typical day might include:

Building new capabilities to increase reliability (we don’t want you staring at dashboards all day).
Working where you do your best work – from your home office, with your preferred setup, tools, and soundtrack.
Consulting with another team on how to operate their services at the right level of reliability, or how best to use our build and observability platforms.
Pairing with another engineer over Zoom to solve a complex technical problem or explore the problem space for future improvements.
Responding to an actionable alert and working to maintain the reliability of the platform used across the company.
Improving documentation so engineers can discover solutions themselves and reduce lead time.
Writing a blog post or preparing a talk to share something interesting you’ve learned with other engineers.
Facilitating an incident review and turning the learnings into practical changes.
Proactively reducing toil by building thoughtful automation.

Compensation:

Octopus has an internally open and transparent system for compensation. Any Octonaut can view the compensation for any role at any level. This ensures people doing the same work with the same skill get paid the same.

The compensation for this role is:

Level 3 - Senior Site Reliability Engineer

Maturing: $145k AUD / $155k NZD, Performing: $165k AUD / $175k NZD

Salaries exclude Super and Kiwi Saver.

Benefits include a minimum of 25 days annual leave, up to 10 days of paid sick and carers leave, 12 weeks of fully paid parental leave with flexible return options, and stock options. Learn more.

Below is the interview process you can expect for this role. We know interviewing can seem daunting, but rest assured we designed our interview process to move quickly while still getting you all the information you need.

👋🏼Initial Chat

[30 min] Meet with a Talent Acquisition team member, and get a feel for what it would be like to be an Octonaut!

💻Engineering Problem Presentation

[75 min] You'll be given instructions to prepare a presentation which you'll present to two members of the team (15-20 minutes) before being asked some questions.

🧑‍💻Hiring Manager chat

[30 min] A final call to answer any last questions of yours and ours.

We are looking for people who live and work in Australia and New Zealand to join our remote-first team. We are unable to provide visa sponsorship.

Top Skills

.Net

Aks

App Services

Azure

Azure Functions

Docker

Github Actions

Honeycomb

Opentelemetry

SQL

Sumo Logic

Teamcity

Terraform

Similar Jobs

Canonical

Senior Site Reliability Engineer

6 Days Ago

Easy Apply

Remote

New Zealand

Easy Apply

Senior level

Cloud • Software

The Senior Site Reliability Engineer will automate operations using Python, manage Kubernetes and OpenStack clusters, and ensure high availability for enterprise infrastructures.

Top Skills: KubernetesLinuxOpenstackPython

Canonical

Senior Site Reliability / Gitops Engineer

6 Days Ago

Easy Apply

Remote

New Zealand

Easy Apply

Senior level

Cloud • Software

The Senior Site Reliability / Gitops Engineer will drive automation and collaboration within the IS team, enhancing Canonical's IT operations and services while managing infrastructure as code and cloud technologies.

Top Skills: Cloud ComputingDockerElasticsearchGitopsGrafanaIacKubernetesLinuxPrometheusPython

Canonical

Site Reliability / Gitops Engineer

6 Days Ago

Easy Apply

Remote

New Zealand

Easy Apply

Mid level

Cloud • Software

As a Site Reliability / Gitops Engineer, you will automate operations, develop Infrastructure as Code, maintain core services, and collaborate on service architecture.

Top Skills: Ci/CdCloud ComputingElasticsearchGrafanaInfrastructure As CodeLinuxPrometheusPython

What you need to know about the Bristol Tech Scene

Along with Gloucester, Swindon and Bath, Bristol is part of the "Silicon Gorge" tech hub, a region in the U.K. renowned for its high-tech and research-driven industries, with a particular emphasis on sustainability and reducing environmental impact. As the European Green Capital, Bristol is home to 25,000 cleantech companies, including Baker Hughes and unicorn Ovo Energy. The city has committed to achieving net-zero emissions within the next decade.