Engineer - Site Reliability

Last updated 18 days ago
Location:Wellington
Job Type:Full Time

Xero is a beautiful, easy-to-use platform that helps small businesses and their accounting and bookkeeping advisors grow and thrive.

At Xero, our purpose is to make life better for people in small business, their advisors, and communities around the world. This purpose sits at the centre of everything we do. We support our people to do the best work of their lives so that they can help small businesses succeed through better tools, information and connections. Because when they succeed they make a difference, and when millions of small businesses are making a difference, the world is a more beautiful place.

How you’ll make an impact

The Reliability team is passionate about empowering our internal customers to build reliable and efficient software. We do this by having an education focus, creating world class tooling and providing a follow the sun global support model for Xero’s customer-facing applications to achieve an optimal level of operational performance.

In this role, you will work closely with Xero’s Product teams to agree on shared objectives, build and implement sophisticated monitoring and remediation toolsets, practice technical leadership in a team and create a culture focussed on continually improving the operation of Xero’s platform and applications.

What you’ll do:

  • Empower our Xero developers to create more efficient, scalable and reliable applications for Xero's customers.
  • Improve on Xero's internal alerting and analysis tools to enable faster problem detection and recovery.
  • Support our product teams with advanced troubleshooting and root cause analysis techniques for identifying issues.
  • Provide guidance around tooling, standards and best practices in monitoring, tracing, logging and general observability.
  • Build and maintain tools that reduce toil in managing our monitoring and logging platform, and make it easier for other engineering teams to achieve a high standard of monitoring and logging.
  • What you’ll bring with you:

  • Have experience as a Site Reliability / DevOps engineer or alternatively prior experience as a software engineer / developer.
  • Experience managing and integrating with logging / monitoring solutions such as (but not limited to) New Relic, Datadog, Dynatrace, SignalFX, Scalyr, Sumo Logic and Splunk.
  • Have experience working with cloud providers such as AWS, Azure or GCP.
  • An understanding of how to measure and analyse software systems.
  • Good understanding of cloud infrastructure and networking fundamentals.
  • Programming / scripting ability, for example in Python, Golang or C#.
  • Have a passion to learn new software, frameworks, open source tools and development languages.
  • Strong customer service ethic.
  • Be able to engage effectively with both technical and non-technical staff.
  • This role will involve on-call availability and periodic overtime. If you are ready to take on a new challenge in a fast-paced organisation where the sky's the limit we want to hear from you.

    Why you should become a Xero

    It’s a diverse and inclusive environment, with people who will respect, challenge, support and mentor you to do the best work of your life. We’re a place where innovation and change are not only encouraged but also celebrated. We value our people and want them to enjoy and take pride in their work.

    We’re very supportive of flexible working arrangements and offer a competitive remuneration package including shares and life insurance, in addition to your base salary. We have a culture we’re proud of. Whether you're after a workplace with a social vibe, or a workplace which understands your family is priority - Xero is all of that and more.

    Xero is an NZ Immigration Accredited Employer and Rainbow Tick certified too.

    Please include a cover letter in your application, telling us why you’re a great fit for this position.