To Say Hello!

Find next jobs

job_search_content_direct

Site Reliability Engineer

Grab Vietnam
Updated: 18/09/2018

Employment Information

Job requirement

At Grab, observability is more than just collecting timeseries data and creating dashboards. We are looking for engineers that can help us build intelligent insight in helping all the Grab engineers in understanding the behaviour and performance of the largest ride sharing platform in South East Asia.

As an observability focused Site Reliability Engineer (SRE) at Grab, you’ll be part of a distributed SRE team that leverages open source and third-party solutions to help improve debuggability of systems, event correlation, timeseries collection and graphing.

As part of the SRE team, you will:

  • Work with engineering teams to design and write code to create systems which are highly available and able to scale seamlessly.

  • Help improve reliability, stability and tackle scalability challenges with engineering teams

  • Get involved in deep diagnosis of incidents, and engage with multiple highly skilled engineering teams on resolutions.

  • Contribute to a culture of learning and responsibility by writing detailed postmortem reports.

  • Identify and resolve problems relating to critical service operations and to prevent their recurrence using automation.

  • Be part of a cool team, responsible for one of the largest cloud based services in South East Asia.

  • Mentor other engineers, define our technical culture, and help build a fast-growing team

Requirements

  • Experience in designing and writing software for production systems.

  • Possess analytical skills, mental resiliency and the ability to think systematically under stressful conditions

  • Possess a solid understanding of the Linux or FreeBSD/OpenBSD family of Operating Systems and their underlying components

  • Possess a solid understanding of the OSI networking model (TCP/IP)

  • 2+ years of relevant experience with managing IT infrastructure with focus on the *nix platforms

  • Experience in one or more of: Go, Python, Perl or scripting experience in Shell

  • Highly accountable and takes ownership. Outstanding work ethic, high-integrity, team player, and a lifelong learner

  • Preferably a degree in computer science, software engineering, information technology or related fields

Really Nice to Haves

  • Experience with cloud computing technologies from vendors such as Amazon Web Services, Azure or Google Cloud Platform

  • Configuration management tool experience such as Ansible, Chef, Puppet or SaltStack.

  • Experience with building a monitoring solution (ELK, Prometheus, OpenTracing) is very beneficial.

  • Experience with hardening systems and knowledge in information security.

  • Contributes to open source project experience with performance analysis and debugging tools.

Company Overview

Grab Vietnam

Similar Jobs

Site Reliability Engineer

Grab Vietnam