As a Site Reliability Engineer (SRE), you'll help build a meaningful engineering discipline, combining software and systems to develop creative engineering solutions to operations problems. Much of our support and software development focuses on optimizing existing systems, building infrastructure, and reducing work through automation. You'll join a team of curious problem solvers with a diverse set of perspectives who are thinking big and taking risks. In this environment, you'll take the lead on relevant projects, supported by an organization that provides the support and mentorship you need to learn and grow. As an SRE, you'll be focused on running better production applications and systems.
As a Site Reliability Engineer, you are responsible for developing and implementing the processes needed to improve application and system reliability, along with operational support. Your expertise in application performance, analyzing complex data systems, anticipating problems, and mitigating risk will be key to helping a high-performing team design and navigate the program roadmap.
By applying your hands-on knowledge of application development and mission-critical production environments, you will effect change, drive automation, and develop innovative improvements and world-class practices.
You will be responsible for both uplifting and maintaining our evolving technology platforms, infrastructure, and technology controls. This includes production operations of our systems, engineering solutions that improve observability and traceability, and DevOps tasks such as building CI/CD pipelines to maximize system reliability. You may also be involved in defining Service Level Objectives (SLOs) and measuring performance against them by implementing Service Level Indicators (SLIs).
Your role also includes root cause analysis of incidents and proactive prevention of recurrence through the creative design and development of technical solutions and process improvements. You will partner with Infrastructure, Operations, and AD teams to identify and implement automation opportunities that drive down toil, reduce technical debt, and improve system stability.
Best of all, you'll be able to harness massive amounts of brainpower through our global network of technologists to tackle big challenges.
Responsibilities: