Your Impact

Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. At Goldman Sachs, SRE is responsible for improving the availability and reliability of some of the firm's most critical platform services and for ensuring they meet the requirements of our internal and external users. We are looking for engineers who are motivated to collaborate with our businesses to build and run sustainable production systems that can evolve and adapt to changes in our fast-paced, global business environment.

The SRE team develops and maintains platforms and tools that help other engineering teams in Goldman Sachs build and operate reliable and resilient systems. The platforms we offer range from central logging and tracing to monitoring and alerting. We also provide tools to drive adoption of, and improvements to, capacity planning, operational readiness assessments, production incident postmortems, SLIs/SLOs, and deployment automation, including canary releases.

The products and services we provide to our internal customers are used by thousands of engineers every day. We believe that reliability is the most important feature of any system, and we are devoted to giving our engineers the platforms and tools they need to build and operate reliable products.

How You Will Fulfil Your Potential

As a developer in the SRE team, you will work with internal customers, product owners, and SREs to design, develop, and support the platforms and tools we provide to other engineering teams, enabling them to run reliable, large-scale production systems spanning cloud and on-prem datacenters.

Responsibilities