Position Summary
The Digital Technology department in Group IT are responsible for building all of Singtel\'s consumer and enterprise facing, channel applications & APIs, from mobile applications to eCommerce websites. These applications are primarily built using a microservices architecture on the AWS public cloud. The majority of our applications are deployed as Spring Boot microservices on Kubernetes (AWS EKS).
The Digital Technology Site Reliability Engineering (SRE) team act as an enabler for our application squads, empowering these \xe2\x80\x9cBuild-Run\xe2\x80\x9d teams to take more responsibility for their production systems, by ensuring that they have the right tools, skills and hands-on experience to succeed in production.
The SRE team consists of Software Engineers who are tasked with building the next generation operations automation & observability platform for our hybrid multi-cloud applications. This platform focuses on the deploy, operate and monitor phases of the DevOps lifecycle. The platform is primarily an extension and customisation of Kubernetes as our standard runtime platform, so prior experience as a software engineer working on or preferably extending Kubernetes is essential.
The SRE team also mentor and guide application squads on operational best practices for cloud native applications, such as configuration & secret management, observability engineering, data operations, security, etc. as well as partnering with these team to evolve a culture of SRE practices including fundamentals such as SLIs, SLOs and error budgets. The team will also be involved in production & non-production incidents, including post-mortems & advising and in some cases delivering permanent corrective actions.
Key Responsibilities
MNCJobz.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.