Singapore, Singapore
Company: Singtel Group
At Singtel, our mission is to Empower Every Generation. We are dedicated to fostering an equitable and forward-thinking work environment where our employees experience a strong sense of Belonging, make meaningful Impact and Grow both personally and professionally. By joining Singtel, you will be part of a caring, inclusive and diverse workforce that creates positive impact and a sustainable future for all.
Be a Part of Something BIG!
Position Summary
The Digital Technology department in Group IT are responsible for building all of Singtel's consumer- and enterprise-facing channel applications & APIs, from mobile applications to eCommerce websites. These applications are primarily built using a microservices architecture on the AWS public cloud. The majority of our applications are deployed as Spring Boot microservices on Kubernetes (AWS EKS).
The Digital Technology Site Reliability Engineering (SRE) team act as an enabler for our application squads, empowering these “Build-Run” teams to take more responsibility for their production systems by ensuring that they have the right tools, skills and hands-on experience to succeed in production.
The SRE team consists of Software Engineers who are tasked with building the next generation operations automation & observability platform for our hybrid multi-cloud applications. This platform focuses on the deploy, operate and monitor phases of the DevOps lifecycle. The platform is primarily an extension and customisation of Kubernetes as our standard runtime platform, so prior experience as a software engineer working on or preferably extending Kubernetes is essential.
The SRE team also mentor and guide application squads on operational best practices for cloud native applications, such as configuration & secret management, observability engineering, data operations and security, and partner with these teams to evolve a culture of SRE practices, including fundamentals such as SLIs, SLOs and error budgets. The team will also be involved in production & non-production incidents, including post-mortems, advising on and in some cases delivering permanent corrective actions.
Make an Impact by
Building & supporting the next generation Cloud Application Runtime Platform for Digital Technologies
Mentoring development squads on Kubernetes, cloud engineering, cloud native best practices including configuration & observability
Building self-service operations and delivery pipelines, primarily GitOps-based with Kubernetes controllers
Cloud native application monitoring, log aggregation and distributed tracing platforms and tooling
Supporting production incidents and designing permanent corrective actions
Skills for Success
Mandatory Experience
BSc in Computer Science or related discipline
Min. 8 years of work experience in Java software development, including 3+ years of Spring Boot microservice development. You will be required to sit a Java programming test.
Experience building and deploying cloud native applications on Kubernetes (EKS, AKS, OCP)
Solid understanding of Linux, networking & distributed systems
Observability tools such as ELK, Prometheus, AWS CloudWatch, Grafana, Jaeger, OpenTelemetry, etc.
Desirable Experience
Building and deploying cloud native applications on the public cloud, ideally on AWS.
Working in a start-up or another environment with a wide scope of responsibilities
Working on a platform engineering team
Production support of systems, for example, on an L3 support rota
Professional certifications in Java, Linux, Networks, AWS or Kubernetes are a bonus
Beneficial if you have experience in some of the following:
Experience working on an SRE team / being part of an SRE maturity journey
Argo stack: ArgoCD for GitOps, Events, Workflows, Rollouts, etc.
Go, Python or Groovy development
Experience analysing Java heap / thread dumps
Data Engineering & Analysis, ETL pipelines, etc.
Cloud Engineering on Hybrid multi-clouds using Terraform, Crossplane, AWS ACK & CDK
Network Mgmt. tools such as Apigee, Kong, Envoy, NGINX, Istio
Configuration Mgmt. tools such as Ansible, Puppet or Chef
Application Performance Monitoring tools, such as Dynatrace, AppDynamics, Elastic APM
JBoss, Tomcat, Redis, PostgreSQL, MongoDB
Rewards that Go Beyond