Site Reliability Engineer

Singapore, Singapore

Job Description


Are you seeking an environment where you can drive innovation? Does the prospect of working with top engineering talent get you charged up? Apple is a place where extraordinary people gather to do their best work. Together we create products and experiences people once couldn\'t have imagined - and now can\'t imagine living without! Think platform-as-product! Our team delivers great developer experiences to our Program, Project and Development teams through curated set of tools, capabilities and processes offered through our Internal Developer Platform. We automate infrastructure operations, support complex service abstractions, build flexible workflows and curate a frictionless ecosystem that enables end-to-collaboration to help drive productivity and engineering velocity.

Key Qualifications Key Qualifications

  • Experience working on Cloud Native SRE and Operational functions
  • Experience supporting customer facing systems in an 24-7 uptime environment of distributed systems
  • Ability to understand and utilize monitoring and observability data to manage Platform SLIs and SLOs
  • Experience handling production incidents and working towards resolution and stakeholder communication during incidents.
  • Automation focus for operational efficiency - designing and implementing automation processes for repeatable and consistent service deployment
  • A strong sense of ownership. Good critical thinking & interpersonal skills to work successfully across diverse business and technical & cross-functional teams.
  • Working knowledge of on-prem and cloud based hybrid architectures and infrastructure concepts of zones, regions, VPCs etc.
  • Good understanding of common authentication schemes, certificates, secrets and protocols
  • Scripting and/or coding skills needed for automation, triaging and troubleshooting.
Description Description

Work with a team of devops and SRE engineers to provide operational response for applications in public cloud platforms. Review go-live rediness, understand and drive change management activities for highly availability and low/no distruption Understand processes to improve incident coordination among Apple teams. Keep up to date with the latest technologies and tools in devops and SRE space and help with adoption for operational support. Maintain services once they are live by setting up monitoring and alerting. Help measure availability, latency, and overall system health. Strive for top quality results and continuously look for ways to improve and enhance platform reliability, performance, and security.

Education & Experience Education & Experience

Apple

Beware of fraud agents! do not pay money to get a job

MNCJobz.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Job Detail

  • Job Id
    JD1375298
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Singapore, Singapore
  • Education
    Not mentioned