IT Site Reliability Engineering (SRE) Lead

Singapore, Singapore

Job Description


Job: Technology
Primary Location: Asia-Singapore-Singapore
Schedule: Full-time
Employee Status: Permanent
Posting Date: 13/Jul/2022, 12:50:00 AM
Unposting Date: 12/Aug/2022, 5:59:00 PM

We are New! We are growing! We're a Startup! We are SC Digital Bank!
We are looking for strong talent to work within our brand new Digital Bank. We are a growing venture with new and exciting problems to solve on a daily basis, focused on the mission of creating Singapore's Digital Bank. We are well-backed, agile and do focused work within interdisciplinary teams. If this sounds like a place where you want to work, don't hold back! Send through your profile and get connected with one of our recruiters to find out more.
We're looking for an SRE Lead to work onsite within our brand-new Digital Bank. We're a small, but growing venture, with new and exciting problems to solve. We work in project-based sprints in small, interdisciplinary teams.
We are seeking a technically savvy, experienced, and inspiring leader to be based in Singapore to lead the SRE Function under the Technology Team. The SRE team supports the ‘Run the bank’ function in the Digital Bank and is responsible for ensuring we have the right processes, people and tools in place to keep our environment running 24x7.
The role includes the day-to-day operational management of Technology Services, including active management of partners and outsourced service providers. The role involves managing the team (onshore and offshore), supporting BAU support services, developing creative ways to solve problems, and bringing an engineering mindset whilst engaging with the Technology leadership team. The role will entail people, process, project management and engineering work. We are small, so being hands-on and strategic at the same time is expected.
The Role Responsibility:

  • The role is expected to work with the development team and application support team to ensure performance and a high level of application availability by preventing incidents through proactive monitoring and incident correlation, as well as by continuously establishing and tracking user experience metrics.
  • By bringing an Engineering mindset to the table, the role is expected to be creative in problem solving, passionate about process and understanding code with a view to accelerating problem solving proactively.
  • Assume the role of Major Incident Manager and manage the lifecycle of major incidents. Provide command & control, mobilize resolvers, identify paths for mitigation, and track multiple workstreams to closure.
  • Besides the above, the role shall assume the BAU responsibilities of managing the ‘Run the bank’ function and overall ownership of the strategy and design of the ITSM processes, including Incident Management, Problem Management, Configuration Management (CMDB), Change Management and Event Management.
  • Support service quality deep dives for technology incidents and service disruptions caused by data transmission failures, batch processing delays, erroneous code deployments, Continuity of Business failures, etc.
  • Provide management support in ensuring the highest levels of service quality and improving service levels by identifying problem trends and causes that impact the delivery of production services.
  • Communicate well and manage highly stressful situations over the phone. Demonstrate proven leadership qualities, removing any ambiguity as to who is coordinating the incident resolution.
  • Develop and maintain the Business Continuity Plan and Disaster Recovery Plans for IT, and implement measures designed to safeguard Information Technology and the needs of the business in the event of major incidents or disasters.
  • Design and run the Operational Acceptance Testing strategy for the services moving into production.

Our Ideal Candidate:
  • Strong background and fundamentals in engineering concepts across DevOps / Infrastructure Management. A degree in Engineering or Software Engineering is key.
  • 15+ years of Technology experience, of which 5+ years working as an SRE or in a DevOps / Agile environment.
  • Technical knowledge of the management of AWS/Cloud hosted services.
  • Technical knowledge of one or more of the following: Java, Python, Kotlin, Observability, Sumo Logic, Splunk, Jira Service Management, Datadog, Grafana, ELK, Terraform.
  • Experience in Freshservice / Jira Service Management / ServiceNow or other workflow / request management tools.
  • Excellent verbal and written communication skills, with the ability to deliver presentations to multiple levels of management.


Job Detail

  • Job Id
    JD1075920
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Singapore, Singapore
  • Education
    Not mentioned