Conduct the entire Shopee technical project life cycle from requirements gathering to deployment including Shopee data pipeline product, cloud product, IT infrastructure and technical platform to achieve business goals (e.g. specialised cache system)
Advise technical management and coordinate design, migration, and integration
Analyse database resource usage, propose and plan for database/server optimisation to increase server resource utilisation/efficiency
Drill with Shopee infrastructure team for Shopee databases and Shopee Cloud databases and deal with Disaster Recovery (DR) planning, setup, management
Plan IT infrastructure for major marketing events (e.g. 11.11 sales), including planning resource, improving technical product and database performance, and coordinating stress tests
Plan and deliver IT infrastructure (data center, server, network and etc) capacity as well as manage resources
Prioritise requirements from various stakeholders such as business operations, product management, application system developers, etc
Establish clear milestones with visible progress regularly to ensure timely project delivery
Requirements :
Bachelor's degree or higher in Computer Science, a related technical field or equivalent practical experience
Over 2 years of experience in software development or project management is highly preferred.
Proven experience in DevOps, Disaster Recovery, and Capacity Planning.
Experience in network deployment/engineering, product design, network content delivery, or data center networking within the telecommunications or internet industries is highly preferred.
Familiarity with working on cloud components to develop highly scalable and available systems, including APIs, automation, and data warehousing/analytics, is a plus.
Strong written and verbal communication skills, with the ability to engage both technical and non-technical stakeholders at all levels of the organisation.
Comfortable working in a fast-paced, agile environment.
About the Team :About the TeamThe mission of the Shopee Tech Ops MRE (Machine Reliability Engineering) team is to ensure efficient and sustainable operation of the Shopee network and hardware level 24x7, building and maintaining massive hardware clusters for SRE and capacity, in terms of capacity, cost and hardware performance. The team provides sustainable hardware resources and stable network support services. MRE needs to communicate with the data centre team to design and optimise network architecture; provide reasonable hardware configuration through hardware testing and selection according to business requirements; customise stable and efficient OS; optimise traditional operation through engineering and service means; and build a complete hardware monitoring system to improve the efficiency of fault handling.