Responsible for monitoring Shopee & Monee's backbone network, data center network, and related networks. Ensure the rapid detection, notification, location, and resolution of network faults to maintain application stability and availability.
Propose and implement software systems to manage network related processes, for example maintenance.
Collect and establish databases for network devices, links, and configurations. Set up accurate and automated network alerts and notifications using software tools.
Requirements :
Bachelor's in Computer Science, Information Science or a related field, or equivalent.
More than 5 years of relevant experience
Passionate about coding and programming, innovation, and solving challenging problems.
In-depth understanding of computer science fundamentals (data structures and algorithms, operating systems, networks, databases, software architecture, etc).
Strong and hands-on experience with at least one of the programming languages: Go, Python.
Strong logical thinking ability.
Skills below are optional but preferable
Experience with network reliability systems such as pingmesh and network CMDB.
Published papers at conferences like USENIX NSDI, ACM SIGCOMM, IEEE INFOCOM, or other related conferences.
About the Team : The NDRE (Network Development and Reliability Engineering) team is responsible for the network infrastructure of Shopee and Seamoney. Our mission is to maintain and enhance the stability, availability, and manageability of the network infrastructure. We achieve this by developing software and standard processes to manage and monitor these network systems.