Spearhead daily IT infrastructure and data centre (DC) operations, ensuring system functionality, availability, data security, and compliance with industry standards.
Lead a team of support engineers, assigning R&R, conducting performance reviews, and arranging rosters to ensure 24x7 operations coverage.
Conduct routine site inspections to assess the condition of facilities and equipment and adherence to safety protocols and procedures.
Create and maintain SOPs and policies to ensure consistent and efficient DC operations. Provide training to support engineers and vendors.
Oversee daily operations, monitor data centre device alerts, and manage incidents for servers, KVM, network devices, UPS, storage solutions, power, cooling, and security.
Conduct daily ticket reviews with the team to ensure SLAs are achieved and proper ticket hygiene is maintained.
Maintain the org chart, escalation matrix, and emergency contact lists.
Manage contracts, including outsourcing reviews, renewal processes, and compliance with agreed terms.
Manage data centre projects, including DC migration, AD migration, physical relocation, server decommissioning, physical-to-virtual migration, rack optimization, and disposal of servers and equipment in line with standard data sanitization practices.
Manage vendor relationships, including negotiating contracts, overseeing installations, and ensuring SLA compliance.
Manage DC inventory, including rack layouts, UPS, cabling, servers, storage, network equipment, KVM, keys, access cards, and PDUs.
Manage tape inventory, including tape storage and shipment into and out of the DC. Conduct an annual tape disposal exercise based on the client's policy.
Conduct capacity planning and forecasting to support future growth and scalability of the DC structure. Develop and implement cost-saving initiatives that reduce operational expenses.
Lead incident management and audit response coordination, improving the organization's readiness and resilience.
Coordinate with IT and network teams to ensure optimal performance and uptime of servers, storage and network equipment.
Maintain and regularly review the DC handbook, DC access registry, access card controls, and rack key log, and conduct CCTV reviews.
Plan and implement disaster recovery measures to minimize downtime and data loss. Conduct an annual DR drill exercise.
Generate weekly, monthly, and quarterly reports for the client on SLA, DC maintenance activity, power consumption, bandwidth utilization, UPS, Steercom, operations, major incidents, and ongoing projects.