Administer, operate, and maintain Linux-based HPC clusters, including compute, storage, and high-speed networking
Manage and support HPC job schedulers such as
Slurm, PBS Pro, and LSF
Support parallel file systems including
Lustre, GPFS / Spectrum Scale, and BeeGFS
Manage Data Lake solutions (e.g.
VAST
)
Support Hierarchical Storage Management (HSM) solutions (e.g.
Data Management Framework - DMF
)
Manage cluster management and provisioning tools
Perform system monitoring, patching, upgrades, and capacity planning
Troubleshoot and resolve hardware, software, OS, and network issues across HPC environments
Participate in on-call or escalation support rotations as required
Work with software engineers to support
AI / Deep Learning
applications
Collaborate with desktop engineers to assist users as needed
Provide guidance to researchers on HPC application development, debugging, optimization, and parallelization
Deliver HPC user training sessions and contribute to documentation and best-practice guides
Required Certifications
ITIL Foundation
or equivalent (or higher)
Red Hat Certified System Administrator (RHCSA)
or equivalent (or higher)
Key Performance Indicators
Meet SLA requirements for incident and service request handling
Comply with all policy and contract requirements
Required Skills & Attributes
Strong analytical and troubleshooting skills
Highly motivated and self-driven
Strong team player with collaborative mindset
Excellent written and verbal communication skills
Ability to explain complex technical concepts to non-technical users
Commitment to continuous learning and knowledge sharing
Job Type
Contract (1 year, renewable)
Work Location
Singapore (Onsite / as per client requirement)
Job Type: Full-time
Pay: $2,625.19 - $8,243.20 per month
Benefits:
Health insurance
Work Location: In person
Beware of fraud agents! do not pay money to get a job
MNCJobz.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.