Project: Asynchronous Data Exchange (ADEX)Scope of work:
Responsible for setting up the development and production environment for the project.
Responsible for setting up scalable and maintainable systems and components for the project.
Responsible for managing all network and routing for the project including load balancers.
Responsible for managing all development and production cloud infrastructure such as servers, storage, networks and databases.
Responsible for provisioning virtual machines, databases, application software, and cloud services.
Responsible for developing automation scripts for build and release of systems.
Responsible for planning system work for the project.
Responsible for surfacing any issue and instability to the team.
Responsible for software patching and upgrades.
Required to work with the team to troubleshoot all system issues.
Required to harden all software and systems of the project.
Required to work closely with the team to ensure a successful delivery of the project.
Required to build monitoring and logging system for the project.
Required to work with other FMs and operators to ensure end to end system delivery.
Required to ensure reliability and compliance of all systems to government policies.
Required to work with government hosting team and security team to deliver the system end to end.
Requirements:
Bachelor Degree in IT/Computer Science/Computer Engineering or relevant discipline.
At least 3 years of hands-on experience in DevOps.
At least 3 years of experience working with Ubuntu, RHEL, and Windows environments.
At least 2 years of experience in automation tools such as Terraform, Ansible, etc.
At least 2 years of experience with open-source ecosystems.
At least 2 years of experience with setup and maintaining Kubernetes clusters.
At least 2 years of experience in cloud hosting such as AWS, Azure or Google Cloud.
Delivered at least 2 projects using agile methodology.
Ability to do troubleshooting cloud services.
Ability to do scripting and automation.
Experience in system monitoring and logging.
Experience in GitLab CI/CD pipelines.
Experience in setting up databases such as MongoDB and Postgres.
Preferably:
Linux administrator certifications.
Experience in Golang and Nodejs.
Experience in Solace PubSub+ or RabbitMQ.
Experience in ELK, Splunk, SIEM
Experience in software and system security.
Experience in configuring firewalls, proxies, network intrusion detection systems.
Experience with software defined network and software defined infrastructures.
Ability to do programming.
You will get to:
Work with the solution architects to design and implement infrastructure services on Cloud platform.
The candidate must be able to perform the following:
Network design - design subnets, routing rules and network-related components to support application transaction requirements.
High availability - setup and configure cloud components and services to ensure no single point of failure at every infrastructure level.
Capacity management and planning - devise mechanisms to monitor and capture system metrics and identifying potential bottlenecks at the infrastructure level.
Develop formulas for capacity sizing to cater to load projections.
Security and access control - design IAM roles and utilize cloud native controls to meet security objectives and every infrastructure level and cloud services used.
Infra automation - develop scripts and templates to automate infrastructure provisioning and changes.
Security configuration \xe2\x80\x93 cloud native security controls through design of security groups, network access control list and IAM roles to create micro-segmentation-based security. Secrets management, privileged access management (PAM), DDOS and WAF protection.