Senior Site Reliability Engineer
Date Submitted: 12-01-2021 | End date: 12-02-2021
Industry Specialization | : | |
Type of Employment | : | Permanent |
Minimum Experience | : | 5 years or more |
Work Location | : | West Singapore |
Job Description:
- Design and implement toolchain to enable government agencies to adopt SRE in developing and operating products and services
- Design and own SRE processes within government security and policies requirements
- Be-trained and to-train government agencies on SRE. Be the subject matter of experts in SRE
- Ensure the SRE toolchain and processes up to date with the industry practices, and the SLO and error budget are adhered within the toolchain as well as on boarded products and services
- Assist in the problem solving and development to enhance the reliability and sustainability of products and services
Requirements:
- Degree or Diploma in Computer Science, Computer or Electronics Engineering, Information Technology or related disciplines.
- Operational Automation with Terraform, Ansible, shell scripting, or python programming.
- Monitoring platforms such as Prometheus, Grafana, Dynatrace and etc.
- Infrastructure Services Administration (VM, DB, Network Services) on cloud and on-prem such as patching and backup
- Infrastructure and Policies as Code and DevOps
- Application Monitoring and Metrics Development
- Logging and tracing platforms/tools such as Elastic, Fluentd, Kibana, Jaegar
- At least 10 years of full stack development experience using open-source web platforms and frameworks (nodeJS, spring framework), RestAPI
- Experienced in deployment automation and monitoring automation
- Familiar with AWS cloud services.
- Hands-on experience in Windows and Linux environments.
- Ability to work and thrive in a highly iterative environment, learn rapidly and master diverse web technologies and techniques
- Experience building and maintaining API-oriented services
- Self-motivated and good communication skills