Mô tả công việc
Tóm tắt công việc
Role Summary
SRE ensures smooth day-to-day operations of the Bank. Understanding of production system access and control, production deployment, Amazon Web Services, Kubernetes, continuous deployment and systems observability is essential for this role.
Key Responsibilities
Participate in on-call rotations(*) to provide support for critical systems. Engineers are required to work on a rotating 2-2-2 schedule: 2 morning shifts followed by 2 days off, 2 afternoon shifts followed by 2 days off, and 2 night shifts followed by 2 days off.
Morning: 09:00 AM - 06:00 PM
Afternoon: 05:00 PM - 02:00 AM
Night: 01:00 AM - 10:00 AM
Overtime Policy:
6:00 PM - 10:00 PM: 30%
10:00 PM - 6:00 AM: 100%
Public Holidays: 300%
Resolve system incident when occurs
Deployment of changes into staging and production environments
Work with Platform Engineers to understand the changes
Develop deployment pipeline for changes
Understand the changes and develop observability (monitoring and alert) according to the changes
Develop and conduct resiliency testing solution
Continuous enhancement of monitoring solution
Create and update operation runbooks
Automate operation runbook.
Yêu cầu
Technical Skill
Strong experience with Amazon Web Services
Strong experience and understanding of Kubernetes system
Scripting skills with Python or Bash
Experience in continuous deployment tools
Harness (good to have)
Experience in infrastructure as code (IaC) tools
Terraform
Experience with observability solutions
Prometheus & Grafana
SumoLogic (good to have)
Soft Skills
Good in communication and able to communicate fluently in English
Good problem solving skill
Self-motivated and able to learn fast
Thông tin khác
AWS
Kubernetes
Python
Harness
Grafana
Bash
Terraform
Prometheus
SumoLogic
IaC
Thông tin chung