Site Reliability EngineerCông Ty TNHH Digital Power Media
Hình thức: Toàn thời gian
Ngày đăng: 31/07/2024
Hạn nộp: 31/08/2024
Vị trí công việc này hiện tại đã hết hạn nộp hồ sơ, bạn có thể tham khảo thêm một số công việc liên quan phía dưới
Mô tả công việc
We are actively looking for a highly skilled Site Reliability Engineer to become a valuable member of our team. The ideal candidate will play a key role in engineering efforts, contributing from design to implementation, and addressing intricate technical challenges related to developer and engineering productivity and velocity.
In the position of Site Reliability Engineer, you will be responsible for crafting and implementing robust and scalable infrastructure and services utilized by our development team.
Responsibilities:
Define and manage cloud infrastructure as code (IAC), improve the CI/CD pipeline, ensure scalability and availability of the system, build monitoring stack, automate build, configuration, and deployment orchestration scripts...
We are utilizing a broad toolset for DevOps/Infra like Docker, Helm, Github Actions... and leveraging multiple services from AWS: EC2, ECS, S3, CloudFront, RDS, IAM, Route53, CloudWatch...
Minimize incident impacts by being informed upfront with monitoring, alerts, logs and metrics and having an eye on IT standards and security.
Responsible for zero-downtime of the system for millions of users.
To work with the engineering team and take architectural decisions.
Support troubleshooting efforts during incidents, applying root cause analysis to prevent recurrences.
In the position of Site Reliability Engineer, you will be responsible for crafting and implementing robust and scalable infrastructure and services utilized by our development team.
Responsibilities:
Define and manage cloud infrastructure as code (IAC), improve the CI/CD pipeline, ensure scalability and availability of the system, build monitoring stack, automate build, configuration, and deployment orchestration scripts...
We are utilizing a broad toolset for DevOps/Infra like Docker, Helm, Github Actions... and leveraging multiple services from AWS: EC2, ECS, S3, CloudFront, RDS, IAM, Route53, CloudWatch...
Minimize incident impacts by being informed upfront with monitoring, alerts, logs and metrics and having an eye on IT standards and security.
Responsible for zero-downtime of the system for millions of users.
To work with the engineering team and take architectural decisions.
Support troubleshooting efforts during incidents, applying root cause analysis to prevent recurrences.
Yêu cầu
Your skills and experience:
At least 4 years' experience in the same position SRE with AWS technologies and services.
Understanding the well-architected framework of AWS to build and optimize systems on the AWS.
Experience with AWS: IAM, EC2, ALB, S3, ECS, Cloudwatch, CloudFormation....
In-depth knowledge of Kubernetes, including its architecture, deployment, and management, with a focus on CI/CD for web applications.
Experience in Terraform & Infrastructure as Code (IAC) principles.
Have an in-depth understanding of microservice architecture, API management, and distributed systems concepts.
Proficiency in implementing monitoring and alerting solutions (e.g., Grafana, Prometheus, ELK stack) to ensure optimal system performance and availability.
Familiar with Linux/ Unix Administration and scripting using shell scripts.
Experience with containerization and orchestration tools (Docker Kubernetes).
Strong understanding of command-line tools and distributed version control system such as GIT.
Mentality to share and the aspiration to constantly improve yourself and learn new things.
Self-driven, proactive.
Excellent problem-solving skills, with the ability to analyze and resolve complex technical issues efficiently.
The ability to work under pressure.
Nice to have:
AWS Certified Solutions Architect - Professional or AWS Certified DevOps Engineer - Professional certification is highly desirable.
Understanding of security best practices in web development.
Knowledge of best practices and IT operations in an always-up, always-available service
Should be able to design high-level/low-level network/architecture and properly document.
At least 4 years' experience in the same position SRE with AWS technologies and services.
Understanding the well-architected framework of AWS to build and optimize systems on the AWS.
Experience with AWS: IAM, EC2, ALB, S3, ECS, Cloudwatch, CloudFormation....
In-depth knowledge of Kubernetes, including its architecture, deployment, and management, with a focus on CI/CD for web applications.
Experience in Terraform & Infrastructure as Code (IAC) principles.
Have an in-depth understanding of microservice architecture, API management, and distributed systems concepts.
Proficiency in implementing monitoring and alerting solutions (e.g., Grafana, Prometheus, ELK stack) to ensure optimal system performance and availability.
Familiar with Linux/ Unix Administration and scripting using shell scripts.
Experience with containerization and orchestration tools (Docker Kubernetes).
Strong understanding of command-line tools and distributed version control system such as GIT.
Mentality to share and the aspiration to constantly improve yourself and learn new things.
Self-driven, proactive.
Excellent problem-solving skills, with the ability to analyze and resolve complex technical issues efficiently.
The ability to work under pressure.
Nice to have:
AWS Certified Solutions Architect - Professional or AWS Certified DevOps Engineer - Professional certification is highly desirable.
Understanding of security best practices in web development.
Knowledge of best practices and IT operations in an always-up, always-available service
Should be able to design high-level/low-level network/architecture and properly document.
Quyền lợi
Competitive Salary
5 working days/ week
Provided with a Mac book/Screen
Attractive benefits for team activities (team building, Happy Friday, Happy Hour..)
Comfortable work space and friendly colleagues
5 working days/ week
Provided with a Mac book/Screen
Attractive benefits for team activities (team building, Happy Friday, Happy Hour..)
Comfortable work space and friendly colleagues
Thông tin khác
Cấp bậc
Nhân viên
Kinh nghiệm
4 năm
Số lượng tuyển
2 người
Hình thức làm việc
Toàn thời gian
Giới tính
Không yêu cầu
Nhân viên
Kinh nghiệm
4 năm
Số lượng tuyển
2 người
Hình thức làm việc
Toàn thời gian
Giới tính
Không yêu cầu
Giới thiệu công ty
Việc làm tương tự
Senior Back-End Developer (Python, Devops)
Công Ty Trách Nhiệm Hữu Hạn Giải Pháp Toàn Cầu Iij Việt Nam
Thỏa thuận
Hồ Chí Minh
29/10/2024
Senior Golang Developer English required, DevOps
CÔNG TY TNHH MỘT THÀNH VIÊN EKINO VIETNAM
Attractive !
Hồ Chí Minh
29/10/2024
VIỆC LÀM SENIOR ANDROID DEVELOPER tại TPHCM - Tps Software
Công ty Cổ phần Phần mềm TPS
13,000,000 - 16,000,000 VNĐ 0 VNĐ
Hồ Chí Minh
25/10/2024
TUYỂN DỤNG JAVA DEVELOPER tại TPHCM - Tps Software
Công ty Cổ phần Phần mềm TPS
14,000,000 - 18,000,000 VNĐ 0 VNĐ
Hồ Chí Minh
25/10/2024
VIỆC LÀM REACT NATIVE DEVELOPER tại TPHCM - Tps Software
Công ty Cổ phần Phần mềm TPS
14,000,000 - 16,000,000 VNĐ 0 VNĐ
Hồ Chí Minh
25/10/2024
Vị trí Site Reliability Engineer do công ty Công Ty TNHH Digital Power Media tuyển dụng tại Hồ Chí Minh, Joboko tự động tổng hợp mức lương Tới 2,500 USD, tìm thêm việc làm về Site Reliability Engineer hoặc công ty Công Ty TNHH Digital Power Media ở các link phía trên
Giới thiệu công ty