Mô tả công việc
Tóm tắt công việc
ABOUT THE JOB
We are looking for a hands‐on Senior DevOps Engineer with strong expertise in Automation and Managed Monitoring / Enterprise Observability. You will play a key role in designing, building, and operating scalable, reliable, and observable platforms across cloud environments.
MAIN RESPONSIBILITIES
Define/implement tools and processes to standardize and automate the way project software is developed, built, tested, and deployed;
Ensure a homogeneous way of implementing continuous integration and/or continuous delivery across projects;
Assist with the design and development of resilient, secure, supportable, and scalable systems;
Automate infrastructure deployments and rollbacks for all developed work assuming responsibility for process support;
Lead investigations into production incidents with assistance from the development team;
Proactively manage any risks to the production environment;
Continually improve the supportability of our systems by feeding improvements back into the design and development cycles;
Fulfil other tasks as assigned by your People Leader and/or authorized representative of NAB Vietnam from time to time.
Yêu cầu
Must-have
Strong experience with Kubernetes (production); EKS a plus.
Hands‐on experience with AWS (core services, networking, IAM, security best practices).
Infrastructure as Code using Terraform (modules, workspaces, CI/CD integration).
Solid understanding of SRE/AIOps practices (SLOs, error budgets, runbooks, auto‐remediation).
Experience building and maintaining CI/CD pipelines.
OpenTelemetry instrumentation and Collector pipelines (metrics, traces, logs).
Hands‐on experience with Prometheus and Grafana.
Experience with the monitoring stack: Grafana LGTM (Loki, Grafana, Tempo, Mimir), OpenSearch; AppDynamics (optional) - dashboards, alerting, retention.
Kafka (Apache Kafka only): brokers, Connect/Streams, Schema Registry; monitoring consumer lag, throughput, error rates, DLQs (with alerting & dashboards).
Application platforms & deployments: Java (Spring Boot), [protected info] (Express/Nest), React (RUM/synthetics, source maps).
Deployment strategies: blue/green, canary, feature flags; trace‐context propagation.
Experience with infrastructure/application performance testing (stress & load); baselines/benchmarks and regression detection integrated into CI/CD.
Effective English communication skills.
Nice to Have
Experience supporting large‐scale, enterprise environments
Familiarity with multi‐cloud or hybrid cloud architectures
Thông tin khác
DevOps
Kubernetes
Amazon EKS
Java
Load Testing
Performance Testing
Stress Testing
NodeJS
ExpressJS
Grafana
Apache Kafka
AWS
Spring Boot
ReactJS
Terraform
Prometheus
LokiJS
NestJS
AppDynamics
CI/CD
IAM
OpenSearch
Thông tin chung
Cách thức ứng tuyển
Ứng viên nộp hồ sơ trực tuyến bằng cách bấm nút Ứng tuyển bên dưới:
Hạn nộp: 28/04/2026