Top 3 Reasons To Join Us
Competitive salary
Premium health care package
7 working hours/day
The Job
Introduction
With over a decade of experience in IT and fintech, Blue Belt has become a leading software development company, delivering innovative technology solutions to a diverse global clientele. We specialize in developing web, mobile, payment, and blockchain applications that offer seamless user experiences. Headquartered in Tokyo, Japan, with a state-of-the-art Technology Hub in Hanoi, Vietnam, Blue Belt operates in more than ten countries, including Japan, Thailand, Indonesia, the Philippines, Malaysia, Taiwan, and Brazil. Our team of over 200 professionals brings a wealth of expertise to drive our global operations.
We are looking for a highly skilled DevOps Engineer to join our team, with a focus on deploying, scaling, and maintaining infrastructure for conversational AI and chatbot systems. You will work closely with AI engineers, software developers, and product teams to automate workflows, ensure high availability, and optimize performance for AI-driven applications.
Job Description
Infrastructure Automation & CI/CD
- Design, implement, and maintain CI/CD pipelines for chatbot and AI services.
- Automate environment provisioning using tools like Terraform, Ansible, or Pulumi.
- Integrate testing and deployment workflows to support agile delivery cycles.
Cloud Infrastructure Management
- Build and manage infrastructure on AWS, tailored for AI workloads.
- Implement secure and scalable architectures for real-time chatbot interactions.
Monitoring, Logging & Incident Management
- Set up centralized logging using EFK (Elasticsearch, Fluent Bit, Kibana).
- Set up monitoring tools (Prometheus, Grafana, ELK, or Datadog) for proactive alerting.
- Define and enforce SLOs/SLAs for chatbot uptime and response time.
- Lead incident response and root cause analysis for system failures.
Security & Compliance
- Ensure best practices in infrastructure security (IAM, VPC, secrets management).
- Support compliance efforts for data protection (GDPR, SOC2) in chatbot data pipelines.
- Perform ad-hoc DevOps tasks as required, including emergency patches, incident support, or rapid deployment of security updates.
AI Model Deployment
- Collaborate with teams to containerize and deploy NLP models (e.g., with Docker, Kubernetes).
- Manage GPU/TPU workloads, including dynamic scaling and resource optimization.
- Monitor model inference performance and latency across staging and production environments.
- Optimize cost, compute, and storage strategies for high-volume inference and training.
Your Skills and Experience
Key requirements for this position include:
- At least 5 years of experience in a Network/System Engineer position.
- Bachelor's degree in Computer Science, Information Technology, or another technical field, preferably from a top university specializing in Information Technology.
- Knowledge of security concepts related to DNS, routing, authentication, VPN, proxy services, and DDoS mitigation technologies.
- Experience designing and implementing networks is required; experience with HA patterns is a big advantage.
- Knowledge of and experience with AWS (VPC, EC2, EKS, RDS, MSK, OpenSearch, ElastiCache, SES...).
- Experience with EKS and K8s, and the ability to write Helm charts.
- Experience with MySQL and PostgreSQL databases.
- Experience with OS hardening and troubleshooting.
- Experience with Linux distributions such as CentOS and Ubuntu.
- Experience with ActiveMQ, Redis, and Memcached.
- Experience with monitoring, logging, and alerting tools.
- Experience with CI/CD tools such as Jenkins and GitLab.
- Experience with API gateways and load balancing.
- Familiarity with configuring and operating Nginx/Nginx Ingress/Apache.
Plus:
- Experience supporting LLM/chatbot-based products in production.
- Knowledge of GCP or Azure.
- Experience with Terraform/Terragrunt and Ansible.
- Knowledge of and experience with Elasticsearch and Kafka.
- Knowledge of and experience with Postfix, FTP servers, and other services.
- Knowledge of security, vulnerability checking, and fixing/updating the OS and
Personal requirements
- Hardworking and responsible, with strong interpersonal and communication skills.
- Able to learn and grasp new business domains quickly.
- Able to work both independently and in a team, and to work under high pressure.
- Ready to work overtime.
Why You'll Love Working Here
- Salary: open to negotiation
- Working hours: 9:00-17:00 (5 days per week); break time: 12:00-13:00
- 100% of offered salary during the probation period
- Modern working equipment
- Salary review: twice a year, based on the employee's performance and contribution
- Insurance package as stipulated by the Labor Code
- Premium PVI Health Insurance Package for all members
- Transportation allowance and free parking
- Annual technical seminars and workshops
- Free snacks, coffee, and tea
- A variety of corporate events: weekly tea breaks, monthly birthday parties, quarterly team building, New Year party, company trip, etc.
- Friendly, open, and fast-paced environment where every idea is welcome
- Other benefits as stated in Vietnamese Labor Law