Mô tả công việc
MỤC ĐÍCH CÔNG VIỆC
• Lead the team to improve scalability, reliability, and cost-efficiency of the Data Platform.
• Design, build, and deploy data pipelines (batch & streaming) using Spark orchestrated via Airflow.
• Develop libraries and frameworks for data ingestion, transformation, and governance with clean architecture principles.
• Collaborate with Data Architects to design/review data models, enforce data contracts, and maintain schema governance.
• Optimize performance with partitioning, caching, Z-Ordering, and metadata management in lakehouse environments (Delta/Iceberg/Hudi).
• Ensure security and compliance: IAM, encryption, secrets management, and GDPR/CCPA adherence.
• Drive CI/CD for data workflows, IaC (Terraform), and container orchestration (Kubernetes).
• Monitor SLOs/SLAs, implement alerting, and lead incident responses and postmortems.
• Design and operate end-to-end ML/LLM pipelines: data prep, training, evaluation, and deployment.
• Build RAG architectures, vector search, and embedding pipelines for LLM-based applications.
Yêu cầu
• Bachelor's or Master's degree in Computer Science,
Software Engineering, Information Technology, or a related technical field
• English is required
• Have 5+ years of experience as a AI Engineer
• Have experience in Cloud (AWS/Azure/GCP)
• Experience in AI and LLM technologies, including prompt engineering, embeddings, and retrieval-augmented generation (RAG).
• Hands-on experience with vector databases (ChromaDB, Vector Search) and LLMOps practices.
• Experience with Databricks (Delta Lake, Unity Catalog, Delta Live Tables) or similar lakehouse technologies is a strong plus.
• Proven ability in performance tuning and optimization for Big Data workloads (Spark/Flink, partitioning, shuffle strategies, caching).
• Familiarity with modern data transformation frameworks (dbt).
• Extremely proficient in at least 1 programming language (Python/Scala/Java)
• Strong experience in systems architecture - particularly in complex, scalable, and fault tolerant distributed systems
• Good at multi-threading, atomic operations, computation framework: Spark (DataFrame, SQL, ...), distributed storage, distributed computing
• Understand designs of resilience, fault-tolerance, high availability, and high scalability, ...
• Tools: CI/CD, Gitlab, ...
• Good at communication & team working
• Being open-minded, willing to learn new things
Quyền lợi
Khác
Theo quy định của Công ty
Thông tin khác
NGÀY ĐĂNG
29/05/2026
CẤP BẬC
Nhân viên
NGÀNH NGHỀ
Công Nghệ Thông Tin/Viễn Thông > Data Engineer/Data Analyst/AI
KỸ NĂNG
AI Engineer, AI Technology, Vector Database, Python, Cloud
LĨNH VỰC
Bảo hiểm
NGÔN NGỮ TRÌNH BÀY HỒ SƠ
Bất kỳ
SỐ NĂM KINH NGHIỆM TỐI THIỂU
5
QUỐC TỊCH
Không giới hạn
Xem thêm
Thông tin chung
Nơi làm việc
Cách thức ứng tuyển
Ứng viên nộp hồ sơ trực tuyến bằng cách bấm nút Ứng tuyển bên dưới:
Hạn nộp: 29/06/2026