This position is no longer accepting applications.
Job Description
Big Data Mining: Extract and mine large-scale datasets from major e-commerce platforms in Vietnam, China, Korea, Southeast Asia, and other markets.
Data Processing: Clean and transform raw data into structured formats suitable for analytics and machine learning.
Data Infrastructure: Build automated pipelines and cloud solutions (e.g., AWS, GCP).
Data Integration and Management: Develop data warehouses and data lakes for optimal data storage and retrieval.
LLM Data Pipeline: Develop pipelines for Large Language Models (LLMs), including retrieval-augmented generation (RAG), LangChain, or LangGraph.
Visualization: Create visualizations and reports to communicate insights effectively.
Requirements
Education: Final year student or fresh graduate in Computer Science, Data Science, Information Technology, or related fields
Proficient in Python, with experience using Pandas, PySpark, or similar libraries
Experience with web scraping tools (e.g., BeautifulSoup, Scrapy, Selenium)
Understanding of data architecture: warehouses, lakes, and cloud storage
Familiarity with ETL/ELT tools (e.g., Apache Airflow) and SQL
Basic knowledge of web structures (HTML/CSS/JS) is a plus
Soft Skills: Strong problem-solving skills, attention to detail, and a passion for data engineering.
Communication: Good communication skills in both Vietnamese and English.
Benefits
Hands-on experience with web data mining and large-scale real-world datasets
Hands-on experience building automated data pipelines with Python, PySpark, and Airflow
General Information
- Application deadline: [protected info]
- Salary: Negotiable