We're hiring a Senior Data Scientist to lead the design, development, and deployment of advanced machine learning and AI-driven solutions across AWS and GCP. You'll own end-to-end model lifecycle - from data exploration to production - and collaborate closely with data engineering teams to ensure
scalability and reliability.
This role also involves mentoring data scientists, driving MLOps best practices, and supporting presales and proposals through solution design, estimation, and technical presentations to customers. Experience with AIOps/MLOps, data observability, and AI platform optimization is a strong plus
Modeling & Solution Design
Develop, train, and optimize machine learning and statistical models for predictive analytics, recommendation systems, forecasting, and anomaly detection.
Translate business problems into analytical solutions and communicate insights to non-technical stakeholders.
Design robust feature engineering, data preprocessing, and model validation frameworks.
Select appropriate ML algorithms and technologies for scalability, interpretability, and cost efficiency.
AI Platform & MLOps
Collaborate with Data Engineering teams to operationalize models in production environments.
Implement MLOps pipelines for model versioning, CI/CD, monitoring, and retraining (using tools such as SageMaker, Vertex AI, or MLflow).
Define and track model performance metrics, data drift, and retraining triggers.
Integrate observability and AIOps tools for proactive issue detection and system resilience.
Team Leadership
Mentor junior data scientists and analysts through code reviews, modeling best practices, and project guidance.
Drive model governance, documentation, and experimentation culture across the team.
Partner with engineering and
business leaders to define data science strategy and delivery roadmap.
Customer & Presales Engagement
Work closely with business and sales teams to understand customer challenges and translate them into AI/ML solutions.
Support proposal creation, including architecture options, effort estimation, and value articulation.
Deliver presentations, demos, and technical discussions to help clients understand model capabilities and outcomes.
Research & Innovation
Explore and evaluate emerging trends in AI/ML, generative AI, and LLMs to guide platform evolution.
Prototype and benchmark new models or frameworks to improve performance, cost, and explainability.
Contribute to reusable assets, accelerators, and internal knowledge sharing.
4+ years in Data Science or Applied Machine Learning, including in a senior or team lead role.
Strong hands-on experience across both AWS and GCP ecosystems:
AWS: S3, SageMaker, Glue/EMR, Lambda, Step Functions, Athena, Redshift.
GCP: BigQuery, Vertex AI, Dataflow, Dataproc, Composer (Airflow), Pub/Sub.
Proficiency in Python, including libraries such as pandas, NumPy, scikit-learn, TensorFlow, PyTorch, and XGBoost.
Strong foundation in statistics, experimentation (A/B testing), and hypothesis validation.
Skilled in SQL and data manipulation for large-scale analytics.
Familiarity with MLOps tools (MLflow, Kubeflow, SageMaker Pipelines, Vertex Pipelines).
Understanding of data modeling, feature stores, version control, and reproducible experiments.
Experience with IaC (Terraform/CloudFormation) and containerization (Docker/Kubernetes) is a plus.
Knowledge of data governance, privacy, and compliance (GDPR-style controls, PII management).
Good command of English
Strong Pluses
Experience with Databricks, Snowflake, or other unified data platforms.
Knowledge of AIOps, data observability, and quality monitoring tools (Great Expectations, Monte Carlo, Soda).
Proven track record in AI solution presales, technical proposals, and executive-level communication.
Exposure to LLM-based solutions, generative AI, or RAG (Retrieval-Augmented Generation) architectures.
Domain experience: biotechnology, bioinformatics, genomics, or healthcare data analytics.