We have an exciting and rewarding opportunity for you to take your Data Scientist career to the next level.
As the Data Scientist Lead within our Machine Learning and AI team, you will have the opportunity to develop and productionize high-quality machine learning models, services, and platforms that create significant technological and business impact. You will collaborate with cross-functional teams to identify business requirements and develop data-promoted solutions using GenAI technologies. Your role will involve designing scalable data processing pipelines, building and maintaining data lakes, and implementing data visualization solutions to provide actionable insights. You will also champion a DevOps model to support the maturity of the ML development life cycle.
Job Responsibilities:
• Develop and productionize high-quality machine learning models, services, and platforms to create significant technological and business impact.
• Design and implement scalable and reliable data processing pipelines, performing analysis and deriving insights to optimize business outcomes.
• Use LLMs for Generative AI applications, including text generation, classification, and question answering.
• Collaborate with cross-functional teams to identify business requirements and develop data-driven solutions using GenAI technologies.
• Build and maintain data lakes and data processing workflows using Databricks to support machine learning operations.
• Implement data visualization and analytics solutions using ThoughtSpot to provide actionable insights to stakeholders.
• Conduct research on prompt engineering techniques to enhance the performance of LLM-based models.
• Analyze and interpret complex datasets to evaluate model performance and identify areas for improvement.
• Communicate technical concepts and results effectively to both technical and non-technical stakeholders.
• Develop end-to-end ML pipelines for real-time and batch predictions and integrate them with existing applications.
• Champion a DevOps model and support the maturity of the ML development life cycle (MDLC).
Required Qualifications, Capabilities, and Skills:
• Advanced degree in Computer Science, Data Science, Mathematics, or a related field.
• 8+ years of applied experience in data science, machine learning, or related areas.
• Strong programming skills in Python, with experience in machine learning frameworks such as PyTorch or TensorFlow.
• Experience in building and managing data lakes and data processing workflows using Databricks.
• Proficiency in using GenAI models (OpenAI or similar) to solve business problems and in building AI Agents and MCP Servers
• Solid understanding of data structures, algorithms, and machine learning concepts.
• Experience in NLP and deep learning, with recent exposure to prompt engineering on LLMs and with big data technologies (Hadoop, Spark, etc.)
• Hands-on experience with MLOps tools and practices, ensuring seamless integration of models into production environments.
• Experience with Enterprise Cloud infrastructure and monitoring tools like Data Dog,, Splunk, Elasticsearch, and Grafana.
Preferred Qualifications, Capabilities, and Skills:
• Familiarity with data visualization tools like ThoughtSpot.
• Experience in developing APIs and integrating machine learning models into software applications.
• Knowledge of modern development technologies and tools such as Agile, CI/CD, Git, Terraform, and Jenkins.