Pune, IND
48 days ago
Data Engineering & AI Platform Engineer - Quality
**DESCRIPTION** Design, build, and maintain scalable data and AI platforms, including graph-based infrastructure, to enable document intelligence, multimodal information retrieval, and AI-driven validation pipelines across enterprise environments. **Key Responsibilities** + Design and implement scalable data engineering and AI platforms for enterprise environments. + Develop and optimize pipelines for document intelligence, multimodal retrieval, and AI-driven validation. + Build and manage vector database solutions and embedding-based search workflows. + Architect and maintain graph-based systems using Neo4j and GraphRAG for advanced knowledge representation. + Ensure robust MLOps practices, including model lifecycle management and data governance. + Collaborate with cross-functional teams to integrate RAG frameworks and multimodal data sources. + Automate data orchestration and visualization using Python and Streamlit. **RESPONSIBILITIES** **Core Competencies:** + **Collaborates:** Builds strong partnerships and works effectively with others to meet shared objectives. + **Communicates Effectively:** Conveys information clearly across diverse audiences. + **Customer Focus:** Develops strong relationships and delivers customer-centric data solutions. + **Interpersonal Savvy:** Engages openly with diverse teams and perspectives. + **Data Analytics, Mining & Modeling:** Extracts insights, builds models, and drives data-driven decision-making. + **Data Communication & Visualization:** Tells a compelling story through data visuals and narratives. + **Data Literacy & Quality:** Ensures reliable, high-quality data across analytics ecosystems. + **Values Differences:** Recognizes and leverages diverse perspectives for innovation and growth. **Skills & Technical Expertise** + **Programming & Automation:** Advanced Python for data orchestration, automation, and AI workflows; Streamlit for interactive dashboards. + **Data Engineering:** Hands-on experience with Databricks, PySpark, and Azure Data Lake for scalable ETL and data transformation. + **AI & ML:** Expertise in RAG frameworks, multimodal data integration, and embedding pipelines for semantic search. + **Graph Technologies:** Advanced Neo4j graph modeling and GraphRAG for knowledge representation and retrieval. + **Vector Databases:** Building and managing embedding-based search solutions. + **MLOps & Governance:** End-to-end model lifecycle management, deployment, monitoring, and compliance. + **Problem-Solving & Communication:** Strong analytical thinking and ability to collaborate with cross-functional teams. + **Domain Knowledge:** Understanding of manufacturing quality processes (preferred). **QUALIFICATIONS** **Experience:** + **3–6 years** of relevant experience in **data engineering, AI application development, and deployment** . + Proven hands-on experience in **building AI, data, and graph infrastructure** across enterprise environments. + Experience in **data governance** , **model lifecycle management** , and **enterprise analytics enablement** . **Qualifications:** + Bachelor’s or Master’s degree in **Computer Science, Information Technology** , or a **related technical discipline** . + This position may require licensing for compliance with export controls or sanctions regulations. Role Category - **On-site with Flexibility** Dayshift **Job** Quality **Organization** Cummins Inc. **Role Category** Off-site Remote **Job Type** Exempt - Experienced **ReqID** 2421966 **Relocation Package** No **100% On-Site** No
Confirm your E-mail: Send Email