At CDAO (Chief Data Analytics Office), we drive our firm’s strategic investments in AI/ML and data-oriented tools and capabilities. Our Platform Engineering team builds innovative platforms, automates infrastructure operations, and enables agentic AIOps platforms. Our mission is to enhance scalability, security, and reliability for CDAO-hosted managed services.
As a Machine Learning Engineer within our platform operations team, you will design, build, and maintain our AIOps solution. This role requires deep knowledge of AI/ML technologies, IT infrastructure, and platform engineering.
Job Responsibilities:
Design, develop, and maintain software applications with integrated AI/ML capabilities, with a key focus on AIOps.
Collaborate with cross-functional teams to gather and analyze requirements, translating them into technical solutions.
Develop and implement software architecture and design patterns to ensure scalability and performance.
Write clean, maintainable, and efficient code in one or more general-purpose programming languages: Python, Java, C, C++, Go.
Implement and manage data pipelines to preprocess and transform data for AI/ML models.
Integrate AI/ML models into software applications and ensure seamless deployment into production environments.
Optimize software applications for performance, reliability, and scalability.
Conduct code reviews and provide technical guidance to junior developers.
Stay up to date with the latest advancements in software engineering and AI/ML technologies.
Ensure adherence to software development best practices, including agile and lean methodologies.
Apply SRE principles to enhance system reliability, performance, and availability.
Implement monitoring and alerting solutions to proactively identify and resolve issues.
Required Qualifications, Capabilities, and Skills:
Bachelor’s degree in Computer Science or equivalent practical experience.
Strong experience working with multiple technology teams in a matrixed organization.
7+ years of experience as a software developer with a focus on integrating AI/ML solutions.
Strong programming skills in Python, with experience in developing and maintaining production-level code.
Experience with application architecture and design patterns.
Proficiency in working with large datasets and data preprocessing.
Solid understanding of AI/ML algorithms and techniques, including deep learning and natural language processing.
Familiarity with AI/ML libraries and frameworks, such as TensorFlow, PyTorch, scikit-learn, and Keras.
Experience with cloud platforms, such as AWS or Azure, for deploying and scaling software applications.
Strong problem-solving and analytical skills, along with excellent communication and collaboration skills.
Preferred Qualifications, Capabilities, and Skills:
Familiarity with DevOps practices for software deployment and monitoring.
Experience with SQL and NoSQL databases, Linux/Unix environments, and tools like Terraform and Apache Kafka.
Experience with distributed computing frameworks, such as Apache Spark.
Experience in implementing SRE practices to improve system reliability and performance.