Play a key role in ensuring system reliability at one of the world’s most iconic and largest financial institutions.
As a Lead Site Reliability Engineer at JPMorgan Chase within the Commercial & Investment Bank's Global Banking technology team, you will use technology to solve business problems and leverage software engineering best practices as we strive towards excellence. This role often works independently to execute small to medium projects, but you’ll also have the opportunity to collaborate with cross functional teams to continually improve your level of knowledge about JPMorgan Chase’s business and relevant technologies.
Job responsibilities
Understands service level indicators and utilizes service level objectives to proactively resolve issues before they impact customersSupports the adoption of site reliability engineering best practices within your teamExecutes small to medium projects independently with initial direction and eventually graduates to designing and delivering projects by yourselfLeverages technology to solve business problems by writing high quality, maintainable, and robust code following best practices in software engineeringParticipates in triaging, examining, diagnosing, and resolving incidents and work with others to solve problems at their rootRecognizes the toil within your role and proactively works towards eliminating it through either systems engineering or updating application codeUnderstands observability patterns and strives to implement and improve service level indicators, objectives monitoring, and alerting solutions for optimal transparency and analysisRequired qualifications, capabilities, and skills
Formal training or certification on software engineering concepts and 5+ years applied experience
Minimum 10+ years of SRE Engineering and Cloud experienceExperience with coding, primarily in Python and Java programming languagesExperience maintaining a Cloud-base infrastructure viz. AWS. Experience with Terraform is mandatory.Experience with site reliability concepts, principles, and practicesExperience with observability such as white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, and othersExperience with containers or a common Server OS such as Linux and WindowsKnowledge of emerging software, applications and technical processes within Cloud based and AI/ML based projectsExperience with continuous integration and continuous delivery tools like Jenkins, GitLab, etc.Eagerness to participate in learning opportunities to enhance one’s effectiveness in executing day-to-day project activitiesAbility to demonstrate and apply existing and new system processes, methodologies, and skills to contribute to the development of systems
Preferred qualifications, capabilities, and skills
General knowledge of financial services industryFamiliarity with SRE activities in AI/ML based projectsAbility to work in a large, collaborative team and demonstrates the willingness to vocalize ideas with peers and managersUnderstanding of how to prioritize and adjust work plans to adapt to changes in assigned responsibilities and projects