Lead SRE - AWS, Terraform
JP Morgan
As a Lead Site Reliability Engineer within the CIB Markets Sales, Research and Data organization at JPMorgan Chase, you will play a pivotal leadership role in your team. You will leverage your strong knowledge across multiple technical domains to advise and support software engineering teams globally. Your hands-on role will involve migrating and managing applications in the public cloud, promoting SRE principles and practices. You will work on initiatives such as unified telemetry, application and infrastructure modernization, SLO and SLI onboarding, advanced deployment strategies, and performance and scalability improvements, all aimed at reducing operational risks and enhancing our products.
You will have public cloud expertise and ability to take the lead in defining observability practices to ensure stability, performance and fast recovery while making it easier for our teams to build a better products for our clients. You ensure requirements are accounted for in your products’ design, test service level indicators for effectiveness and customer experience, and drive implementation to production.
Job responsibilities
Design, code, test, and deliver software to automate manual operational work, including self-healing and resiliency patterns for public cloud services and engineering teams.Defining and implementing a telemetry strategy, including rollout of APM (application performance monitoring) and cloud telemetryYou are a culture carrier and adoption site reliability champion for your team by demonstrating site reliability principles and practices every day and mentoring technologists within the organizationTroubleshoot priority and escalation incidents, facilitate blameless post-mortems and ensure permanent closure of incidents and subsequent problem tasks.Engage and evangelize with development team throughout their SDLC to develop software for reliability and scale, ensuring minimal refactoring or changesIdentify application patterns and analytics in support of better service level objectivesDesign automated software and product upgrades, change management, and release management solutions.Provides comprehensive and ongoing guidance, tools, and solutions to support the firms’ growthWorks towards becoming an expert on the applications and platforms in your remit by understanding its interdependencies and limitations and driving to evolve and debug the critical components of it
Required qualifications, capabilities and skills
Preferred qualifications, capabilities and skills
Experience defining non-functional standards and blueprints related to supportability – logging, alerting, resiliency patterns, etc.Working knowledge of infrastructure components (e.g. routers, load balancers, cloud products, container systems, compute, storage, and networks)Ability to partner with and influence architecture teams in defining non-functional application supportability standardsProven leadership skills with drive for continuous improvementAWS Cloud Certification, Linux Foundation CKA/CKAD, Terraform Associate and other relevant certifications are a plus
Confirm your E-mail: Send Email
All Jobs from JP Morgan