Hong Kong
1 day ago
Technology Support III, Collateral Site Reliability Engineering

Propel operational success with your expertise in technology support and a commitment to continuous improvement. 
As a Technology Support III team member in Securities Services Technology, you will ensure the operational stability, availability, and performance of our production application flows. Encourage a culture of continuous improvement as you troubleshoot, maintain, identify, escalate, and resolve production service interruptions for all internally and externally developed systems, leading to a seamless user experience. 

 

 

Job responsibilities

•    Provide business facing technology support to business and operations groups across the Asia Pacific region.
•    Work within a follow-the-sun support model with global counterparts. 
•    Manage production technology incidents to resolution, ensuring timely engagement, escalation and effective communication to business, technology, and vendor partners.
•    Perform post incident analysis, identifying, tracking and implementing preventative measures.
•    Act as Subject Matter Expert (SME) for key applications, responsible for maintaining global best practice and hygiene standards.
•    Assist in the monitoring of production environments for anomalies and address issues utilizing standard observability tools.
•    Act as a key contributor in the continued development of tools, frameworks & techniques to improve productivity and quality of the production support, adopting SRE principles to manage and support the environment. 
•    Analyze complex situations and trends to anticipate and solve incident, problem, and change management in support of full stack technology systems, applications, or infrastructure.

Required qualifications, capabilities, and skills 

•    6+ years previous experience in Technology for Financial and Banking sector, with expertise in troubleshooting, resolving, and maintaining information technology services.
•    Bachelor's degree in Engineering, Computer Science, or Information Technology.
•    Proven track record of Production Support & Site Reliability Engineer (SRE): A clear understanding of SRE protocols and methodologies 
•    Familiar with observability, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, and others.
•    Support Management skills: design and use monitoring dashboards for day-to-day support, generate service KPIs, report on service stability & performance and log monitoring. 
•    Proven track record of running Incident & Problem Management calls for business impacting outages, performing post incident analysis, identifying & implementing preventative measures and lessons learned following outages. 
•    Excellent interpersonal relationship and communication skills, along with strong analytical and problem-solving skills. 
•    Able to drive issue resolution across different support teams. 
•    Experience in debugging and maintaining applications in a large corporate environment with one or more modern programming languages and database querying languages.
 

Preferred qualifications, capabilities, and skills 

•    Passion for learning new technologies and driving innovative solutions.
•    Experience with Kubernetes for container orchestration. 
•    Scripting languages: Perl, Python, Linux/UNIX shell and Database: Oracle, MS-SQL, PostgreSQL, no SQL DB (Casandra)
•    Telemetry & Application Performance monitoring tools such as: Splunk, AppDynamics, Dynatrace, Grafana, ITRS Geneos 
•    Experience with core AWS services such as: EC2, S3, EKS, RDS, Cloudwatch (& DataDog) 
•    Exposure to agile methodologies such as Continuous Integration (CI) and Continuous Delivery (CD) tools like Jenkins and Terraform
•    Experience in technology disaster recovery planning and test execution and prior experience in JAVA development is a plus. 

 

Confirm your E-mail: Send Email