Position Purpose:
The Sr. Systems Engineer is responsible for independently developing, maintaining, and supporting The Home Depot's technical infrastructure of hardware and system software that drives the success of Home Depot and our customers. As a Systems Engineer II, you will be part of a dynamic team with engineers of all experience levels who help each other build and grow technical and leadership skills while creating, deploying, and supporting production infrastructure. In addition, Sr. Systems Engineers may be involved in routine upgrades and application support as well as root cause and post-mortem analyses around security incidents and service interruptions.
The ideal candidate possess knowledge in the following areas:
Cloud Platform Knowledge: Proficiency in Google Cloud Platform (GCP) is essential, as many discussions revolve around GCP projects, services, and resources. Infrastructure as Code: Understanding of Terraform and Terraform Enterprise (TFE) is crucial, as there are multiple mentions of Terraform-related issues and configurations.Identity and Access Management (IAM): Knowledge of GCP IAM, including service accounts, roles, and permissions, is frequently needed to address user queries.DevOps and CI/CD: Familiarity with DevOps practices and CI/CD pipelines, including tools like GitHub Actions, is important for troubleshooting deployment issues.Networking and Security: Understanding of cloud networking, firewall rules, and security best practices is necessary to assist with configuration and compliance issues.Database Management: Knowledge of various database types, including SQL and NoSQL, is helpful for addressing data-related queries.Scripting and Automation: Proficiency in scripting languages like Python or Bash is valuable for automating tasks and solving user problems efficiently.Monitoring and Observability: Familiarity with tools like Looker for creating dashboards and monitoring cloud resources is beneficial.Cost Optimization: Understanding of FinOps principles and cloud cost management is important, as there are discussions about reducing wasteful cloud spend.Documentation and Knowledge Management: Skills in creating and maintaining technical documentation, such as ADRs (Architecture Decision Records), are valuable for improving the overall knowledge base.Collaboration Tools: Proficiency in using collaboration platforms like Slack, Jira, and Confluence is essential for effective communication and problem-solving.Containerization: Knowledge of container technologies, particularly in the context of GCP, is helpful for addressing deployment and scaling issues.Key Responsibilities:
30% Delivery & Execution - Keeps abreast of innovations and industry trends as well as changes to internal systems and determines how they impacts tools, training, and support necessary to keep systems up, running, and secure; Participates in and contributes to learning activities around modern systems engineering core practices (communities of practice); Proactively views articles, tutorials, and videos to learn about new technologies and best practices being used within other technology organizations15% Learning - Keeps abreast of innovations and industry trends as well as changes to internal systems and determines how they impacts tools, training, and support necessary to keep systems up, running, and secure; Participates in and contributes to learning activities around modern systems engineering core practices (communities of practice); Proactively views articles, tutorials, and videos to learn about new technologies and best practices being used within other technology organizations20% Planning & Analysis - Researches and analyzes business trends and behavioral data to identify opportunities for improvements and new initiatives; Drives the evaluation, development, and recommendation of specific technology to provide cost-effective solutions that meet THD requirements; Researches and designs best fit infrastructure, network, database, cloud, AI, and security architectures for products; Proactively creates and maintains tools for monitoring and support; Participates in project planning and reporting across multiple efforts35% Support & Enablement - Collaborates with product and project teams to understand needs and enable them with infrastructure; Supports technology architecture design review efforts for project and product teams; Leverages tooling and custom applications to monitor the operational status of applications, infrastructure, networks, databases, and security; optimizes and tunes performance as appropriate; Drives root cause analysis, debugging, support, and post-mortem analysis for security incidents and service interruptions; Maintains, upgrades, and supports existing systems and infrastructure to ensure operational stability; Opens and manages vendor problem tickets to resolution; Drives the production of in-house documentation around solutions; Provides application support for software running in production; Drives moving KB articles to infrastructure as code models; Drives keeping monitoring/alerting up to date
Direct Manager/Direct Reports:
Travel Requirements:
Physical Requirements:
Working Conditions:
Minimum Qualifications:
Preferred Qualifications:
Minimum Education:
Preferred Education:
Minimum Years of Work Experience:
Preferred Years of Work Experience:
Minimum Leadership Experience:
Preferred Leadership Experience:
Certifications:
Competencies: