USA - Remote
2 days ago
Engineering Manager – OC Foundation Infrastructure Services

At Netflix, our mission is to entertain the world. Together, we are writing the next episode - pushing the boundaries of storytelling and global fandom, and making the unimaginable a reality. We are a dream team obsessed with the uncomfortable excitement of discovering what happens when you merge creativity, intuition and cutting-edge technology. Come be a part of what’s next.

Open Connect (OC) is a critical group within Netflix that builds and manages a content delivery network (CDN) that delivers all of Netflix’s streaming video. In addition to streaming video, we work on projects within Netflix that leverage physical infrastructure, such as our Cloud Games and Netflix Live and Ads initiatives. According to a 2023 Sandvine report, data delivered by Open Connect accounts for approximately 15% of all downstream traffic volume across the entire internet. All of this traffic is delivered by our edge cache servers.

A small team of talented software engineers develops and maintains the operating system that runs the content caches, which deliver Netflix Live, Video-On-Demand, and Ads and provide the foundation of the cloud gaming platform. Because of the important role of these servers, it is critical that they be efficient, reliable and secure.

We are looking for an Engineering Manager to lead the effort to build and operate the CI/CD pipelines, automated validation, load testing, and lab infrastructure that support our Linux- and FreeBSD-based edge platforms, spanning the kernel, OS, firmware/BMC, BIOS, and platform security layers.

In this role, you will have the opportunity to shape a critical part of Netflix’s infrastructure. Your work will be central to keeping our edge fleet secure, performant, and scalable as we expand Streaming, Live, Ads, and Cloud Gaming. 

We provide the freedom to execute, learn and pivot, and the responsibility to be self-directed, collaborative and insightful. You can read more about Netflix culture.

In this role, you will:

Lead and grow a team that combines a highly technical CI/CD and validation group with an operations-focused lab group, blending deep systems expertise with developing talent to support our Linux/FreeBSD-based edge platforms.

Own and evolve scalable CI/CD pipelines and automated qualification with clear release/rollback gates, comprehensive testing, and strong observability.

Evolve and operate test and load infrastructure that runs production-like Live, VOD, Ads, and Cloud Gaming workloads and surfaces platform contention.

Manage lab operations and capacity, including hardware provisioning, inventory and ticket workflows, contractor coordination, remote access, and efficient use of lab resources.

Streamline the lab’s security architecture so that access controls, isolation mechanisms, and workflows are tightly aligned with platform security standards and compliance requirements.

Define processes and tooling so lab operations can reliably support CI/CD and validation needs, bringing order and prioritization to a high-volume, often ambiguous request stream.

Partner closely with OS, hardware, application, and security teams to align validation and lab support with their roadmaps, integrate functional and security guardrails into CI/CD, and debug complex, large-scale regressions.

Lead the use of AI to develop tools and analysis systems that improve developer productivity, operational triage, and the efficiency of CI/CD, validation, and lab workflows.

Partner in the bring-up and evaluation of new devices from CPU, GPU, and other hardware vendors in the lab, enabling analysis, validation, and integration into our edge platforms.

Drive engineering excellence by setting technical standards, defining metrics and SLOs, and using data and postmortems to continually improve systems, practices, and lab operations.

Qualifications:

Experience as an Engineering Manager and a technical leader for highly technical teams in areas such as infrastructure, platform, systems, CI/CD and release engineering.

Excellent communication skills and the ability to translate between deep technical detail and clear business impact.

Deep hands-on background in systems software, with meaningful experience in low-level OS development or administration on Linux and/or FreeBSD.

Strong experience with CI/CD and test automation, including designing and operating pipelines for complex multi-component systems and using tools such as GitHub Actions, Jenkins, or similar CI systems.

Proven ability to design, build, and operate services that support CI/CD and test automation, delivering reliable, observable, scalable systems and efficient use of compute and hardware resources.

Demonstrated success leading through influence, working cross-functionally with OS, hardware, and security teams, and driving alignment and roadmaps across multiple groups.

You will be successful in this role if you:

Enjoy operating at the intersection of hardware, OS, and platform security, and see infrastructure as a force multiplier.

Can define, scope, and drive cross-functional initiatives from ambiguous problem statements through iterative delivery.

Care deeply about developer experience, and are motivated by enabling other engineers to move faster and more safely.

Are comfortable making pragmatic trade-offs between speed, safety, and complexity in CI/CD and validation.

Learn quickly and are energized by ramping up on unfamiliar parts of a complex stack.

Nice to have:

Experience with:

Hardware lab automation, PXE boot, imaging, remote power control, serial access, and rack automation.

Validating BIOS and firmware behavior and managing firmware rollouts.

Boot, networking, and storage internals for Linux and/or FreeBSD

Familiarity with:

Performance tooling such as perf, flamegraphs, bpftrace/eBPF, DTrace, fio, and network benchmarking tools

Using AI tools for operational triage (log clustering, anomaly detection) with clear guardrails and fallbacks

Incident response practices, including postmortems and preventative engineering actions

Contributions to or collaboration with open-source communities

 

Generally, our compensation structure consists solely of an annual salary; we do not have bonuses. You choose each year how much of your compensation you want in salary versus stock options. To determine your personal top-of-market compensation, we rely on market indicators and consider your specific job family, background, skills, and experience to place your compensation within the market range. The range for this role is $388,000.00 - $619,000.00. This compensation range will vary based on location.

Netflix provides comprehensive benefits including Health Plans, Mental Health support, a 401(k) Retirement Plan with employer match, Stock Option Program, Disability Programs, Health Savings and Flexible Spending Accounts, Family-forming benefits, and Life and Serious Injury Benefits. We also offer paid leave of absence programs. Full-time hourly employees accrue 35 days annually for paid time off to be used for vacation, holidays, and sick paid time off. Full-time salaried employees are immediately entitled to flexible time off. See more details about our Benefits here.

Netflix is a unique culture and environment. Learn more here.

Inclusion is a Netflix value and we strive to host a meaningful interview experience for all candidates. If you want an accommodation/adjustment for a disability or any other reason during the hiring process, please send a request to your recruiting partner.

We are an equal-opportunity employer and celebrate diversity, recognizing that diversity builds stronger teams. We approach diversity and inclusion seriously and thoughtfully. We do not discriminate on the basis of race, religion, color, ancestry, national origin, caste, sex, sexual orientation, gender, gender identity or expression, age, disability, medical condition, pregnancy, genetic makeup, marital status, or military service.

Job is open for no less than 7 days and will be removed when the position is filled.
