Netflix is one of the world's leading entertainment services with 278 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can change their plans at any time.
The Role
How do you spark joy in hundreds of millions of people? It starts with a vision - that technology can give voice to stories around the world. In delivering those much-loved stories, Netflix is responsible for a significant portion of global internet traffic.
To steward that responsibility, we work collaboratively with ISPs to deployOpen Connect, Netflix's Content Delivery Network (CDN),our in-house custom-built network and server infrastructure responsible fordelivering 100% of Netflix's video traffic.
We strive to deliver a great Netflix viewing experience in over 190 countries so our customers can watch whatever, whenever, interruption-free.
We are seeking a Reliability Engineer with extensive experience in *nix, networking, data analysis, and large-scale platform operations experience to design, scale, operate, automate, and analyze our globally distributed CDN. Come join us and play a meaningful role in our journey to entertain the world!
Responsibilities
Drive continual improvement in resiliency, observability, monitoring, instrumentation, and automation with the primary goal to maintain a highly scalable and reliable CDN platform worldwide.
Aggregate, analyze, and correlate large amounts of server and application performance data. Use the innovative Netflix Big Data platform as a highly flexible, specialized and efficient toolset to identify opportunities for platform optimization, system reliability improvements as well as identifying patterns/anomalies for further investigation.
Provide technical design and engineering assistance to ISP partners to integrate our Open Connect Appliances.
Handle Tier 3 escalation and participate in an on-call rotation for the CDN platform production issues.
Qualifications
3+ years Service Reliability/Operational experience running large scale, high performance systems & internet services with focus on performance and reliability.
Preferred - B.S. in Computer Science, Electrical or Computer Engineering (or equivalent professional experience)
Strong working knowledge of networking concepts and application protocols, especially TCP/IP, BGP, DNS, TLS, and HTTP/S with focused experience on CDNs and HTTP cache/proxy technologies.
Skilled in designing, creating and maintaining automation written in a programming language such as Python.
Expert-level knowledge managing and debugging Unix/Linux systems (engineering fundamentals, networking, storage, operating systems) at scale.
Experience with distributed analytic processing technologies (Hive, Presto/Trino, Spark SQL, etc)
Strong understanding of applied statistics and the ability to code systems that identify outlier behavior in large systems.
Some experience with container and container orchestration technologies (Docker, Kubernetes).
Ability to work in a highly collaborative environment and to communicate cross functionally with internal and external partners.
Things that show how we think
FreeBSD optimization used by Netflix to serve 800 Gb/s from a single server
Resiliency Practices in Managing CDN
Measuring Real-Life Latency of the Internet: A Netflix Story
Mastering Near-Real Time Telemetry and Big Data
Does this sound interesting? Or does this sound interesting but intimidating? Please don't self-select; let's figure it out together. We'd love to talk to you!
Netflix is a global company with a diverse member base, which is why the content we produce reflects global perspectives and global stories. As we grow globally, we must have the most talented employees with diverse backgrounds, cultures, perspectives, and experiences to support our innovation and creativity. We are an equal opportunity employer and strive to build balanced teams from all walks of life.
Our culture is unique, and we tend to live by our values, so it's worth learning more about Netflixhere.
At Netflix, we carefully consider a wide range of compensation factors to determine your personal top of market. We rely on market indicators to determine compensation and consider your specific job, skills, and experience to get it right. These considerations can cause your compensation to vary and will also be dependent on your location. The overall market range for roles in this area of Netflix is typically $100,000 - $720,000. This market range is based on total compensation (vs. only base salary), which is in line with our compensation philosophy.
We are an equal-opportunity employer and celebrate diversity, recognizing that diversity of thought and background builds stronger teams. We approach diversity and inclusion seriously and thoughtfully. We do not discriminate on the basis of race, religion, color, ancestry, national origin, caste, sex, sexual orientation, gender, gender identity or expression, age, disability, medical condition, pregnancy, genetic makeup, marital status, or military service.
Job is open for no less than 7 days and will be removed when the position is filled.