Software Engineer, Data Platform Reliability
San Francisco, CA

Job Description

At Lyft, community is what we are and it's what we do. It's what makes us different. To create the best ride for all, we start in our own community by creating an open, inclusive, and diverse organization where all team members are recognized for what they bring.

Passengers rely on Lyft to get to work, to go to the doctor, or to get home safely when public transit has stopped running. Building a stable and dependable application for our passengers and drivers is a responsibility we take very seriously, and we are building out a team of Software Engineers focused on reliability to support a highly reliable user experience.

This Reliability Software Engineering (RSWE) team will work on standardizing and supporting all of the growing teams throughout our organization, assessing their architecture, helping them design scalable services, and fostering excellent operational practices. It's an essential role in ensuring that our systems are always healthy, monitored, and automated.

Data is the core of our business at Lyft, helping us create a remarkable transportation experience for our customers and providing insights into the effectiveness of our products. To support this, we operate an extensive big data infrastructure in the AWS cloud. In addition to relying on big data compute engines like Hive and Presto, we also build an ecosystem of tools and services that allow all Lyft teams to support the platform as a cohesive service. Along with that, we are building a next-generation streaming platform based on Apache Flink and Apache Kafka. Our platform runs thousands of jobs, processes billions of events, and supports hundreds of data analysts and engineers across the company.

As a member of the diverse RSWE team, you will embed with the engineers in Data Platform to develop a reliable data infrastructure that scales with our incredible growth.

Responsibilities:

* Build holistic visibility into SLIs, SLOs, SLAs, dependency graphs, past performance of jobs, and system load to bring much-needed clarity to job projects.
* Build infrastructure and create projects that break things with the aim of improving the production systems.
* Use the core Site Reliability Engineering principles of monitoring, change management, emergency response, capacity planning, and production readiness reviews to run the Data Platform Infrastructure.
* Step back to observe patterns and develop thoughtful tools and automation to minimize toil. Use those learnings to create the best operational practices.
* Partner with the broader Lyft organization to build a culture of rigorously learning from incidents.

Experience:

* Substantial programming experience in Python or Go
* Passion for building tools and automation to make infrastructure more robust
* Experience working with public cloud platforms (e.g., AWS, Google Cloud Platform, Microsoft Azure, etc.)
* Experience designing, debugging, and running fault-tolerant, large-scale distributed systems
* Hands-on experience with the Hadoop (or similar) ecosystem: YARN, Hive, HDFS, Spark, Presto, Parquet, HBase, Flink, Kafka; Kinesis a plus

Benefits:

* Great medical, dental, and vision insurance options
* In addition to 11 observed holidays, salaried team members have unlimited paid time off, and hourly team members have 15 days of paid time off
* 401(k) plan to help save for your future
* 18 weeks of paid parental leave. Biological, adoptive, and foster parents are all eligible
* Monthly commuter subsidy to cover your transit to work
* 20% off all Lyft rides

Lyft is an Equal Employment Opportunity employer that proudly pursues and hires a diverse workforce. Lyft does not make hiring or employment decisions on the basis of race, color, religion or religious belief, ethnic or national origin, nationality, sex, gender, gender identity, sexual orientation, disability, age, military or veteran status, or any other basis protected by applicable local, state, or federal laws or prohibited by Company policy. Lyft also strives for a healthy and safe workplace and strictly prohibits harassment of any kind. Pursuant to the San Francisco Fair Chance Ordinance and other similar state laws and local ordinances, and its internal policy, Lyft will also consider for employment qualified applicants with arrest and conviction records.
