In the Azure DevOps team, Ops is a critical part of who we are. We are looking to make our operational engineering and infrastructure the best in the industry to improve our customers' experience and our velocity of innovation. We are looking for a Senior Software Engineer who is passionate about writing code, solving hard problems in large-scale production, and applying the principles of Site Reliability Engineering to production services. In this this position you will focus on improving scale, reliability, and operational efficiency of one of the world's largest DevOps service providers.
Our team consists of Software and Site Reliability Engineers focused on solving operational problems through software in our services and ensuring our services run reliably at scale.
Learn more about Azure DevOps and what we build here: https://azure.microsoft.com/en-us/services/devops/
What makes this a great place to work?
There are many things, but here are the highlights:
We are our own customer. We are the first ones to use everything we build.
The team is full of smart people who care about the work they do. They are also a pleasure to work with - We thrive on comradery, helping each other and having fun together.
We run a hosted service which means that we ship new features daily. The work you do has immediate benefits to customers.
5+ years' experience developing production software or optimizing automation, reliability, and monitoring of production services
Bachelor's degree in computer science, mathematics, engineering or equivalent experience
2 or more years' experience with one or more of the following: C#, ASP.NET, Web Services, SQL, Azure, PowerShell, Kubernetes
Demonstrated design skills for large scale and highly available cloud services or distributed systems
Troubleshooting skills across network, application, caching, queuing, load-balancing, storage and distributed services layers.
Excellent analytical skills as well as communication skills both verbal and written
Strong customer focus and data driven approach
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings:
Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Key team responsibilities:
Build solutions to enable efficient and automated operations of Azure services at scale. Eliminate manual toil with well-engineered solutions that increase development velocity and security for dozens of services and hundreds of engineers.
Tackle hard production problems with service reliability, latency, and availability and develop solutions that enable others to do the same.
Collaborate with other engineers to design and deliver solutions for automatic mitigation, telemetry & monitoring, disaster recovery, capacity management and incident management.
Collaborate with other Azure platform teams to improve service reliability, latency, and troubleshooting.
Participate in an on-call rotation and contribute to a culture of live site and customer focus through on call engineer training, high-quality incident postmortems, and leading by example.
Microsoft is a technology company that develops and supports software, services, and devices.