Job Directory Principal Software Architect

Principal Software Architect
Allen, TX

Companies like
are looking for tech talent like you.

On Hired, employers apply to you with up-front salaries.
Sign up to start matching for free.

About

Job Description

Experian is seeking a Principal Site Reliability Engineer to join our CIS SRE team. The SRE team runs, maintains and improves the Big Data Platform against established Service Level Objectives by applying software engineering practices. It is responsible for the availability, performance, change management, monitoring, and capacity management of their services, with special emphasis being placed on the automation of the processes/workload in support of the above. The SRE team is also responsible for the operational support of the Big Data infrastructure, with emphasis being placed on the ability to submit outage/issue/incident data into a design and SDLC feedback loop to ensure maximum automation and outage avoidance.

About Experian

Experian is the world's leading global information services company, unlocking the power of data to create more opportunities for consumers, businesses and society. For five years in a row, we have been named in the Top 100 "World's Most Innovative Companies" by Forbes Magazine. With a focus on our employees, we were rated the #1 Top Workplace by the Orange County Register. Experian Consumer Information Services is redefining the way our clients do business within all aspects of the customer credit lifecycle. Fueled by best-in-class data and innovative technology we help businesses make smarter decisions, identify consumers, make decisions on loans, market to prospects and collect.

About this role

As a Principal SRE you will lead, design and build next generation large scale cloud bigdata platform. You will be working on the core compute platforms in the cloud. You will ensure that our software products delivered to our clients are always available and performant under exponential growth conditions. You'll partner with fellow engineers to architect and build mission critical cloud based bigdata systems that can stand the test of scale and availability, while limiting operational overhead. You will drive efficiencies in systems and processes such as capacity planning, configuration management, performance tuning, monitoring and root cause analysis.

* Design, write, and deliver software to improve the availability, scalability, latency, and efficiency of our bigdata platform.
* Manage end-to-end availability and performance of mission critical services and build automation to prevent problem recurrence. Build automated responses for non-exceptional service conditions.
* Assist in building out the DevOps strategy for hosting and managing our data analytics platforms in AWS cloud
* Strong track record of design/implementing cloud infrastructure solutions on distributed computing platforms in large enterprises where scale and complexity have been tackled.
* Proven experience participating in agile development projects for enterprise-level systems component design and implementation
* Deep understanding and application of enterprise software design for implementation of data services and middleware.

What your background looks like:

* You are passionate about building and running distributed systems at scale in production. You understand the challenges and trade-offs to be made when building and deploying systems to production.\
* Expertise in working with container deployment and orchestration technologies at scale with strong knowledge of the fundamentals to include service discovery, deployments, monitoring, scheduling, load balancing.
* Deep understanding of Systems programming (network stack, file system, OS services) and networking (L2 vs. L3, network architecture, VLANs, etc)
* Knowledge of best practices related to security, performance, and disaster recovery.
* Highly skilled in identifying performance bottlenecks, identifying anomalous system behavior, and determining the root cause of incidents.
* You've demonstrated the ability to effectively work collaboratively across functions.
* You are enthusiastic about making the many users of your product happier every day.
* Interest in designing, analyzing and troubleshooting large-scale distributed systems.
* Systematic problem-solving approach coupled with strong communication skills and a sense of ownership and drive.
* Ability to debug and optimize code and automate routine tasks.

Perks

* Four weeks of vacation to start, five sick days and two volunteer days (plus eleven paid holidays)
* This is a bonus eligible position, with a bonus target of 20%
* Employee stock purchase program and 401K matching
* Wellness initiatives, online discounts, employee discounts, pet insurance and more

EOE including Disability/Veteran

Let your dream job find you.

Sign up to start matching with top companies. It’s fast and free.