Sr. Site Reliability Engineer duties including building and enhancing the tooling needed to deploy and operate Hadoop clusters at scale, across multiple data centers, and cloud providers. Responsible for building tools to maintain the health and operations of the company data infrastructure. Provides problem resolution services. Designs and implements scalable data platforms for customer facing services. Deploy and scale Hadoop Infrastructure, capacity planning, data cluster monitoring and troubleshooting, drive operational enhancements and participate in the complete product life-cycle.
Utilizes tools and technologies including Linux operating system and a vast array of other technologies.
Master's degree in Computer Science, Engineering, Information Technology, or related field with 2 years experience or Bachelor's degree in Computer Science, Engineering, Information Technology, or related field with 5 years experience. Academic training or experience must include some exposure to:
2 years' experience designing and implementing scalable data platforms.
1 year experience deploying and scaling Hadoop Infrastructure
1 year data cluster monitoring and troubleshooting.
2 years' with OS integration and application installation.
2 years'programming with two of the following: Java, Perl, Python.
2 years' installing, configuring Linux based systems.
1 years' scripting for automation and coding management with one of the following: Chief Puppet)
1 year managing Hadoop and its ecosystem - one of the following: Hive, Pig, Spark, Flume, Zookeeper
Primary Location City/State:
Additional Locations (if applicable):
Acxiom is an affirmative action and equal opportunity employer (AA/EOE/W/M/Vet/Disabled) and does not discriminate in recruiting, hiring, training, promotion or other employment of associates or the awarding of subcontracts because of a person's race, color, sex, age, religion, national origin, protected veteran, disability, sexual orientation, gender identity, genetics or other protected status.