Have you ever wanted to know how Amazon EC2 works? Do you want to help build the next generation of web-scale infrastructure services that enable companies of all sizes, from start-ups to large enterprises, to run on EC2?
The AWS Elastic Compute Cloud (EC2) Windows team is looking for an experienced data engineer who can use the immense volume of data the team generates to deliver superior value and a better experience to our customers, conduct statistical analysis, and design machine learning models.
As a Data Engineer, you will work in large and complex data warehouse environments. You should be passionate about working with huge data sets and love bringing datasets together to answer business questions. You should have deep expertise in the creation and management of datasets. You will build data analytics solutions that address increasingly complex business questions for AWS EC2.
You should be an expert at implementing and operating stable, scalable data flow solutions from production systems into end-user-facing applications and reports. These solutions will be fault tolerant, self-healing, and adaptive. You will develop solutions that address some of the unique challenges of space, size, and speed. You will implement data analytics using cutting-edge analytics patterns and technologies, including but not limited to various AWS offerings: EMR, Lambda, Kinesis, and Spectrum. You will extract huge volumes of structured and unstructured data from various sources (relational, non-relational, and NoSQL databases) and message streams, construct complex analyses, and feed the results into machine learning models. You will create scalable machine learning models, write scalable code, and tune the performance of jobs running over billions of rows of data. You will implement data flow solutions that process data on Spark and Redshift and store it in Redshift and file-based systems (S3) for reporting and ad hoc analysis.
You should be detail-oriented and have an aptitude for solving unstructured problems. You should be able to work in a self-directed environment, own tasks, and drive them to completion.
You should apply statistical, data science, machine learning, or other innovative methods to specific business problems and data. You should have excellent business and communication skills so that you can work with business owners to define key business questions and build data sets that answer them. You will own the customer relationship around data and carry out the tasks that come with that ownership, such as ensuring high data availability and low latency, documenting data details and transformations, and handling user notifications and training.
Amazon is an electronic commerce and cloud computing company.