Our Enterprise Analytics team is searching for data engineers who can contribute to our mission of expanding access to high quality, cost-effective health care and equipping our members with information and tools so they can make the best health care decisions for themselves and their families. Enterprise Analytics applies data science and machine learning to make healthcare more cost-effective and to improve outcomes for our members. We analyze a variety of structured and unstructured data, and apply state-of-the-art computational methods to add value.
We are actively recruiting data engineers to provide reliable data access patterns for our team's modeling efforts, and to contribute to tooling used enterprise-wide. Success in this role will require:
* Experience with data analysis and relational-style query languages * Familiarity with data pipelining and ETL * The ability to work with semi structured and unstructured data * Familiarity with general-purpose programming and shell scripting
This position is responsible for ensuring data quality, creation of new data pipelines, optimization and management of existing data pipelines, ingestion and curation of data sources for analytical purposes, and transformations of structured and unstructured data into formats suitable for machine learning and advanced analytics. As data science products are operationalized, they will also work on deployment and making sure products are production-ready and run smoothly. They will work closely with IT on implementation of data products where traditional application development support is needed.
* Bachelor degree and 5 years of work experience in a computer science, engineering, or related field OR Master's degree and 4 years of work experience in a computer science, engineering, or related field OR Ph.D. and 2 years of work experience in a computer science, engineering, or related field * Learning and growth mindset. * Customer-focused. * Interpersonal, verbal and written communication skills. * Must demonstrate proficiency in at least five and mastery in one of the following six areas: 1) data analysis and relational-style query languages; 2) data pipelining and ETL; 3) working with semistructured and unstructured data; 4) a high-level programming language; 5) distributed computing; 6) understanding of healthcare. * Proficiency in iterative development practices. * A track record of independently delivering or leading the delivery of data engineering solutions for multiple complex analytics or data science projects and products.
* Experience with the following tools and skills: * SQL, Hive and Python * Distributed computing frameworks such as Spark and Dask * Cloud computing frameworks such as AWS, Azure and GCP * Tools for source code management and continuous integration * Methods for analytical data preprocessing (including missing value imputation, scaling and feature engineering)
This position is based in Chicago, IL.
Keywords: data science, engineer, data engineer, hive, azure, distributed computing
HCSC is committed to diversity in the workplace and to providing equal opportunity and affirmative action to employees and applicants. We are an Equal Opportunity Employment / Affirmative Action employer dedicated to workforce diversity and a drug-free and smoke-free workplace. Drug screening and background investigation are required, as allowed by law. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or protected veteran status.
Requirements: Expertise Information Technology Job Type Full-Time Regular Location IL - Chicago
Let your dream job find you.
Sign up to start matching with top companies. It’s fast and free.