Logo of Huzzle

Lead Data Engineer - ML/GCP

Applications are closed

  • Job
    Full-time
    Senior (5-8 years)
  • Irving

Requirements

  • 7+ years of progressively complex related experience in cloud data engineering and data analysis.
  • 7+ Years of experience in Data Engineering, Analytics and Machine Learning Systems.
  • 3+ years of building cloud native analytical products in GCP, Azure or AWS
  • Sound knowledge in any of cloud Technology is must preferably Google cloud Platform (GCP).
  • Deep Knowledge of Large Scale distributed Data Architecture and Performance Optimization Techniques.
  • Proficiency in developing Complex Data Pipelines, ETLs and Workflows on Cloud Platform optimized for High Volume of Health Care data.
  • Proficiency in using Cloud Platforms such as GCP/Azure/AWS. Knowledge of tools like Composer, Kafka, PySpark and SQL.
  • Knowledge of Programming Language Java/Python.
  • Proficiency with CI/CD tooling like Jenkins, GitHub to enable robust development pipelines for data and ML.
  • Strong knowledge of large-scale search applications and building high volume data pipelines, preferably using PySpark on GCP and It’s native tools such as BigQuery, Airflow, Composer, DataProc, PUB/SUB, DataFlow and Vertex AI.
  • Strong Foundational Knowledge in Agile Methodologies.
  • Experience with Healthcare domain is highly desirable.
  • Deep understanding of data warehousing, data architecture, and data modeling methods & best practices.
  • Understanding of AI/ML technology Stack.
  • Comfortable working experience with large scale LLMs.
  • Exposure in implementing Gen AI and/or NLP based solutions using LLMs.
  • Sound experience in Google Cloud Data Services: Big Query, Data Proc, PubSub, Cloud Functions, Cloud Storage, Dataflow, Composer.
  • Bachelor's degree or equivalent work experience in Mathematics, Statistics, Computer Science, Business Analytics, Data Science, Engineering, or related discipline
  • Master’s degree Preferred in Computer Science/ML

Responsibilities

  • As a technical leader, provide guidance to a team of data engineers and collaborate closely with data scientists and analysts to support data-driven decision-making.
  • Analyzes complex data structures from disparate data sources and design large scale data engineering pipeline.
  • Develops large scale data structures and pipelines to organize, collect and standardize data that helps generate insights and addresses reporting needs.
  • Implement data ingestion pipeline using APIs, third party tools, or create custom codes to ingest high volume data into Cloud environment.
  • Writes processes, designs database systems and develops tools for real-time and offline analytic processing.
  • Collaborates with product business and data science team to collect user stories, translate into technical specifications, and implement data transformation, algorithms and models into automated processes.
  • Uses strong programming skills in PySpark, Python/Java or any of the major languages to build robust data pipelines and dynamic systems.
  • Builds highly scalable and extensible data marts and data models to support Data Science and other internal customers on Cloud. Integrates data from a variety of sources, assuring that they adhere to data quality and accessibility standards.
  • Analyzes current information technology environments to identify and assess critical capabilities and recommend solutions.
  • Build and facilitate machine learning across large scale systems and campaigns and ensure deployment and updates in partnership with data engineering.
  • Develop and participate in presentations and consultations to existing and prospective constituents on analytics results and solutions.
  • Interact with internal and external peers and managers to exchange complex information related to areas of specialization.
  • The ideal candidate should be detailed oriented with the ability to quickly understand complex situations, manage multiple urgent tasks at the same time and have a proven track record of communicating in an open and honest way that quickly builds trust and respect.
  • The ideal candidate should work with data related to a wide range of customer interactions and analytics. This role will be responsible for designing and deploying large scale ML models with support from data engineering and Product Team.
  • Experiments with available tools and advice on new tools to determine optimal solution given the requirements dictated by the model/use cases.

FAQs

What is the primary purpose of the Lead Data Engineer position at CVS Health?

The primary purpose of the Lead Data Engineer position is to provide technical leadership in data engineering and collaborate with data scientists and analysts to support data-driven decision-making, contributing to enhancing human-centric health care.

What are the key responsibilities of the Lead Data Engineer?

Key responsibilities include designing large-scale data engineering pipelines, implementing data ingestion processes, collaborating with product and data science teams, building scalable data models, and facilitating machine learning initiatives across large systems and campaigns.

What qualifications are required for this role?

Required qualifications include 7+ years of experience in cloud data engineering and analytics, 3+ years building cloud-native analytical products, proficiency in data pipeline development, and strong programming skills in languages such as Python or Java. Knowledge of Google Cloud Platform is essential.

Is experience in the healthcare domain necessary for this position?

While not mandatory, experience in the healthcare domain is highly desirable as it aligns closely with the responsibilities and data types pertinent to the role.

What level of education is preferred for candidates applying for this position?

A Bachelor's degree in Mathematics, Statistics, Computer Science, Business Analytics, Data Science, Engineering, or a related discipline is required, and a Master's degree in Computer Science or Machine Learning is preferred.

What technical skills are essential for the Lead Data Engineer role?

Essential technical skills include proficiency in developing complex data pipelines, familiarity with cloud platforms (especially GCP), knowledge of tools like BigQuery, Airflow, and Composer, and experience with CI/CD tooling like Jenkins and GitHub.

What is the pay range for this position?

The typical pay range for the Lead Data Engineer position is $118,450.00 - $236,900.00. The actual salary offer may vary based on factors such as experience, education, and geography.

What benefits does CVS Health offer to its employees in this role?

CVS Health offers a full range of benefits including medical, dental, and vision coverage, a 401(k) retirement savings plan, stock purchase options, life insurance, disability benefits, well-being programs, education assistance, and Paid Time Off (PTO), among others.

When does the application window for this position close?

The application window for this opening will close on 10/28/2024.

Will CVS Health consider qualified applicants with arrest or conviction records?

Yes, qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state, and local laws.

Bringing our heart to every moment of your health.

Science & Healthcare
Industry
10,001+
Employees
1963
Founded Year

Mission & Purpose

CVS Health is a healthcare innovation company that operates retail pharmacies, manages pharmacy benefits, and provides health services through its MinuteClinic and HealthHUB locations. Their ultimate aim is to improve the quality of life for communities by making healthcare more accessible and affordable. CVS Health focuses on driving healthier outcomes and reducing healthcare costs, using its comprehensive range of services and products to support individuals on their health journey.

Get notified when CVS Health posts a new role

Get Hired with Huzzle

Discover jobs with AI-powered precision. Autofill and track applications, create tailored resumes, and find the best opportunities across the web – all by simply chatting.

Already have an account?