Data Engineer

CVS Health · Hartford, VT
18h · Hybrid

About The Position

We’re building a world of health around every individual, shaping a more connected, convenient and compassionate health experience. At CVS Health®, you’ll be surrounded by passionate colleagues who care deeply, innovate with purpose, hold ourselves accountable and prioritize safety and quality in everything we do. Join us and be part of something bigger – helping to simplify health care one person, one family and one community at a time.

Position Summary

If you’re eager to make a real impact in the health care industry through your own meaningful contributions, explore a role in technology with CVS Health. Our journey calls for technical innovators and data visionaries: come help us pave the way.

At CVS Health, we possess an extensive repository of healthcare data that spans over 150 million individuals, providing an unparalleled foundation for ambitious Data Engineers. In this role, you will engage with complex business challenges, harnessing modern tools and technologies to securely store, process, transform, and enrich terabyte to petabyte scale healthcare data. Your work will underpin data-driven business decisions and contribute to our mission of delivering industry-best data products / software with a customer-first mindset and team-oriented approach.

As a Data Engineer, you will be instrumental in designing, developing, and maintaining optimal data pipelines to assemble large and intricate datasets, catering to the business requirements of various CVS lines of business. Collaborating closely with teams, you will craft tools to provide actionable insights and integrate them with consumer touchpoints.

As leaders in healthcare, our analytics and engineering teams deliver innovative solutions to business problems by collaborating with cross-functional teams in a dynamic and agile environment. You will be part of a team that values collaboration and encourages innovative thinking at all levels.
You will be intellectually challenged to solve problems associated with large-scale, complex, structured and unstructured data, which will allow you to grow your technical skills and engineering expertise. This role may work onsite 2 days a week in one of CVS Health's core hubs: NYC; Hartford, CT; or Wellesley, MA. Full-time remote candidates will also be considered.

Requirements

  • 1+ years of experience with SQL, NoSQL
  • 1+ years of experience with Python (or a comparable scripting language)
  • 1+ years of experience with data warehouses (including data modeling and technical architectures) and infrastructure components
  • 1+ years of experience with ETL/ELT, and building high-volume data pipelines
  • 1+ years of experience with reporting/analytic tools
  • 1+ years of experience with query optimization, data structures, transformation, metadata, dependency, and workload management
  • 1+ years of experience with big data and cloud architecture
  • 1+ years of hands-on experience building modern data pipelines within a major cloud platform (GCP)
  • 1+ years of experience with deployment/scaling of apps in containerized environments (e.g., Kubernetes, AKS)
  • 1+ years of experience with real-time and streaming technology (e.g., Kafka, Spark Streaming)
  • 1+ years of experience soliciting complex requirements and managing relationships with key stakeholders

Nice To Haves

  • Experience with complex systems and solving challenging analytical problems
  • Strong collaboration and communication skills within and across teams
  • Knowledge of data visualization and reporting
  • Experience in designing and building data engineering solutions in cloud environments (preferably GCP)
  • Experience with Git, CI/CD pipelines, and other DevOps principles/best practices
  • Experience with bash shell scripts, UNIX utilities, and UNIX commands
  • Understanding of software development methodologies including waterfall and agile
  • Ability to leverage multiple tools and programming languages to analyze and manipulate data sets from disparate data sources
  • Knowledge of API development
  • Experience with schema design and dimensional data modeling
  • Google Professional Data Engineer Certification
  • Knowledge of microservices and SOA
  • Formal SAFe and/or agile experience
  • Previous healthcare experience and domain knowledge
  • Experience designing, building, and maintaining data processing systems
  • Experience architecting and building data warehouses and data lakes

Responsibilities

  • Build and optimize analytical data models in BigQuery.
  • Implement partitioning, clustering, and materialized views for performance and cost efficiency.
  • Ensure compliance with data governance, access controls, and IAM best practices.
  • Develop integrations with external systems (APIs, flat files, etc.) using GCP-native or hybrid approaches.
  • Utilize tools like Dataflow or custom Python/Java services on Cloud Functions or Cloud Run to handle transformations and ingestion logic.
  • Build automated CI/CD pipelines using Cloud Build, GitHub Actions, or Jenkins for deploying data pipeline code and workflows.
  • Set up observability using Cloud Monitoring, Cloud Logging, and Error Reporting to ensure pipeline reliability.

Benefits

  • Affordable medical plan options, a 401(k) plan (including matching company contributions), and an employee stock purchase plan.
  • No-cost programs for all colleagues including wellness screenings, tobacco cessation and weight management programs, confidential counseling and financial coaching.
  • Benefit solutions that address the different needs and preferences of our colleagues including paid time off, flexible work schedules, family leave, dependent care resources, colleague assistance programs, tuition assistance, retiree medical access and many other benefits depending on eligibility.