Principal Data Engineer

Regeneron Pharmaceuticals
Onsite

About The Position

Principal Data Engineer builds data infrastructure, leads technical initiatives, and mentors junior team members while driving data-driven solutions across the organization. As a Principal Data Engineer, a typical day might include the following: Design complex data engineering solutions and define standards. Architect and optimize secure, scalable pipelines (ETL/ELT) for real-time and batch processing. Integrate diverse data sources, implement fault-tolerant systems, and establish CI/CD practices. Organize large datasets, ensure data quality, and design data lake/warehouse solutions for accessibility. Monitor pipeline performance, troubleshoot issues, and implement observability and alerting systems. Leverage GenAI solutions to enhance team efficiency. Document systems and ensure adherence to governance policies. Mentor junior engineers and drive infrastructure innovation. Regeneron is a leading biotechnology company that invents life-transforming medicines for people with serious diseases. Founded and led for over 30 years by physician-scientists, our unique ability to repeatedly and consistently translate science into medicine has led to nine FDA-approved treatments and numerous product candidates in development, almost all of which were homegrown in our laboratories. Our medicines and pipeline are designed to help patients with eye diseases, allergic and inflammatory diseases, cancer, cardiovascular and metabolic diseases, neurology, infectious diseases and rare diseases. Regeneron is also proud of award winning culture of innovation, being recognized as a Great Place to Work in 2021, Fast Company’s Best Workplace for Innovators in 2020, and Forbes JUST companies in 2020, among others.

Requirements

  • BS/BA in Computer Science, Bioinformatics, or related field
  • 8+ years relevant experience
  • 1+ years experience in biotech, pharmaceutical, or other life sciences industries
  • 3+ years cloud platform experience (AWS, Azure)
  • Experience with workflow orchestration tools (Airflow, Luigi, Prefect, or similar)
  • Experience with containerization technologies
  • Experience with scientific data management systems
  • Experience using GenAI to enhance own work
  • Strong Python, Java, or Scala programming skills
  • Deep SQL expertise and relational database experience
  • NoSQL and big data technology experience (Hadoop, Spark, Kafka)
  • Proficiency in data modeling and schema design
  • Knowledge of data security and compliance requirements in regulated environments
  • Excellent communication skills

Nice To Haves

  • Master's degree in Computer Science, Bioinformatics, or related field preferred
  • Familiarity with Biotech Enterprise Systems (MES, LIMS, QMS)
  • Knowledge of MCP and Orchestration platforms related to AI/GenAI
  • Proficiency in star schemas and data modeling tools
  • Knowledge of industry regulatory requirements (CFR Part 11, GxP, CSA)
  • Stream processing experience (Kafka, Flink)
  • Cloud certifications

Responsibilities

  • Builds data infrastructure
  • Leads technical initiatives
  • Mentors junior team members
  • Drives data-driven solutions across the organization
  • Design complex data engineering solutions and define standards
  • Architect and optimize secure, scalable pipelines (ETL/ELT) for real-time and batch processing
  • Integrate diverse data sources, implement fault-tolerant systems, and establish CI/CD practices
  • Organize large datasets, ensure data quality, and design data lake/warehouse solutions for accessibility
  • Monitor pipeline performance, troubleshoot issues, and implement observability and alerting systems
  • Leverage GenAI solutions to enhance team efficiency
  • Document systems and ensure adherence to governance policies
  • Mentor junior engineers and drive infrastructure innovation

Benefits

  • Comprehensive benefits (vary by location)
  • Health and wellness programs (including medical, dental, vision, life, and disability insurance)
  • Fitness centers
  • 401(k) company match
  • Family support benefits
  • Equity awards
  • Annual bonuses
  • Paid time off
  • Paid leaves (e.g., military and parental leave)
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service