Genentech-posted 3 months ago
$147,600 - $274,000/Yr
Daly City, CA
5,001-10,000 employees

A healthier future. It’s what drives us to innovate. To continuously advance science and ensure everyone has access to the healthcare they need today and for generations to come. Creating a world where we all have more time with the people we love. That’s what makes us Roche. Advances in AI, data, and computational sciences are transforming drug discovery and development. Roche’s Research and Early Development organisations at Genentech (gRED) and Pharma (pRED) have demonstrated how these technologies accelerate R&D, leveraging data and novel computational models to drive impact. Seamless data sharing and access to models across gRED and pRED are essential to maximising these opportunities. The new Computational Sciences Center of Excellence (CoE) is a strategic, unified group whose goal is to harness this transformative power of data and Artificial Intelligence (AI) to assist our scientists in both pRED and gRED to deliver more innovative and transformative medicines for patients worldwide.

  • Design, implement, and maintain scalable and reliable ML infrastructure on AWS.
  • Automate deployment, monitoring, alerting, and operational tasks using tools like Terraform and Helm.
  • Manage and optimize CI/CD pipelines and Git repositories for ML projects, ensuring efficient version control to support collaboration and deployment.
  • Collaborate closely with ML engineers and data scientists to understand their infrastructure needs and provide solutions.
  • Troubleshoot and resolve infrastructure-related issues in a timely manner.
  • Implement and enforce security best practices for ML infrastructure.
  • Document infrastructure designs, processes, and operational procedures.
  • Contribute to initiatives independently as part of a team, delivering assigned outputs.
  • Proactively identify issues and gaps, proposing ideas and suggestions for improvements.
  • Proven experience in designing, deploying, and managing infrastructure on Amazon Web Services (AWS), including services such as EC2, S3, RDS, EKS, SageMaker, etc.
  • Strong proficiency with Git and Git repository management.
  • Hands-on experience with Terraform for infrastructure provisioning and management.
  • Experience with Helm for deploying and managing applications on Kubernetes.
  • Proficiency in scripting languages (e.g., Python, Bash) for automation.
  • Excellent problem-solving skills and a strong ability to debug complex issues.
  • Strong communication and interpersonal skills to effectively collaborate with cross-functional teams and user-facing interactions.
  • Demonstrated ability to take initiative, anticipate needs, and drive projects to completion.
  • Ability to thrive in a fast-paced environment and adapt to evolving requirements while adhering to corporate guidelines.
  • Ability to write clean code with little syntax/convention feedback.
  • Applies software engineering best practices (linting automation, unit testing, documentation, CI/CD).
  • Familiarity with modern machine learning methods.
  • Knowledge of and experience with high-performance computing, distributed systems, and cloud computing.
  • Experience with MLOps platforms and tools.
  • Familiarity with CI/CD pipelines for ML workflows.
  • Knowledge of monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack).
  • Relocation benefits are available for this job posting.
  • A discretionary annual bonus may be available based on individual and Company performance.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service