Senior Software Engineer I, Cloud and DevOps

Generate BiomedicinesSomerville, MA
10hOnsite

About The Position

Generate:Biomedicines is a new kind of therapeutics company – existing at the intersection of machine learning, biological engineering, and medicine – pioneering Generative Biology™ to create breakthrough medicines where novel therapeutics are computationally generated, instead of being discovered. The Company has built a machine learning-powered biomedicines platform with the potential to generate new drugs across a wide range of biologic modalities. This platform represents a potentially fundamental shift in what is possible in the field of biotherapeutic development. We pursue this audacious vision because we believe in the unique and revolutionary power of generative biology to radically transform the lives of billions, with an outsized opportunity for patients in need. We are seeking collaborative, relentless problem solvers that share our passion for impact to join us! Generate:Biomedicines was founded in 2018 by Flagship Pioneering and has received nearly $700 million in funding, providing the resources to rapidly scale the organization. The Company has offices in Somerville and Andover, Massachusetts with 300+ employees. You will join a collaborative platform team responsible for the cloud and DevOps foundations that support Generate:Biomedicines’ research at the intersection of machine learning and computational biology. The team delivers and evolves shared capabilities across accelerated computing, Kubernetes, CI/CD, and observability so that research and product teams can run workloads reliably, securely, and cost effectively as the organization scales. We are looking for a senior engineer with strong technical judgment, a collaborative approach, and a focus on delivering outcomes. You will assess complex systems, make thoughtful decisions, and partner with others to execute effectively. In this role, you will help identify where automation and well-designed paved paths create the most leverage, and contribute pragmatic solutions that reduce toil, improve performance, and make safe defaults the easiest choice for teams. If you enjoy turning ambiguity into progress and applying emerging technologies, including agent-assisted workflows, to real operational problems, this role offers meaningful scope and impact. This role will be onsite in our Somerville, MA location and require 2+ days/week in the office.

Requirements

  • 5+ years of relevant engineering experience
  • Foundations in Kubernetes and Linux operations: experience working with Kubernetes in production or production adjacent environments, and familiarity with troubleshooting networking or performance issues, upgrades, or migrations. Deep expertise across all areas is not required on day one.
  • Exposure to cloud and networking concepts: hands-on experience with at least one major cloud provider and an interest in learning more complex networking and hybrid connectivity patterns over time, including GPU focused environments and AWS.
  • An automation-oriented mindset: experience improving workflows, reducing manual effort, or introducing guardrails, along with curiosity about modern DevOps practices that help platform systems scale beyond individual contributors.
  • Interest in observability and cost awareness: some experience with dashboards, alerting, tracing, or telemetry, and a willingness to learn how to use signals to improve system behavior, performance, and cost efficiency.
  • Experience with containers and delivery practices: experience building or modifying container images and deployment workflows, and an interest in learning best practices around image hardening, reproducibility, and reliable delivery.

Nice To Haves

  • Familiarity with tools like DataDog, Grafana, or Prometheus is helpful but not required.

Responsibilities

  • Build and evolve our Kubernetes and compute platform: operate and improve shared clusters and associated tooling that support internal services, CI runners, and compute-heavy research workflows. This includes participating in upgrades, capacity planning, GPU scheduling, and incident response with clear escalation paths and shared on-call practices.
  • Drive automation-first DevOps: work with partner teams to reduce manual operations by improving deployment patterns, self-service capabilities, and operational guardrails, enabling teams to ship and run reliably with fewer one-off interventions.
  • Improve observability and performance through practical signals: design and iterate on dashboards, alerting, and instrumentation practices, including performance tuning loops, that help teams understand workload behavior, detect issues early, and make informed tradeoffs around efficiency and cost.
  • Strengthen infrastructure governance and delivery systems collaboratively: contribute to well-structured IaC workflows and change management practices, such as Terraform with review and apply processes, and help improve CI/CD reliability so infrastructure and application changes are safe, auditable, and timely.
  • Be a trusted teammate on a small platform group: collaborate closely through pairing, reviews, documentation, and shared ownership, and help build durable operational readiness through runbooks, training, and clearly defined operational standards.

Benefits

  • eligible for an annual bonus
  • equity compensation
  • competitive benefits package
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service