The CombinAItorial Sensor Design project is part of HHMI’s AI for Science Initiative (ai.hhmi.org) and brings together expertise in protein engineering, advanced microscopy, and machine learning. Our goal is to develop a protein biosensor optimization pipeline that integrates high-throughput functional screening with predictive deep learning to accelerate the directed evolution of protein biosensors for visualizing dynamic biochemical processes in living cells. We are seeking a highly skilled AI Software Engineer to join our team and play a crucial role in advancing our AI-driven scientific initiatives. In this position, you will be responsible for developing and maintaining the computational infrastructure essential for AI-powered biosensor optimization. You will collaborate with data scientists and experimentalists to develop robust data flows from optical pooled screening outputs through to model training and deployment. You will implement cutting-edge tools for predicting fluorescence properties and biochemical performance based on protein sequence and structure. You will utilize generative models to produce new sequences for biosensor candidates that will be tested in the lab. This role will require deep knowledge of the underlying models as well as practical implementation skills to have the maximum biological impact. You will lead comparative studies, implement novel architectures, and ensure all work meets the highest standards of reproducible, open science. This role requires close collaboration with our microscopy, sequencing, and protein engineering team to ensure the seamless integration of computational and experimental workflows. Strong programming skills in Python, PyTorch, and JAX are required, along with the ability to reason about neural network behavior from first principles. We seek candidates who can think critically about model design, understand how architectural choices and regularization affect model behavior, and design rigorous experiments to evaluate models. Domain expertise in sequence or protein structure analysis will be highly valued. Because this is a team project, we value a clean shared codebase and git-based collaborative workflows. Familiarity with protein modeling or machine learning frameworks such as AlphaFold, ESM, Chai-1, or Boltz-1 is highly valued. We are looking for candidates with experience in ML model deployment, workflow orchestration, and high-throughput data processing, as well as experience working with large biological datasets in GPU-based computing environments.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Education Level
Ph.D. or professional degree
Number of Employees
5,001-10,000 employees