AI/ML Research Engineer

Manifold Bio•Boston, MA

About The Position

Manifold Bio builds AI models for protein therapeutic design, trained on proprietary experimental data generated at unprecedented scale. Our in vivo-centric discovery platform produces millions of experimentally validated protein designs per campaign, creating the datasets that make our models possible and our approach uniquely powerful. We combine high-throughput protein engineering with computational design to create antibody-like drugs and other biologics. Our world-class team of protein engineers, biologists, and computational scientists are working together to aim the platform at therapeutic opportunities where precise targeting is the key to overcoming clinical challenges. Position Manifold Bio is seeking a talented Machine Learning Research Engineer to join our growing AI team. You will work closely with our research scientists to implement, scale, and optimize machine learning systems that power our de novo antibody design platform and advance our protein design capabilities. Your efforts will contribute to building production-ready ML infrastructure that enables breakthrough discoveries in protein therapeutics. You will be expected to take ownership of engineering challenges in our ML pipeline, from data processing and model training to deployment and monitoring, while collaborating closely with our research team to translate cutting-edge ideas into robust, scalable systems.

Requirements

Bachelor's or Master's degree in Computer Science, Machine Learning, Computational Biology, or related field
2+ years of hands-on experience with PyTorch and/or JAX for deep learning applications
Strong proficiency in Python scientific computing stack (NumPy, Pandas, scikit-learn)
Experience with distributed computing and GPU optimization techniques
Familiarity with protein structure analysis, computational biology, or analogous problems in natural sciences
Understanding of modern deep learning architectures and optimization techniques
Experience implementing research papers or translating ML approaches to production systems
Proficiency with version control (Git), testing frameworks, and software engineering best practices
Strong problem-solving skills and ability to work independently on technical challenges
Excellent written and verbal communication skills for cross-functional collaboration

Nice To Haves

Experience training LLMs or diffusion generative models
Knowledge of cloud computing platforms (AWS, GCP) and containerization (Docker, Kubernetes)
Background in computational biology, bioinformatics, or structural biology
Experience with large-scale data engineering and ETL pipelines
Familiarity with MLOps practices and model deployment frameworks

Responsibilities

Implement and optimize machine learning models for protein design
Build and maintain scalable data processing pipelines for large-scale protein and molecular datasets
Develop and deploy ML infrastructure for distributed training and inference across GPU clusters
Collaborate with research scientists to translate experimental ML approaches into production-ready code
Design and execute ML experiments with clear hypotheses and rigorous analysis
Optimize model performance and computational efficiency for large-scale protein design tasks
Build tools and utilities to support rapid prototyping and experimentation by the research team

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume