Prima Mente’s goal is to deeply understand the brain, to protect the brain from neurological disease and enhance the brain in health. We do this by generating our own data, building brain foundation models, and translating discovery to real clinical and research impact. Role focus - Biological Data Infrastructure at Petabyte Scale Key Tasks: Owning and scaling our data infrastructure by several orders of magnitude to handle > 100 petabyte-scale multi-omic datasets, including data pipelines, distributed data processing, and storage systems Building a unified feature store for all our ML models and biological data analysis workflows Efficiently storing and loading petabytes of data for ML bio data Processing and storing predictions and evaluation metrics for large-scale biological forecasting and analysis models Implementing data versioning and point-in-time correctness systems for evolving biological datasets Building observable, debuggable data pipelines that handle the complexity of multi-omic data sources Expected Growth In 1 month you will be responsible for analyzing current data infrastructure bottlenecks, implementing initial optimizations to existing pipelines, and beginning work on scaling our feature store infrastructure for ML models. In 3 months you'll directly own and have scaled key components of our data processing systems, built prototype streaming pipelines for real-time data ingestion, and contributed to designing our unified feature store architecture. In 6 months you'll have implemented high-performance petabyte-scale data infrastructure, established data versioning and point-in-time correctness systems, and delivered measurable improvements in data processing throughput and reliability. Why Join Us: Meaningful Impact: Contribute directly to research infrastructure that powers discoveries potentially impacting millions of lives. Innovation & Autonomy: Work at the forefront of AI and multi-omics, with the freedom to propose and implement state-of-the-art infrastructure solutions. Exceptional Team: Collaborate with talented colleagues from diverse backgrounds across ML, bioinformatics, and engineering. Growth Opportunities: Continuous learning and growth opportunities in a rapidly advancing technical field.