DiffUSE is a project at the Astera Institute focused on developing open infrastructure for studying protein dynamics directly from experimental data. The project operates at the intersection of structural biology (crystallography, cryo-EM), modern machine learning, computational biophysics, and open scientific tooling. The team comprises computational biologists, ML researchers, software engineers, and program staff, collaborating across Astera, Radial, and partner institutions. DiffUSE is expanding and is seeking individuals excited about contributing to their mission, even if a specific role isn't currently posted. Submissions are reviewed on a rolling basis, and interested candidates will be contacted if a suitable opportunity arises now or in the future. The project is actively building around several key areas: Computational and data science, including diffraction data processing, structural data pipelines, multiconformer and heterogeneity analysis, data standards (mmCIF), macromolecular ensemble metrics, and machine learning research for macromolecules and biophysics (representation learning, ML on raw experimental data, 3D vision, geometric deep learning). They are also focused on Dataset generation and open release, which involves designing and running campaigns for large structural datasets, partnering with external collaborators for data standardization, and bridging experimental facilities, data producers, and the open-science community. Lastly, they are building out Software and infrastructure engineering capabilities, focusing on scientific data infrastructure, pipelines, tooling, and open-source release engineering, reproducibility, and developer experience, as well as Program and operations roles in program management, scientific coordination, communications, and open-science publishing and community building.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Education Level
No Education Listed