At LatchBio, we build the benchmarks that frontier AI labs use to evaluate and train models on biological reasoning. RefusalBench tests whether AI systems can distinguish legitimate biological research from requests that present meaningful biosecurity risks. We're looking for scientists with deep expertise in areas such as biosecurity, biosafety, pathogen genomics, infectious disease research, synthetic biology, biodefense, and public health. Your role will be to apply that expertise to determine where the boundary lies between routine scientific work and potentially dangerous biological capabilities. You will review real-world biological analyses, protocols, datasets, and research papers to establish ground truth for AI evaluations. Some tasks should clearly be allowed. Others should clearly be refused. Many sit in the gray area. Your job is to identify the relevant risks, justify the correct decision, and convert those examples into structured evaluations that test whether AI systems reach the same conclusion.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior
Education Level
No Education Listed