About The Position

Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. This is a project-based opportunity, not permanent employment. You will design computational material science problems to challenge a frontier AI model. The problems must have answers verifiable by code and require specialized tools like ObsPy, instaseis, pyrocko, MITgcm, flopy/MODFLOW, or others. Generic data wrangling around synthesized toy data will not suffice. Each problem runs inside a sealed Linux container with the tool pre-installed and a programmatic judge that grades the model's answer. As an expert author, you will pick an anchor tool and design a problem that hinges on its specific functionalities. You will write a Python reference solution, supply input files, and decide the numerical answer and acceptable tolerance. You will test the problem against the model in batches, tuning the difficulty until the agent succeeds in a small number of attempts. After review and approval, the task is passed to a senior reviewer in your subfield for final quality assurance. This process involves calibrating the problem against batches of parallel runs, aiming for a pass rate of 10-30%, which may require rewriting scenarios, tightening parameters, and observing agent behavior. This experience will deepen your command of the anchor tool and provide practical insight into how AI models navigate complex problems.

Requirements

  • Material scientists & engineers with experience in Python.
  • Open to part-time, non-permanent projects.
  • Degree in Material Science or related field.
  • 2+ years of research, applied, or teaching experience.
  • Python proficiency for writing reference solutions.
  • Fluency with — or strong willingness to independently learn — at least one scriptable package: ObsPy, instaseis, pyrocko, MITgcm, xmitgcm, flopy / MODFLOW, or GeoPandas.
  • Ability to design problems that genuinely require a specialized solver.
  • Strong written English (C1+).
  • Readiness to learn specialized tools independently if not already proficient.

Responsibilities

  • Pick an anchor tool and design a problem that hinges on its waveform-processing kernels, geophysical inversion routines, sub-surface flow solvers, or community-validated data pipelines.
  • Write a Python reference solution, supply input files and model or domain definitions where needed.
  • Decide the numerical answer and how close the model needs to get — with a domain-appropriate tolerance — to count as right.
  • Test the problem against the model in batches of parallel attempts, tuning the problem difficulty until the agent only succeeds in a small number of attempts.
  • Collaborate with senior reviewers to ensure task quality is high.

Benefits

  • Project-based AI opportunities
  • Work with leading tech companies
  • Up to $35 per hour equivalent compensation
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service