Engineering & Built Environment Experts

Weekday AI

11h•$90 - $110•Remote

About The Position

We are building a benchmark dataset to evaluate AI models on professional document understanding and instruction following within the Engineering & Built Environment domain. Tasks consist of complex, multi-step requests grounded in real-world workspace files (technical drawings, project specifications, engineering reports), web search, and code execution — each paired with a clearly defined ground truth output and an objective evaluation rubric. You will be responsible for authoring tasks that test an AI's ability to interpret engineering documentation, follow multi-step instructions, and produce precise, well-structured outputs.

Requirements

3+ years of hands-on experience in one or more of the following sub-domains: Mechanical engineering, Civil engineering, Industrial engineering, Architecture
Ability to commit 15-20 hours per week

Responsibilities

Authoring tasks that test an AI's ability to interpret engineering documentation
Authoring tasks that test an AI's ability to follow multi-step instructions
Authoring tasks that test an AI's ability to produce precise, well-structured outputs