Senior Engineer - AI Evaluator

G2i Inc.•Miami, FL

About The Position

We’re looking for highly experienced software engineers (SR+) to help evaluate the quality of interactions with modern coding agents such as OpenAI Codex and Claude Code. This is not a traditional engineering role where you will be writing production code. Instead, you’ll be evaluating something harder: whether the model thinks like a great engineer. You will assess how AI coding agents behave in real-world scenarios, focusing on whether the response makes sense, whether the preamble and reasoning are useful, whether the output reflects strong engineering judgment, and whether the interaction feels right to an experienced developer. This role is about engineering taste — not syntax correctness.

Requirements

Staff / Principal-level engineer (or equivalent experience).
Strong background in one of the following: TypeScript / JavaScript or Python.
Hands-on experience using OpenAI Codex, Claude Code, and Cursor.
Deep familiarity with modern AI-assisted dev workflows.
Able to evaluate code without needing to fully execute or deeply review every line.
Comfortable giving direct, opinionated feedback.
High bar for what “good engineering” looks like.

Nice To Haves

Experience with tools like Cursor or similar AI-first IDEs.
Prior exposure to prompt design or evaluation workflows.
Experience mentoring senior engineers or defining engineering standards.

Responsibilities

Evaluate AI-generated coding interactions end-to-end.
Judge whether outputs are useful, correct (at a high level), and aligned with how a strong engineer would think.
Assess the quality of explanations and reasoning, not just code.
Distinguish between different levels of response quality (e.g. what makes something a 2 vs 4).
Provide clear, opinionated feedback on what worked, what didn’t, and what felt “off” or misleading.
Help define what great looks like when interacting with tools like Cursor.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume