Government Specialist - Freelance AI Trainer Project

Invisible Agency

89d • $10 - $30 • Remote

About The Position

We are sourcing independent Government Specialists to provide their expertise for an AI benchmark evaluation project. As AI models increasingly generate professional-grade public policy memos, regulatory frameworks, and civic administration deliverables, their accuracy relies entirely on robust, expert-crafted training data. The objective of this project is to autonomously produce high-quality evaluation tasks, strong prompts, and clear, well-structured rubrics that generate clean, reliable data for model training.

Project Deliverables & Scope

Operate autonomously to design complex evaluation frameworks and provide structured training data. Expected deliverables include:

Task & Prompt Creation: Generating realistic, high-quality prompts that compel the AI model to produce complex, professional-grade deliverables specific to public administration, government policy, and civic operations.
Rubric Development: Writing clear, well-structured evaluation rubrics with criteria that are highly specific, non-ambiguous, and easy to score.
Benchmark Evaluation Data Generation: Producing clean, reliable training data that directly aids in the evaluation and refinement of AI models handling complex governmental and regulatory tasks.
Quality Assurance & Fact-Checking: Ensuring all generated tasks and scoring criteria reflect strict, real-world legislative standards, bureaucratic protocols, and public policy frameworks.

Requirements

Demonstrable professional expertise within the public administration, legislative affairs, civic planning, or regulatory compliance sectors, with a deep understanding of government standards, bureaucratic terminology, and public sector deliverables.
Strong writing and prompt generation skills, with the ability to design highly realistic, complex civic task scenarios for AI evaluation.
Proficiency in rubric generation, specifically the ability to create objective, non-ambiguous scoring criteria that leave no room for subjective interpretation.
A meticulous, detail-oriented approach to fact-checking policy documents, legislative frameworks, and civic guidelines to generate reliable data for system benchmarking.

Responsibilities

Task & Prompt Creation: Generating realistic, high-quality prompts that compel the AI model to produce complex, professional-grade deliverables specific to public administration, government policy, and civic operations.
Rubric Development: Writing clear, well-structured evaluation rubrics with criteria that are highly specific, non-ambiguous, and easy to score.
Benchmark Evaluation Data Generation: Producing clean, reliable training data that directly aids in the evaluation and refinement of AI models handling complex governmental and regulatory tasks.
Quality Assurance & Fact-Checking: Ensuring all generated tasks and scoring criteria reflect strict, real-world legislative standards, bureaucratic protocols, and public policy frameworks.
