Government Specialist - Freelance AI Trainer Project

Invisible Agency
$10 - $30Remote

About The Position

We are sourcing independent Government Specialists to provide their expertise for an AI benchmark evaluation project. As AI models increasingly generate professional-grade public policy memos, regulatory frameworks, and civic administration deliverables, their accuracy relies entirely on robust, expert-crafted training data. The objective of this project is to autonomously produce high-quality evaluation tasks, strong prompts, and clear, well-structured rubrics that generate clean, reliable data for model training. Project Deliverables & Scope Operate autonomously to design complex evaluation frameworks and provide structured training data. Expected deliverables include: Task & Prompt Creation: Generating realistic, high-quality prompts that compel the AI model to produce complex, professional-grade deliverables specific to public administration, government policy, and civic operations. Rubric Development: Writing clear, well-structured evaluation rubrics with criteria that are highly specific, non-ambiguous, and easy to score. Benchmark Evaluation Data Generation: Producing clean, reliable training data that directly aids in the evaluation and refinement of AI models handling complex governmental and regulatory tasks. Quality Assurance & Fact-Checking: Ensuring all generated tasks and scoring criteria reflect strict, real-world legislative standards, bureaucratic protocols, and public policy frameworks.

Requirements

  • Demonstrable professional expertise within the public administration, legislative affairs, civic planning, or regulatory compliance sectors, with a deep understanding of government standards, bureaucratic terminology, and public sector deliverables.
  • Strong writing and prompt generation skills, with the ability to design highly realistic, complex civic task scenarios for AI evaluation.
  • Proficiency in rubric generation, specifically the ability to create objective, non-ambiguous scoring criteria that leave no room for subjective interpretation.
  • A meticulous, detail-oriented approach to fact-checking policy documents, legislative frameworks, and civic guidelines to generate reliable data for system benchmarking.

Responsibilities

  • Task & Prompt Creation: Generating realistic, high-quality prompts that compel the AI model to produce complex, professional-grade deliverables specific to public administration, government policy, and civic operations.
  • Rubric Development: Writing clear, well-structured evaluation rubrics with criteria that are highly specific, non-ambiguous, and easy to score.
  • Benchmark Evaluation Data Generation: Producing clean, reliable training data that directly aids in the evaluation and refinement of AI models handling complex governmental and regulatory tasks.
  • Quality Assurance & Fact-Checking: Ensuring all generated tasks and scoring criteria reflect strict, real-world legislative standards, bureaucratic protocols, and public policy frameworks.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service