LLM Security Evaluation Expert

SilverEdge, Columbia, MD

About The Position

SilverEdge Government Solutions is seeking a highly skilled LLM Security Evaluation Expert to join our team. In this role, you will be responsible for rigorously testing the security and integrity of Large Language Models (LLMs). Your primary focus will be on designing and executing sophisticated adversarial prompt attacks to identify potential vulnerabilities, assess the model's resistance to exploitation, and ensure it maintains consistent, secure behavior. This is a critical role in safeguarding our AI systems and ensuring they operate responsibly.

Requirements

  • TS/SCI clearance with polygraph
  • Strong knowledge of how LLMs work, including their architecture, training processes, capabilities, and inherent limitations.
  • Familiarity with prominent LLM families (e.g., GPT series, Claude, Llama, PaLM) and their common characteristics.
  • Proven experience in crafting and refining prompts to elicit specific behaviors or bypass restrictions in LLMs.
  • Demonstrable understanding of techniques like jailbreaking, prompt injection, role-playing attacks, and exploiting model biases.
  • Strong understanding of cybersecurity principles and common attack vectors, particularly as they apply to AI/ML systems.
  • Ability to think like an attacker and anticipate potential exploits.
  • Excellent ability to analyze complex systems, identify subtle vulnerabilities, and systematically test hypotheses.
  • Clear and concise written and verbal communication skills, with the ability to document technical findings thoroughly.
  • Understanding of the ethical implications of AI security and commitment to responsible testing practices.

Responsibilities

  • Rigorously test the security and integrity of Large Language Models (LLMs)
  • Design and execute sophisticated adversarial prompt attacks to identify potential vulnerabilities
  • Assess each model's resistance to exploitation
  • Ensure each model maintains consistent, secure behavior