Reliability Engineer – Automation Engineer

BlackRockWilmington, DE
5d$110,000 - $138,000Hybrid

About The Position

We are looking for a highly skilled and dynamic individual to join our Production Engineering team within the Aladdin Engineering. This role is perfect for someone passionate about technical troubleshooting, optimizing system performance, and developing innovative automation solutions. In addition to strong expertise in secrets management and automation, this role increasingly focuses on applied AI, including AI-assisted operations and Retrieval-Augmented Generation (RAG)–based assistants, to improve reliability, operability, and developer productivity Role Overview: As a Reliability Engineer, you will be responsible for ensuring the performance, scalability, and stability of our platforms through automation, intelligent tooling, and AI-assisted workflows. You will build and operate automation and AI-driven solutions that enhance system reliability, optimize operational efficiency, and improve how engineers interact with platforms—particularly around Vault infrastructure and secrets management. This includes leveraging Python, AI frameworks, and data-driven approaches to analyze system behavior, automate diagnostics, and enable self-service capabilities via AI assistants.

Requirements

  • Bachelor’s degree (or equivalent) in Computer Science, Engineering, Mathematics, or a related field.
  • Strong Python development skills for automation, systems analysis, and integration with AI tooling.
  • Experience with cloud platforms such as AWS and Azure.
  • Hands-on experience with Ansible, Terraform, and configuration-as-code practices.
  • Experience using monitoring and observability tools to track and optimize resource utilization.
  • Foundational understanding of AI/ML concepts, particularly as applied to automation, observability, or developer tooling.

Nice To Haves

  • Experience with Linux-based server environments.
  • Familiarity with Kubernetes and containerized platforms.
  • Exposure to RAG architectures, vector databases, or AI assistant frameworks in production or platform contexts.
  • Prior experience in financial services or large-scale technology environments.
  • Experience designing systems for high availability, scalability, and fault tolerance.
  • Azure certification.
  • Certification or hands-on experience with enterprise secrets management platforms.
  • Experience building or operating AI-powered internal developer platforms or assistants.

Responsibilities

  • Design, build, and operate reliable, scalable platforms using automation, Infrastructure as Code (Terraform, Ansible), CI/CD, and GitOps practices across cloud and containerized environments.
  • Automate operational workflows using Python and scripting, including secure configuration and enterprise secrets management lifecycle (provisioning, rotation, access, and recovery).
  • Develop and integrate AI-assisted operational tooling, including RAG-based assistants, to support incident response, troubleshooting, diagnostics, and engineer self-service.
  • Build and maintain knowledge ingestion and retrieval pipelines (logs, metrics, runbooks, configuration data) to power AI assistants and intelligent automation.
  • Monitor and analyze system health, performance, and capacity using observability tools (e.g., logs, metrics, dashboards), and perform root cause analysis for platform incidents.
  • Participate in on-call rotations, supporting production systems and driving continuous improvement through automation and AI-driven reduction of operational toil.
  • Support disaster recovery planning and execution, application onboarding, upgrades, and change management for platforms relying on centralized secrets and secure configuration services.
  • Ensure all automation and AI solutions are secure, explainable, and production-ready, meeting enterprise and regulated-environment requirements.

Benefits

  • employees are eligible for an annual discretionary bonus, and benefits including healthcare, leave benefits, and retirement benefits.
  • strong retirement plan
  • tuition reimbursement
  • comprehensive healthcare
  • support for working parents
  • Flexible Time Off (FTO)
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service