Software: Operations & Reliability Lead

Truenorth Corporation, Guaynabo, PR

About The Position

We’re looking for an experienced Operations & Reliability Lead to strengthen our monitoring, security, automation, and cloud operations. This role drives reliability, resilience, and a security‑first posture across all systems and environments.

Requirements

  • Strong experience with monitoring tools (New Relic, Datadog, Prometheus, Azure Monitor, etc.).
  • Hands‑on expertise with cloud platforms, IaC, CI/CD, and configuration management.
  • Solid understanding of security frameworks, threat detection, and compliance.
  • Experience with backup/DR strategies and resilience best practices.
  • Strong troubleshooting, documentation, and cross‑team collaboration skills.
  • Bachelor's degree in Computer Science or related field.
  • At least 2 years of experience working with systems.

Nice To Haves

  • Cloud or security certifications (Azure/AWS Architect, Security+, CISSP, ITIL, SRE).
  • Experience with AI‑Ops platforms or ML‑based operational tooling.
  • Background in regulated industries.

Responsibilities

  • Build and maintain application and infrastructure monitoring, dashboards, and automated alerts.
  • Implement cloud and on‑premises resource provisioning and enforce standardized configuration baselines.
  • Manage backup, recovery, and resilience workflows with regular testing cycles.
  • Conduct AI‑assisted performance testing, security audits, and penetration testing.
  • Coordinate with NOC and SOC to support continuous monitoring and threat detection.
  • Lead incident response, root‑cause analysis, and operational readiness activities.
  • Implement cost optimization and resource governance across cloud environments.
  • Automate operational tasks and integrate AI‑Ops capabilities.