AI Ops Team Lead

Ocean InfinityPorto, VA
1d

About The Position

Ocean Infinity is seeking an experienced AI Ops Team Leader who combines deep technical expertise in ML infrastructure and data engineering with strong leadership capabilities. The ideal candidate is passionate about building scalable platforms that empower AI teams to develop, deploy, and monitor services efficiently. They have hands-on experience designing and implementing robust MLOps and data pipelines, and are driven by the challenge of enabling high-performance AI development through automation, observability, and architectural excellence. In addition, the ideal candidate is a collaborative leader who can translate technical needs into strategic platform capabilities, ensuring alignment with business goals and future-state architecture. They are skilled at mentoring engineers, fostering cross-team collaboration, and cultivating a culture of reliability and innovation. This role is pivotal in shaping the foundation of Ocean Infinity’s AI capabilities and accelerating the delivery of intelligent solutions across the organization.

Requirements

  • Proven experience in leading technical teams, ideally in AI/ML or data engineering domains.
  • Strong background in ML infrastructure, MLOps, and scalable data systems.
  • Proficient in Python, CI/CD pipelines, containerization (Docker), and orchestration tools (Airflow, Terraform).
  • Familiar with ML frameworks (TensorFlow, PyTorch), data orchestration tools, and cloud platforms (Azure preferred).
  • Experience with observability tools and event-driven architectures.
  • Excellent communication and collaboration skills.
  • Passionate about building platforms that empower others and drive innovation.

Nice To Haves

  • Experience with Kubernetes (AKS), LangChain, Nvidia Triton Server.
  • Familiarity with scalable data versioning tools (e.g., lakeFS).
  • Prior experience in mentoring and developing leadership talent.

Responsibilities

  • Lead and mentor a cross-functional team of ML and Data Engineers.
  • Design and implement the AI Ops Platform architecture, including infrastructure, tooling, and data pipelines.
  • Ensure scalability and reliability of the platform to support high-throughput AI workloads and LLM applications.
  • Collaborate with stream-aligned teams to understand their needs and deliver platform capabilities that accelerate development and deployment.
  • Drive automation in deployment, data processing, and monitoring workflows.
  • Establish best practices in MLOps, data engineering, and infrastructure-as-code.
  • Promote observability by implementing monitoring systems (e.g., Prometheus, Grafana, OpenTelemetry).
  • Facilitate team planning, goal setting, and execution within timelines.
  • Recruit and develop talent, fostering a culture of learning and collaboration.
  • Coordinate other departments to ensure resource availability and compliance.

Benefits

  • At Ocean Infinity, we believe in creating equal opportunities for all, celebrating each and everyone’s differences. We are driven by transforming the industry, through our technology, thoughts, behaviours and actions. Being inclusive and respectful to all is fundamental to who we are. It is the right thing to do and enables innovation and creativity to thrive. There is more work to be done, and we know that we aren’t perfect, but our commitment to these values is unwavering. They are central to our mission and the impact we have on the industry, meaning, we cannot live without them. Simply put, our mission is to use innovative technology, to transform operations at sea, to enable people and the planet to thrive.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service