AI Platform Engineer

ERP SuitesLoveland, OH
Remote

About The Position

ERP Suites develops Enterprise AI agents and orchestration solutions hosted on Oracle Cloud Infrastructure (OCI). Our products automate complex finance, supply chain, and operational workflows for enterprise customers. We are seeking an AI Platform Engineer who thrives in a high-ownership environment and is passionate about building, operating, and scaling the infrastructure that powers next-generation AI solutions. The AI Platform Engineer will be responsible for the platform foundation that supports ERP Suites' AI products and customer environments. This role partners closely with the AI Architect and Product Team to deploy, manage, secure, and optimize Oracle Cloud Infrastructure environments, automation pipelines, and AI agent deployment frameworks. The ideal candidate is a hands-on cloud and infrastructure professional with experience in DevOps, AI platform operations, automation, security, observability, and enterprise cloud architecture.

Requirements

  • Bachelor’s degree in computer science, Information Systems, Engineering, or a related field.
  • 2+ years of experience in AI Platform Engineering, Infrastructure Engineering, MLOps, DevOps, or Cloud Engineering.
  • Strong experience with Oracle Cloud Infrastructure (OCI), including:
  • Experience deploying and supporting AI agents, microservices, or cloud-native applications.
  • Experience with monitoring and observability platforms such as Grafana, LangFuse, OCI Logging, and Metrics APIs.
  • Knowledge of TLS, DNS, ACME protocols, Let's Encrypt, and certificate automation.
  • Experience with CI/CD tools, source control, deployment pipelines, and artifact management.
  • Proficiency in Python, SQL, and Bash scripting.
  • Strong technical writing, documentation, and architecture diagramming skills.
  • Excellent communication and collaboration skills.
  • Cloud infrastructure architecture and administration
  • AI platform operations and deployment
  • DevOps and CI/CD automation
  • Monitoring, observability, and FinOps
  • Security architecture and identity management
  • Infrastructure-as-Code and automation
  • Technical troubleshooting and root cause analysis
  • Customer-facing technical consulting
  • Documentation and knowledge transfer

Nice To Haves

  • Oracle Cloud certifications such as OCI Architect Professional or OCI DevOps Professional.
  • Experience supporting multi-tenant SaaS or managed-service environments.
  • Exposure to large language model (LLM) infrastructure and agentic AI frameworks such as LangChain, MCP, or similar technologies.
  • Experience implementing AI observability platforms such as LangFuse, MLflow, or equivalent tools.
  • Familiarity with JD Edwards EnterpriseOne, including CNC, AIS, Orchestrator Studio, or security administration.
  • Experience participating in Oracle Partner Network, Oracle ACE, or similar technical communities.

Responsibilities

  • Monitor OCI service health, logs, dashboards, and alerts.
  • Troubleshoot platform issues and customer environment concerns.
  • Support development teams with infrastructure questions and deployment needs.
  • Manage CI/CD pipeline performance and resolve deployment failures.
  • Execute provisioning, onboarding, and configuration requests.
  • Update documentation and architecture artifacts as infrastructure evolves.
  • Participate in standups, planning meetings, and technical reviews.
  • Review OCI consumption reports, billing dashboards, and cost optimization opportunities.
  • Conduct IAM, security, and credential audits.
  • Evaluate reference architecture environments for configuration drift and required updates.
  • Refine deployment methodologies, runbooks, and onboarding documentation.
  • Assess Oracle OCI roadmap updates and emerging platform capabilities.
  • Contribute technical documentation, architecture guidance, and internal knowledge-sharing content.
  • Provision, configure, and manage Oracle Cloud Infrastructure (OCI) environments, including computer, networking, load balancers, API gateways, IAM, containers, and related services.
  • Manage OCI Functions, Autonomous Database Serverless (ADB-S), and containerized deployment environments.
  • Build, maintain, and optimize OCI DevOps pipelines, artifact repositories, and deployment automation.
  • Support OCI Goldengate planning, configuration, and data replication architectures.
  • Develop automation solutions that improve reliability, scalability, and operational efficiency.
  • Own customer-facing AI agent deployment methodologies, runbooks, environment configurations, and deployment standards.
  • Coordinate customer environment provisioning, compartment creation, IAM setup, and onboarding activities.
  • Manage AI agent environments across development, testing, and production stages.
  • Support development teams through infrastructure reviews, deployment guidance, and technical troubleshooting.
  • Maintain and extend ERP Suites' enterprise reference architectures and deployment frameworks.
  • Build and maintain Grafana dashboards and reporting solutions for operational monitoring and customer billing.
  • Develop ETL processes that aggregate OCI cost and consumption data.
  • Monitor platform health, performance, reliability, and resource utilization.
  • Diagnose and resolve observability gaps before they impact customer environments.
  • Ensure accurate reporting and billing visibility across customer environments.
  • Audit OCI IAM policies, Vault usage, credential management processes, and security controls.
  • Maintain TLS certificate automation using ACME, Let's Encrypt, and OCI Load Balancer integrations.
  • Support secure architecture reviews and infrastructure compliance initiatives.
  • Ensure proper access controls, credential rotation, and security best practices across environments.
  • Create and maintain architecture diagrams, infrastructure maps, deployment workflows, and technical documentation.
  • Document automation scripts, deployment processes, and operational procedures.
  • Participate in technical planning sessions with customers and internal stakeholders.
  • Identify infrastructure risks and recommend scalable solutions.
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service