AI Engineer

PepsiCoTown/Village of Harrison, NY

About The Position

As an AI Engineer specializing in Agentic AI enablement, you will participate in the design and delivery of production-grade agent capabilities built on the enterprise AI Backbone across cloud and edge environments – across supply-chain and global functions. You will be responsible for end-to-end delivery of key agent modules and integration patterns (MCP/tooling), establish strong evaluation and regression discipline, and drive adoption by partnering with transformation teams, BU, platform engineering, and enterprise application owners. You serve as a technical engine for the workstream—translating business workflows into measurable agent outcomes, working to mitigate identified risks, evaluating/experimenting with options/tradeoffs, and working to scale solutions across domains.

Requirements

  • Bachelor’s in CS/AI/ML or equivalent experience required
  • 6-8 year experience in Software life cycle
  • Expertise in ML (structured and unstructured data) development and engineering
  • Proven experience shipping LLM/agent solutions to production with measurable quality and operational practices.
  • Advanced Software Engineering: Python (and Java) mastery with distributed systems expertise; performance optimization (profiling, parallelization); architecture patterns (e.g., FastAPI, asyncio, Pydantic)
  • LLM & Agent Systems: Multi-agent orchestration (LangChain, LangGraph, CrewAI); advanced prompt engineering; custom agent memory architectures; model optimization techniques
  • Evaluation Framework Development: Statistical evaluation design (confidence intervals, power analysis); benchmark creation; instrumentation frameworks (e.g., MLflow, Arise); regression testing systems
  • ML Operations: Production deployment pipelines (Docker, Kubernetes, Ray); model registry management; scaled inference optimization; GPU utilization optimization
  • Enterprise Integration: Enterprise connector development; scalable API architectures; data pipeline engineering (Kafka, gRPC, Redis); authorization protocol implementation
  • Observability Engineering: Telemetry system design (Prometheus, OpenTelemetry); automated anomaly detection; distributed tracing; performance dashboarding (Grafana)
  • System Architecture: Microservice design patterns; high-throughput event processing; fault-tolerance implementation; horizontal scaling architectures
  • Technical Leadership: Architecture governance systems; engineering standards development; build-vs-buy evaluation frameworks; technical roadmap creation
  • Ownership: drives outcomes end-to-end for a workstream area (not just tasks)
  • Collaboration & customer focus: influences stakeholders to deliver workflow value and adoption
  • Communication & adaptability: providing clarity on progress, risks, and evaluation evidence to business, technical and PMO stakeholders
  • Proactiveness & initiative anticipates constraints, proposes options/tradeoffs early
  • Strategic thinking: contributes to roadmap sequencing and reusable patterns across domains
  • Demonstrates proven history of creating solutions with order-of-magnitude improvements over standard approaches
  • Possesses rare combination of deep technical expertise and business understanding
  • Creates solutions that scale beyond their direct involvement (leveraged impact)
  • Consistently elevates the performance of teams and individuals around them
  • Identifies and solves problems others haven't recognized yet
  • Maintains extraordinary productivity while ensuring knowledge transfer
  • Balances technical perfectionism with pragmatic business value
  • Communicates complex technical concepts effectively to both technical and non-technical stakeholders

Nice To Haves

  • Master’s preferred
  • Full-stack dev experience on modern stack
  • Modelling User Interactions with AI Systems; Modeling multi-agent behaviour loops with tools like Temporal
  • Agentic memory Patterns and usage with tools like MEM0 and Temporal
  • Experience with Agentic RAG; Domain level Semantic Layer Designs with Graph and Vector DBs

Responsibilities

  • Lead design and productionization of high-leverage agent modules and reusable patterns (tool-use orchestration, policies/guardrails, memory, RAG where it adds measurable value), built as composable components and reference implementations.
  • Translate ambiguous product/problem statements into concrete agent behaviors and system designs: state models, failure modes, tool contracts, latency budgets, and acceptance criteria that engineering + product can execute against.
  • Deliver quickly without sacrificing quality: create thin vertical slices, iterate with evidence, and converge on robust behavior under real-world constraints.
  • Drive meaningful performance gains via systematic optimization: latency, token efficiency, tool-call success, retrieval quality, and cost per successful task, including remediation of long-tail failure modes.
  • Proactively identify platformizable opportunities: refactor one-off implementations into shared frameworks/SDKs that reduce build time for others.
  • Define and implement evaluation strategies for assigned workflows: golden sets, scenario coverage maps, regression suites, online/offline metrics, and release gating thresholds aligned to real business outcomes.
  • Build repeatable evaluation systems (templates, labeling guidance, dataset/versioning conventions, dashboards/reports) so evaluation becomes a productized capability, not ad hoc testing.
  • Implement robust automated testing across layers: unit tests for prompt/tool wrappers, contract tests for tool schemas, integration tests for toolchains, and agent simulation tests for multi-step flows.
  • Lead root-cause analysis of quality failures (hallucinations, tool misuse, retrieval misses, routing errors): isolate causes (prompt/tool/data/model), implement corrective actions, and prevent regressions.
  • Champion evidence-first iteration: decisions and releases are backed by eval results, not gut feel.
  • Contribute to router design and task-to-model mapping through routing rules/classifiers, prompt strategies, and model selection policies; validate decisions using evaluation data and runtime telemetry.
  • Propose and implement routing improvements when constraints change (pricing, latency, throughput, new model capabilities), with governance-aware rollouts and rollback plans.
  • Identify and mitigate routing failure modes (over-escalation to expensive models, under-routing causing quality loss, brittle heuristics) and improve robustness using lightweight ML or rules where appropriate.
  • Lead implementation of MCP connectors/clients for enterprise apps and internal data products with strong engineering hygiene: schema/versioning discipline, typed contracts, scopes/permissions, auditability, and integration test strategy.
  • Build reusable integration patterns: standardized tool metadata, error normalization, retries/timeouts, idempotency, pagination handling, and consistent auth patterns to accelerate onboarding of new tools.
  • Collaborate with security/data owners to ensure secure-by-design tool access (least privilege, logging, PII handling, policy enforcement).
  • Ensure production readiness for owned components: telemetry coverage, structured logging, traceability for tool calls, SLIs/SLO alignment (latency, success rate, cost), and participation in incident response and postmortems.
  • Proactively identify delivery risks (dependencies, rate limits, data quality, security scopes, vendor constraints) and drive resolution with clear tradeoffs and recommendations.
  • Mentor peers through technical leadership: raise code quality, share patterns, review PRs for correctness/performance/security, and contribute to internal playbooks.

Benefits

  • Bonus based on performance and eligibility target payout is 10% of annual salary paid out annually.
  • Paid time off subject to eligibility, including paid parental leave, vacation, sick, and bereavement.
  • Medical, Dental, Vision, Disability, Health, and Dependent Care Reimbursement Accounts, Employee Assistance Program (EAP), Insurance (Accident, Group Legal, Life), Defined Contribution Retirement Plan.
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service