About The Position

Lineage is a forward-thinking and fast-growing organization that operates an international network of state-of-the-art frozen food warehouse facilities and delivers sophisticated supply chain solutions. We believe our people are the key to our success, whatever their role within the organization. Therefore, we provide the environment for them to excel in everything they do.

A Reliability Engineer (Connect WMS) focuses primarily on investigating, diagnosing, and resolving operational software issues that impact warehouse management performance. This role is critical in maintaining uptime, optimizing system reliability, and ensuring seamless integration between the Warehouse Management System (WMS), upstream and downstream applications (e.g., EDI, LinOS, Link), and warehouse processes executed by humans or automation equipment. The engineer will analyze logs, debug configurations and code, interface with logistics and IT teams, and work across engineering and operations to deliver high-impact solutions quickly and accurately.

Responsibilities

  • Monitor and respond to operational issues affecting WMS functions (e.g., receiving, shipping, inventory).
  • Analyze system logs, error reports, and transaction flows to identify anomalies or failures.
  • Work closely with Level 1 support and warehouse operation teams to understand incident symptoms and timelines.
  • Resolve issues quickly using extended user rights, database interventions, or WMS configuration changes.
  • Debug application code, workflows, customizations, and interfaces to identify bugs or performance bottlenecks.
  • Collaborate with the WMS QA team to reproduce issues in test environments and trace through application workflows to isolate root causes.
  • Collaborate with Product/Development teams to propose, implement, and test code fixes.
  • Use tools like Datadog or internal diagnostics to monitor WMS behavior.
  • Proactively set up or refine alerts for failure patterns (e.g., inventory mismatches, interface timeouts, RF disconnects).
  • Investigate communication failures between the WMS and other products (e.g., LinOS, Link, EDI, Easymetrics).
  • Troubleshoot integration issues between the WMS and external systems (e.g., DevOps, DCOps).
  • Participate in on-call rotations or site support shifts for time-sensitive incidents.
  • Coordinate with operations, IT, and engineering during critical events to ensure fast resolution.
  • Document incidents thoroughly, including root causes, fixes, and follow-up actions.
  • Contribute to postmortem analysis for high-impact incidents.
  • Recommend and implement configuration changes or process improvements to prevent repeated issues.
  • Update or create playbooks and troubleshooting guides for known WMS issues.
  • Develop scripts or queries (e.g., SQL) to streamline log analysis, system diagnostics, or data validation.
  • Propose internal utilities to detect edge-case failures or performance degradations early.
  • Support development of internal test tooling and simulations for recurring business scenarios.
  • Work with Product/Development teams to escalate and fix production bugs.
  • Collaborate with QA teams to validate fixes or reproduce intermittent issues.
  • Partner with implementation teams to train staff on WMS behavior and provide escalation support.

Benefits

  • safe, stable, reliable work environments
  • medical, dental, and basic life and disability insurance benefits
  • 401(k) retirement plan
  • paid time off
  • annual bonus eligibility
  • a minimum of 7 holidays throughout the calendar year