IT Services Manager

DOOTA INDUSTRIAL AMERICA LLCAustin, TX
Onsite

About The Position

This role is responsible for daily monitoring and support of GCS, PCS, and related systems to ensure high availability, data integrity, and life-safety–critical reliability. The position serves as the first-line responder for alarms and incidents, manages tickets and system changes, supports PCS drivers and industrial communications, and follows formal change management procedures. The role also focuses on preventive maintenance, continuous improvement, documentation, reporting, and cross-functional coordination to minimize risk, meet SLAs, and maintain stable operations. Join Doota Industrial America and be part of a thriving company that values safety, professional growth, and collaboration. Embark on an exciting career journey and make a tangible impact on the lives of those around you.

Requirements

  • Strong operational experience with monitoring and control systems.
  • Hands-on experience with relational databases, with a strong emphasis on Oracle and/or MS SQL.
  • Experience writing, analyzing, and troubleshooting SQL queries.
  • Understanding of database schemas, table relationships, and data flows.
  • Experience investigating system issues through data analysis, queries, and historical records.
  • Operational experience in Linux-based environments (basic shell usage and log inspection).
  • Experience analyzing logs, alarms, trends, and system metrics to identify root causes.
  • Experience supporting production systems where database integrity and availability are critical.
  • Ability to quickly understand existing database structures and system data models.
  • Ability to trace how data moves between systems, services, and monitoring components.
  • Ability to assess the operational impact of data anomalies, missing records, or performance degradation.
  • Familiarity with application-level technologies such as .NET, C#, Java, or similar languages (for reading logs or understanding system behavior).
  • 3–10+ years of experience in operations, maintenance, or systems engineering within Semiconductor manufacturing, Industrial automation, or Mission-critical or life-safety systems.
  • Demonstrated ability to operate under defined SLAs and escalation models.
  • Strong analytical, troubleshooting, and documentation skills.
  • Ability to work independently while coordinating across multiple cross-functional teams.
  • Clear communication skills for incident reporting, governance reviews, and client interaction.
  • Applicants must be authorized to work in the U.S. without the need for employment-based visa sponsorship now or in the future.

Nice To Haves

  • Basic understanding of UI or scripting technologies (e.g., JavaScript) is a plus, but not required.

Responsibilities

  • Perform daily health checks and operational monitoring of GCS 1.0 / GCS 2.0, iEES / FMCS interfaces, iQMS infrastructure, and iPCS monitoring and PCS drivers.
  • Monitor system performance across Buffer, Web, Application, IO, and Database servers.
  • Review logs, alarms, events, barcode history, and interface transmission status to detect abnormalities.
  • Ensure uninterrupted data integrity and system availability (target uptime = 99.99%).
  • Act as first-line responder for GCS / PCS alarms, communication failures, and abnormal system behavior.
  • Perform rapid troubleshooting and recovery actions for equipment communication errors, PCS driver failures, and interface disruptions (iEES, FMCS, Infra-EES).
  • Support emergency response activities during off-shift, night shift, and maintenance windows.
  • Participate in incident investigations, including root cause analysis (RCA), 8D report creation and follow-up, and countermeasure development and deployment.
  • Manage and respond to support tickets and VOCs involving tag/parameter add, change, delete; equipment add/change/removal; IO mapping updates; UI, buffer, AP, DB, barcode, and interface issues.
  • Validate requests, define corrective actions, execute standard activities, and escalate non-standard items per governance.
  • Maintain accurate documentation and activity records in designated management systems.
  • Perform daily inspections and maintenance of PCS drivers (~300+ assets).
  • Manage PCS driver configuration, firmware updates, IP and communication settings.
  • Troubleshoot industrial communication protocols including RS232 / RS485 / Ethernet, Modbus, Omron, OPC.
  • Coordinate with PCS, Network, and equipment teams to resolve persistent or systemic issues.
  • Execute change point activities in compliance with governance-defined MOC procedures.
  • Support pre- and post-change monitoring (Day 1 / Day 3 / Day 7 verification).
  • Prepare and review release documentation, checklists, and impact assessments.
  • Validate system behavior following hardware, software, or infrastructure upgrades.
  • Proactively identify weak points, recurring issues, and performance degradation trends.
  • Support system standardization initiatives (e.g., GCS 2.0 standard compliance).
  • Develop and update SOPs, operational runbooks, and standardized checklists.
  • Implement automation and monitoring enhancements where applicable.
  • Prepare and submit required reports, including daily and weekly operational status reports, off-shift support summaries, changepoint summaries with evidence, and KPI performance tracking.
  • Participate in weekly operations meetings and quarterly business reviews.
  • Coordinate closely with IT Strategy & Mobile / Information Management teams, Facilities Operations, and Network, ECA, GCS, PCS, and equipment engineering teams.
  • Escalate system risks, SLA impacts, and out-of-scope requests as required.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Education Level

No Education Listed

Number of Employees

1-10 employees

© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service