AI Production Support Engineer

TradewebJersey City, NJ
5h$175,000 - $210,000Hybrid

About The Position

This role is a senior Production Support position with a strong focus on applying Artificial Intelligence to improve operational efficiency, incident response, and root cause analysis within the Credit trading platform. The primary goal of the role is to understand existing production issues, investigation patterns, and support workflows, and to design and build AI-powered tools and solutions that reduce mean time to detection (MTTD), mean time to resolution (MTTR), and overall operational toil. The role sits at the intersection of production support, software engineering, and AI enablement, partnering closely with Development, QA, and Business teams to modernize how production issues are identified, analyzed, and resolved. Tradeweb Technology jobs are fully remote. The Tradeweb Technology hub is located in our Jersey City office which can be used for team meetings and collaboration efforts. There may be days where travel to the Jersey City office is recommended for organizational off-sites.

Requirements

  • Bachelor’s degree in Computer Science, Engineering, or equivalent experience.
  • 10+ years of overall IT experience, with significant experience in Production Support or Site Reliability roles or DevOps roles.
  • Strong hands-on experience supporting large-scale, highly available financial or trading systems with complex architecture, distributed systems and system troubleshooting.
  • Solid programming background with experience in one or more of: C++
  • Python
  • Node.js
  • Scripting languages (Shell, Bash, Perl)
  • Experience working with: Logging and monitoring tools (e.g. Coralogix, Grafana)
  • Containers and orchestration (Docker, Portainer)
  • Messaging/streaming platforms (e.g., Kafka)
  • Databases (relational and non-relational)
  • Excellent communication skills and ability to work with technical and non-technical stakeholders.
  • Hands-on experience using AI coding and assistant tools to build/enhance software solutions.
  • Practical experience applying AI to: Code analysis
  • Log analysis
  • Automation
  • Workflow optimization
  • Building MCP servers & AI Agents
  • Familiarity with modern AI tooling ecosystems; preferred tools include: OpenAI
  • Claude
  • Cursor
  • Ability to evaluate AI-generated outputs critically and apply them safely in production environments.
  • Experience building small internal tools, scripts or services that improve operational productivity.

Nice To Haves

  • Understanding of source control systems (e.g., Git) and collaborative development workflows.
  • Familiarity with ticket management systems such as ServiceNow and Jira, especially for analyzing historical incidents.
  • Networking knowledge, including TCP/UDP, multicast, and packet analysis tools (e.g., Wireshark).
  • Experience operating in regulated or security-conscious environments.
  • Fixed income or bond trading domain knowledge.
  • Exposure to AI enablement, developer productivity, or platform engineering roles.

Responsibilities

  • Analyze historical production incidents and ticket data to identify recurring patterns, investigation paths, and bottlenecks.
  • Design and build AI-assisted tools to: Accelerate root cause identification
  • Summarize logs, alerts, and metrics
  • Suggest likely failure domains or components
  • Assist with incident triage and prioritization
  • Partner with Production Support engineers to embed AI into day-to-day workflows, not as standalone experiments.
  • Develop internal tools, scripts, or lightweight services that leverage AI models to improve support efficiency.
  • Apply AI coding assistants to rapidly prototype, iterate, and productionize operational tooling.
  • Document AI-driven workflows, playbooks, and best practices for use by the wider support organization.
  • Measure and track impact of AI adoption (reduction in MTTR, investigation time, manual effort).
  • Provide extremely high levels of availability and stability for production, demo, and test environments supporting Credit trading.
  • Perform deep dives into application logs, metrics, and codebases to understand system behavior and failure modes.
  • Support monitoring, alerting, and observability platforms (e.g., logs, dashboards, alerts).
  • Work with development team and AI teams, to partner in building out new AI related features in AI.

Benefits

  • Health Insurance: Highly competitive medical, dental, and vision programs
  • Hybrid Environment: Our employees have the flexibility of working in the office and from home.
  • Health Care and Dependent Care Flexible Spending Accounts: You may elect to set aside pre-tax earnings to pay for eligible health care and dependent day care expenses for you and your eligible family members.
  • Maven Family Building Benefit: Maven offers support for fertility and preconception; pregnancy and post-partum; adoption; surrogacy and pediatrics for children up to age 10. Tradeweb provide a $10,000 lifetime reimbursement towards fertility, egg freezing, adoption and surrogacy expenses.
  • Building Wealth - 401(k) Savings Plan: Employees are immediately eligible for the 401(k) plan. Participants may contribute up to 75% of eligible compensation into a traditional 401(k) and/or Roth 401(k). Tradeweb will match 100% of the first 4% of compensation that you contribute.
  • Pre-Tax Commuter Benefits Program
  • ARAG Legal Services
  • Employee Assistance Program
  • Tuition Reimbursement
  • Financial Wellness Tools
  • Travel Assistance Benefits
  • Pet Insurance
  • Corporate Gym Subsidies
  • Wellness Perks
  • Paid Time Off and Parental Leave
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service