Trading Technologies International, Inc. - posted about 1 month ago
Full-time • Mid Level
Hybrid • Chicago, IL
501-1,000 employees
Professional, Scientific, and Technical Services

The Site Reliability Engineer (SRE) position is a software development-oriented role, focusing heavily on coding, automation, and ensuring the stability and reliability of our global platform. The ideal candidate will primarily be a skilled software developer capable of participating in on-call rotations. The SRE team develops sophisticated telemetry and automation tools, proactively monitoring platform health and executing automated corrective actions. As guardians of the production environment, the SRE team leverages advanced telemetry to anticipate and mitigate issues, ensuring continuous platform stability.

  • Develop and maintain advanced telemetry and automation tools for monitoring and managing global platform health.
  • Actively participate in on-call rotations, swiftly diagnosing and resolving system issues and escalations from the customer support team (this is not a customer-facing role).
  • Implement automated solutions for incident response, system optimization, and reliability improvement.
  • Provide operational support for backend services and Kafka producers/consumers written in Python running on ECS.
  • Full-Stack Troubleshooting: Support, debug, and enhance the entire application stack, from our React.js frontend to our Python backend services (Flask, Litestar, Celery, ESK, MSK).
  • Must have hands-on professional experience building and/or supporting applications written with React.js, and be able to effectively troubleshoot issues between the frontend UI and backend APIs.
  • Minimum 3 years of experience with Python
  • Solid understanding of functional programming, object-oriented programming, and computer science foundations
  • Good understanding of backend and server-side components
  • Ability to work an on-call support rotation with global team members on a semi-frequent basis
  • Proven and strong communication skills
  • Must be self-directed, flexible and have the ability to prioritize and handle multiple projects simultaneously
  • Experience with Icinga2, Prometheus, or Splunk a plus
  • Experience with AWS a plus
  • Experience working in an Agile environment a plus
  • Pension contributions
  • Enjoy the best of both worlds: the energy and collaboration of in-person work, combined with the convenience and focus of remote days. This is a hybrid position requiring three days of in-office collaboration per week, with the flexibility to work remotely for the remaining two days. Our hybrid model is designed to balance individual flexibility with the benefits of in-person collaboration: enhanced team cohesion, spontaneous innovation, hands-on mentorship opportunities, and a stronger company culture.
  • 25 days of Paid Time Off (PTO) per year, with the option to roll over unused days.
  • One dedicated day per year for volunteering.
  • Two professional development days per year to allow uninterrupted professional development.
  • An additional PTO day added during milestone anniversary years.
  • Generous parental leave for all parents (including adoptive parents).
  • Budget for tech accessories, including monitors, headphones, keyboards, and other office equipment.
  • Milestone anniversary bonuses.
  • Subsidy contributions toward gym memberships and health/wellness initiatives (including discounted healthcare premiums, healthy meal delivery programs, or smoking cessation support).
  • Forward-thinking, culture-based organization with collaborative teams that promote diversity and inclusion.