Senior Site Reliability Engineer

Tulip InterfacesSomerville, MA
14d$150,000 - $185,000Hybrid

About The Position

Tulip , the leader in frontline operations, is helping companies around the world equip their workforce with connected apps, leading to higher quality work, improved efficiency, and end-to-end traceability across operations. Companies of all sizes and across industries have implemented composable solutions with Tulip’s cloud-native, no-code platform to solve some of the most pressing challenges in operations: error-proofing processes and boosting productivity, capturing and analyzing real-time data, and continuous improvement. A spinoff out of MIT, Tulip is headquartered in Somerville, MA, with offices in Germany and Hungary. Focused on composable, human-centric solutions for industrial environments, Tulip is disrupting the MES category and has been recognized as a World Economic Forum Global Innovator. Tulip has also been named one of Energage’s Top Workplaces USA and one of Built In Boston’s “Best Places to Work” and “Best Midsize Places to Work” for 2024. About You: You have experience building and maintaining stable infrastructure at scale. You can reason about systems — their edge cases, failure modes, and life cycles. You’re excited about setting the technical agenda and coming up with novel, broad ideas. You can debug complex issues across the entire stack. You’re opinionated about the tools and frameworks that work best. You enjoy building for other engineers equally, if not more, than building for a customer. You know what a good SLA looks like, and can teach others how to spot one.

Requirements

  • You have 5+ years of experience working with open source Observability tools (e.g. LGTM stack)
  • You have hands-on experience instrumenting distributed systems using OpenTelemetry and managing metrics pipelines with Prometheus at scale.
  • You have experience working with time-series data, ideally using promQL
  • You can pick up new languages/frameworks with ease. We currently run Go and Typescript services on Kubernetes.
  • You can communicate as well as you can code. You understand the value of discussion and work best in a team that champions clear and frequent communication.

Responsibilities

  • Mentor and evangelize on observability best practices, SLIs/SLOs, and reliability culture across engineering teams.
  • Help architect our systems for growth and scale.
  • Implement internal tools to automate common developer tasks.
  • Perform incident response and debug production issues across the entire stack.
  • Design, build, and maintain the core infrastructure used by all of Tulip’s engineering teams.
  • Work to automate detection and resolution of recurring issues.

Benefits

  • Direct impact on product and culture
  • Company equity
  • Competitive benefits package including Health, Dental, Vision, Short-term Disability, Long-term Disability, Life Insurance, AD&D Insurance, Flexible Spending Account (FSA), Commuter Benefits, Parental Leave, and 401(K)
  • Flexible work schedule and unlimited vacation policy
  • Virtual company events and happy hours
  • Fitness subsidies
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service