Staff Site Reliability Engineer

Dat SolutionsBeaverton, OR
258d$155,000Hybrid

About The Position

DAT is an award-winning employer of choice and a next-generation SaaS technology company that has been at the leading edge of innovation in transportation supply chain logistics for 45 years. We continue to transform the industry year over year, by deploying a suite of software solutions to millions of customers every day - customers who depend on DAT for the most relevant data and most accurate insights to help them make smarter business decisions and run their companies more profitably. We operate the largest marketplace of its kind in North America, with 400 million freights posted in 2022, and a database of $150 billion of annual global shipment market transaction data. Our headquarters are in Denver, CO, with additional offices in Missouri, Oregon, and Bangalore, India. For additional information, see www.DAT.com/company.

Requirements

  • Strong leadership and mentoring abilities, especially with SRE or Platform Engineering/Infrastructure teams.
  • Total of 10+ years industry experience.
  • 3+ years of software engineering experience (JavaScript, Python, Go, Java/Kotlin, C++, etc).
  • Extensive experience with modern observability tools (Datadog preferred).
  • Extensive experience with cloud platforms (preferably AWS).
  • Demonstrated success in leading large technical initiatives, including design, project management and gaining executive buy-in.
  • Proven experience modernizing legacy code and infrastructure.
  • Ability to work closely with peer teams, platform/software architects and management to drive key reliability improvements.
  • Deep understanding of cloud infrastructure, automation, and best practices for reliability.
  • Experience with our tools (Kubernetes, ArgoCD, Terraform, Github Actions) a plus.

Responsibilities

  • Collaborate with platform architects and management to ensure reliability targets are met.
  • Advise engineering teams on best practices for measuring reliability and uptime.
  • Assist and respond to critical engineering incidents.
  • Lead and mentor SRE engineers to improve their engineering skills.
  • Provide technical guidance and best practices for use of cloud infrastructure and tooling.
  • Be a driver for Infrastructure-as-Code within the platform.
  • Spearhead major reliability-focused initiatives and projects.
  • Help optimize our work to be customer-focused.
  • Migrate legacy systems to modern, scalable cloud environments.
  • Help develop and drive a culture of continuous improvement with the Platform Engineering and Software Engineering groups.
  • Participate in an on-call rotation and occasionally act as Incident Commander.

Benefits

  • Medical, Dental, Vision, Life, and AD&D insurance
  • Parental Leave
  • Up to 20 days of paid time off starting in year one
  • An additional 10 holidays of paid time off per calendar year
  • 401k matching (immediately vested)
  • Employee Stock Purchase Plan
  • Short- and Long-term disability sick leave
  • Flexible Spending Accounts
  • Health Savings Accounts
  • Tuition Reimbursement Program
  • Employee Assistance Program
  • Additional programs - Employee Referral, Internal Recognition, and Wellness
  • Free TriMet transit pass (Beaverton Office)
  • Competitive salary and benefits package
  • Work on impactful projects in a cutting-edge environment
  • Collaborative and supportive team culture
  • Opportunity to make a real difference in the trucking industry
  • Employee Resource Groups
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service