Engineer Monitoring Systems

Academy Sports + OutdoorsKaty, TX
Onsite

About The Position

At Academy Sports + Outdoors our vision is to be the best sports + outdoors retailer in the country — but what truly sets us apart is our people. We’re a passionate, purpose-driven team that’s as committed to each other as we are to our customers. We’ve spent over 80 years building a culture that puts people first. We believe in creating opportunities for growth, fostering meaningful connections, and supporting every Team Member’s journey. What fuels us? Our belief in the power of fun. Here, you won’t just help customers gear up for their next adventure — you’ll launch one of your own. Whether you're starting out or leveling up, Academy is a place where fun can’t lose! Academy® Sports + Outdoors is one of the nation’s largest sporting goods and outdoor retailers. It’s no surprise that we not only know how to create experiences for our customers, but for our team members as well. Understanding our people and the things that matter to them the most has been at the core of the Academy® culture for over 80 years. With more than 22,000 team members, we take pride in creating a workplace environment that values hard work, commitment, and growth.

Requirements

  • Bachelor’s degree in Computer Science, Information Technology, or a related field, or equivalent practical experience
  • 5+ years of hands-on, enterprise-level experience administering major monitoring platforms (Splunk, Dynatrace, SolarWinds)
  • Working knowledge of operating systems, which include, but are not limited to, Microsoft Windows, RedHat & Ubuntu Linux
  • Working knowledge of databases, networking, virtualization, and server hardware concepts
  • Working knowledge of application technologies, which include, but are not limited to: Java, .NET, IIS, WebSphere, middleware
  • Expert knowledge of Splunk administration with multi-instance cluster deployments and advanced query language (SPL), DBConnect, IT Service Intelligence, and custom Apps
  • Deep hands-on experience with Dynatrace (SaaS or Managed) for Application Performance Monitoring (APM), Real User Monitoring (RUM), and Synthetic Monitoring, Custom service monitoring, custom Notebooks, build Workflows, Grail
  • Proficiency in managing and configuring SolarWinds modules (e.g., NPM, SAM, NCM, DPA) for network and server, database performance, and capacity management
  • Strong understanding of IT infrastructure domains (e.g., cloud networking, virtualization, Windows/Linux OS, database management & security vulnerability management)
  • Scripting proficiency (Python, PowerShell, or Bash) for automation with API & SNMP integration tasks
  • Acceptable level of hearing and vision to perform job duties
  • Adhere to company work hours, policies, procedures, and rules governing professional staff behavior
  • Regular office attendance is required

Nice To Haves

  • Relevant industry certifications (e.g., Splunk Certified Admin, Dynatrace Associate/Professional, Cisco/AWS/GCP certifications) are a strong plus
  • Proven experience in a large-scale retail, e-commerce, or high-transaction environment is highly desirable
  • Experience with offshore teams preferred

Responsibilities

  • Platform Leadership: Serve as the subject matter expert (SME) for our core monitoring platforms, including Splunk, Dynatrace, and SolarWinds
  • Architecture & Design: Design, implement, and maintain the health, performance, and security of monitoring tool infrastructure across cloud and on-premises environments
  • Alerting & Automation: Architect and deploy advanced alerting, correlation, and notification strategies to enable proactive incident response and root cause analysis (RCA)
  • E-commerce & Retail Focus: Develop specialized dashboards and monitoring protocols for our high-traffic e-commerce platform and thousands of retail POS/network devices
  • Integration & Consolidation: Drive initiatives to integrate monitoring tools and consolidate data streams, reducing complexity and increasing operational efficiency
  • Compliance & Capacity: Manage tool licensing, capacity planning, and platform upgrades while ensuring systems adhere to internal security and compliance standards
  • Vulnerability Management: Track potential security vulnerabilities, identify & remediate, adhering to organizational security standards
  • Documentation & Training: Create and maintain comprehensive platform documentation, runbooks, and provide advanced training and support to application and operations teams
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service