Site Reliability Engineer - Digital Enablement

TargetBrooklyn Park, MN
5dHybrid

About The Position

As a Site Reliability Engineer within Digital Enablement, you specialize in building and supporting the platforms and tools that enable teams to deliver reliable, scalable digital experiences. You translate architectural concepts into practical, resilient solutions and apply strong engineering patterns to improve system reliability and operational efficiency. You influence design and implementation decisions, provide technical guidance to partner teams, and help diagnose and resolve complex issues in distributed systems. You leverage automation and best practices to prevent repeat problems and strengthen the overall stability of Target’s digital ecosystem. About this Team: The Digital Enablement team builds and supports the foundational platforms, tools, and reliability frameworks that empower Target’s digital product teams to deliver high-quality guest experiences at scale. We focus on improving engineering efficiency, operational excellence, and system reliability across the digital ecosystem. Our team partners closely with engineers across the enterprise to provide modern deployment tooling, observability frameworks, and best-in-class reliability practices that enable teams to innovate quickly while maintaining a highly resilient production environment. We work at the intersection of software engineering, cloud infrastructure, and site reliability driving standards, automation, and proactive problem-solving to ensure Target’s digital platforms remain fast, stable, and scalable. Use your skills, experience, and talents to be a part of groundbreaking thinking and visionary goals. As a Site Reliability Engineer, you’ll take the lead as you…

Requirements

  • BS degree in computer science or related technical field, or equivalent experience
  • 2+ years of software engineering experience, including operating applications at scale
  • 1+ years of experience with common CI/CD tools and modern deployment practices
  • 1+ years of experience working with major cloud platforms (AWS, GCP, Azure)
  • Hands-on experience building, debugging, deploying, and scaling applications—especially those built with Node.js, React, or Next.js
  • Experience developing software solutions using Bash, Node.js, and Golang
  • Experience containerizing applications and running them on orchestration platforms such as Kubernetes and Docker
  • Experience with observability tools and practices, including log aggregation, metrics, and distributed tracing
  • Basic understanding of caching, traffic routing, and edge reliability/disaster mitigation strategies
  • Basic proficiency with version control systems such as Git
  • Strong verbal and written communication skills demonstrating technical leadership
  • Strong analytical, debugging, and troubleshooting skills

Nice To Haves

  • Deep knowledge and hands-on experience with Kubernetes and GitHub Actions
  • Knowledge and experience with caching and traffic-routing technologies, including but not limited to Fastly
  • Experience conducting advanced metric analysis, including anomaly detection
  • Familiarity with designing infrastructure to support scalable web e-commerce systems
  • Experience building systems that use modern progressive deployment patterns (e.g., canary releases, red/black deployments, automated canary analysis)
  • Familiarity with standard security practices, including encryption, certificate management, and key management
  • Awareness of new and emerging technologies and trends in reliability engineering, observability, and cloud platforms

Responsibilities

  • Use your technology acumen to apply and maintain knowledge of current and emerging technologies within specialized area(s) of the technology domain.
  • Evaluate new technologies and participate in decision-making, accounting for several factors such as viability within Target’s technical environment, maintainability, and cost of ownership.
  • Initiate and execute research and proof-of-concept activities for new technologies.
  • Manage total product, financials, and forecasting.
  • Lead the design, lifecycle management, and total cost of ownership of services.
  • Lead and conduct code review, design review, testing, and debugging activities at the application level.
  • Lead functional design and architecture discussions with understanding process flows and system diagrams to enable design decisions.
  • Participate in routine and non-routine construction, automation, and implementation activities, ensuring successful implementation with architectural and operational requirements and best practices met.
  • Provide technical oversight and coach others to resolve complex and severe technical issues.
  • Lead disaster recovery activities and contribute to disaster recovery planning.
  • Embed data quality protocols throughout data acquisition, processing, storage, and operational use.

Benefits

  • Target offers eligible team members and their dependents comprehensive health benefits and programs, which may include medical, vision, dental, life insurance and more, to help you and your family take care of your whole selves.
  • Other benefits for eligible team members include 401(k), employee discount, short term disability, long term disability, paid sick leave, paid national holidays, and paid vacation.
  • Find competitive benefits from financial and education to well-being and beyond at https://corporate.target.com/careers/benefits.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service