Senior Site Reliability Engineer

iHeartMediaAtlanta, GA
5d

About The Position

iHeartMedia, the number one audio company in America , reaches 90% of Americans every month -- a monthly audience that’s twice the size of any other audio company – almost three times the size of the largest TV network – and almost 4 times the size of the largest ad-supported music streaming service. The Senior Site Reliability Engineer will be responsible for leading a talented team of SREs/DevOps Engineers across a wide variety of Cloud Services. This person will be our leader as we move toward a platform / systems architecture and infrastructure that is highly automated, fully instrumented, self-scaling, self-healing and loosely coupled. Must be a go-getter with efficient multi-tasking abilities along with efficient people management skills.

Requirements

  • 6+ years of hands-on experience in public cloud specifically AWS.
  • 3+ years of leading SRE/DevOps teams across complex AWS ecosystems.
  • Deep understanding of high velocity SDLC best practices along with CI/CD & Application/infrastructure Monitoring practices to operate workloads at high scale.
  • Expert proficiency in Kubernetes, Terraform, AWS CDK, Lambda, API Gateway, Route53, S3, EC2, Load Balancing, DynamoDB, CloudWatch, IAM, Networking, IOT, SQS, Event Bridge, etc.
  • Adept at solving & troubleshooting High volume Distributed architecture applications running on AWS.
  • Demonstrated ability to design, build, and maintain AWS infrastructure using AWS CDK (TypeScript preferred) with strong modular patterns (multi-stack, multi-account, multi-region).
  • Strong understanding of GitOps methodologies, experience in implementing and managing multiple environments through declarative configuration management versioned in Git repos and applied via automated tools like Flux or ArgoCD.
  • Hands-on experience managing large-scale, production EKS clusters across multiple regions and AWS accounts.
  • Deep knowledge of AWS Cost optimization techniques such as Reserved Instances, Spot Instances, and Life Cycle Management.
  • Proven ability to build highly secure AWS Infrastructure with a security first mindset.
  • Proven ability to collaborate and build strong relationships with development teams including Conflict Resolutions & driving decisions/initiatives.
  • Strong software development background including knowledge of microservices architecture along with fluency in JavaScript, TypeScript, or Node.JS or Python.
  • At least one among the following AWS Certifications: AWS Solution Architect Associate AWS Solution Architect Professional AWS DevOps Associate AWS DevOps Professional Professional Kubernetes Certifications

Nice To Haves

  • Respect for others and a strong belief that others should do this in return
  • Expertise with various technical disciplines and applications
  • Close attention to detail and quality orientation
  • Ability to multitask on a variety of critical projects
  • Ability to work independently, while also collaborating with others
  • Strong communication skills, particularly when explaining complex technical information
  • Ability to provide solutions to problems in situations that are atypical/infrequent
  • Analytical thinking and the ability to identify patterns
  • Efficiency with own work and impact of team results
  • Informal leadership capabilities with an interest in mentoring less experienced team members

Responsibilities

  • Standardize and modernize Amazon EKS platforms & AWS Serverless Suites, including all Cutting-Edge Managed Services from AWS adhering to DevOps best practices.
  • Provide expertise and hands on implementation of large-scale, mission critical Kubernetes workloads with High Resiliency and multi-region architecture.
  • Work collaboratively with 2 to 5 Site Reliability Engineers.
  • Champion accountability; take responsibility through actions & thoughts.
  • Design and implement end-end CI/CD pipelines with CDK and CodePipeline, including integrating with source control, build tools and deployment targets like CFT stacks.
  • Prioritize & re-align quickly to adapt to a demanding fast paced Shift Left environment.
  • Maximize automation to improve speed and quality while relentlessly driving low-value, repetitive work out of our operational activities.
  • Work with our application delivery teams to design and build scalable and maintainable solutions for our customers.
  • Enforce GitOps workflow where Git is the source of truth for EKS clusters and app state in a multi-account and multi-region environment (FluxCD/ArgoCD).
  • Develop baselines for governance, consumption/cost and performance to ensure that our elastic cloud-based applications operate efficiently, securely and with zero down time.
  • Run Reliability Incident management processes along with Root Cause Analysis, developing Runbooks, & Self-Healing architecture.
  • Instill Standardization in DevOps processes across a wide range of applications.

Benefits

  • Employer sponsored medical, dental and vision with a variety of coverage options
  • Company provided and supplemental life insurance
  • Paid vacation and sick time
  • Paid company holidays
  • A Spirit day to encourage and allow our employees to more easily volunteer in their community
  • A 401K plan
  • Employee Assistance Program (EAP) at no cost – services include telephonic counseling sessions, consultation on legal and financial matters, emotional well-being, family and caregiving
  • A range of additional voluntary programs, such as spending accounts, student loan refinancing, accident insurance and more!
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service