About The Position

FreeWheel, a Comcast company, provides comprehensive ad platforms for publishers, advertisers, and media buyers. FreeWheel is seeking an SRE to join the Freewheel OPS team based in Denver, CO or Chicago, IL. As a member of the Global Operation team, you will be responsible for ensuring the reliability, scalability, and performance of Freewheel systems. Working closely with engineers and other operation sub-teams, you will manage infrastructure, optimize system reliability, automate daily operations, and resolve technical issues that impact upstream/downstream platform. The company offers SRE positions in 2 different areas: SRE3-Steaming Hub and SRE3-Data. While each area has a slightly different day-to-day focus depending on the development teams they support, the core responsibilities and requirements remain consistent. If candidates would like to focus on the SRE3-Data part, the daily task will more focus on backend data components.

Requirements

  • 3+ years of experience as an SRE, DevOps or Operations Engineer.
  • Experience with an automation tool or framework such as Ansible, Terraform, Kubernetes, Docker for automating system deployment.
  • Proficient in at least one programming language, such as Python, Go, Java, or Scala, with the ability to write efficient scripts and automation tools.
  • Familiar with using monitoring and log management tools such as Prometheus, Grafana, ELK Stack, or other similar tools.
  • Excellent communication skills with the ability to convey technical information clearly and concisely to both technical and non-technical stakeholders.
  • Bachelor’s degree or higher in Computer Science, Software Engineering, or a related field.

Nice To Haves

  • Experience with cloud platforms (e.g. AWS, OCI, GCP, Azure) is a plus.
  • Hands-on experience with Terraform and infrastructure as code principle is a huge plus.
  • Proactive learner eager to grow in operations and governance.

Responsibilities

  • Design and implement monitoring and alerting systems to ensure the stability, reliability, and performance of data platforms.
  • Join in on-call shift to quickly respond to and resolve issues.
  • Develop and maintain automation tools and scripts for deployment, monitoring, backup and disaster recovery.
  • Analyze and optimize the performance of data storage, query performance, and data flows to ensure efficient processing of large-scale datasets, reduce latency, an improve processing speed.
  • Respond quickly to platform failures, perform troubleshooting, and coordinate cross-team efforts to resolve issues and ensure high availability and reliability.
  • Work with engineering teams to analyze and forecast capacity requirements, ensuring the system can handle traffic growth and scale infrastructure accordingly.
  • Support Freewheel powered Live events.
  • Document the architecture, configurations, and operational procedures for platforms, ensuring knowledge is shared across the team and providing relevant training.
  • Ensure platforms meet security standards and compliance requirements to prevent breaches or misuse.
  • Collaborate with engineering team, product team, and project management team to support product design and implementation, solving reliability-related issues.

Benefits

  • Most non-sales positions are eligible for a Bonus.
  • Comcast provides best-in-class Benefits to eligible employees.
  • Benefits should connect you to the support you need when it matters most, and should help you care for those who matter most.
  • Array of options, expert guidance and always-on tools, that are personalized to meet the needs of your reality – to help support you physically, financially and emotionally through the big milestones and in your everyday life.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service