General Motors • Posted 4 months ago
$195,000 - $298,800/Yr
Full-time • Senior
Austin, TX
5,001-10,000 employees

The Software Engineering Site Reliability Engineer (SRE) is responsible for ensuring the reliability, scalability, and performance of software systems. The SRE monitors the performance and availability of software systems, identifies and resolves issues, and implements proactive measures to prevent future incidents. The role includes developing and maintaining automation tools and infrastructure to streamline software deployment, configuration management, and system monitoring, as well as analyzing system performance, identifying bottlenecks, and implementing optimizations that improve efficiency and scalability. When incidents occur, the SRE responds, conducts root cause analysis, and implements corrective actions to prevent recurrence. The SRE also collaborates with software development teams so that reliability and scalability considerations are built into software design and implementation, and identifies opportunities for process improvement, applies best practices, and drives initiatives that enhance the reliability and performance of software systems.

Responsibilities:
  • Monitoring the performance and availability of software systems.
  • Identifying and resolving issues.
  • Implementing proactive measures to prevent future incidents.
  • Developing and maintaining automation tools and infrastructure.
  • Analyzing system performance and identifying bottlenecks.
  • Implementing optimizations to improve efficiency and scalability.
  • Responding to incidents and conducting root cause analysis.
  • Collaborating with software development teams.
  • Identifying opportunities for process improvement.
Qualifications:
  • 8+ years of relevant professional experience.
  • Bachelor's degree in Computer Science or a related field, or equivalent work experience.
  • Proficiency in at least one programming language (e.g., Python, Go, Java).
  • Solid understanding of operating systems, networking, distributed systems, databases, and storage architectures.
  • Deep understanding of how code runs on underlying hardware.
  • Ability to optimize or troubleshoot code by understanding its execution.
  • Proven experience in automating manual processes.
  • Experience handling production incidents.
  • Strong communication skills.
  • Experience with cloud platforms (AWS, GCP, Azure).
  • Familiarity with container orchestration systems like Kubernetes.
  • A track record of managing or developing distributed systems.
  • Prior experience with Java in production.
Benefits:
  • Medical, dental, and vision insurance.
  • Health Savings Account.
  • Flexible Spending Accounts.
  • Retirement savings plan.
  • Sickness and accident benefits.
  • Life insurance.
  • Paid vacation & holidays.
  • Tuition assistance programs.
  • Employee assistance program.
  • GM vehicle discounts.