Site Reliability Engineering Production Engineer

Samsung ElectronicsTaylor, TX
2dOnsite

About The Position

About Samsung Austin Semiconductor Samsung is a world leader in advanced semiconductor technology, founded on the belief that the pursuit of excellence creates a better world. At SAS, we are Innovating Today to Power the Devices of Tomorrow. Come innovate with us! Position Summary As a Production Engineer you'll be responsible for monitoring system performance, availability, and reliability. You'll take charge during operational incidents, oversees recovery efforts, and facilitates continuous improvement by conducting thorough incident analyses and developing standard operating procedures. You'll also collaborate with global teams in Korea and local stakeholders to ensure effective sensor integration, alarm management, and database resolution.

Requirements

  • BS Degree in Computer Science/Engineering or related major.
  • 3+ years of experience in a software development or DevOps role or 2 years plus on monitoring development role including Splunk, Machine Learning
  • Basic understanding of System Architectures (Software & Hardware Interactions, Networks).
  • Must have understanding of SQL; Microsoft SQL Server and Oracle experience preferred.
  • Knowledge in programming languages (Shell Scripts C# .NET or Java SE required).
  • Knowledge/Experience with Oracle & SQL Server Databases.
  • Knowledge/Experience with both Windows & Unix Environments.
  • Knowledge/Experience with shell script, bash script, PowerShell script, Framework and Web development.

Responsibilities

  • Engage in the active development of sensors designed to monitor performance, availability, and reliability metrics within systems
  • Employ a variety of monitoring techniques to ensure optimal operational efficiency.
  • Serve as the designated incident commander; taking proactive role in overseeing system and fabrication operations during incidents and the subsequent recovery process.
  • Conduct a comprehensive analysis of incidents and issues, which includes determining the root cause, implementing containment measures, and facilitating the closure of the matter.
  • Develop, document, and revise Standard Operating Procedures (SOPs) to enhance operational support.
  • Improve development skills in system operations, focusing on Java, shell, Bash, and PowerShell scripting for monitoring, alarm management, sensor integration, and system/database issue resolution.
  • Collaborate with both international team members in Korea and local personnel to address issues and ensure effective integration with other related systems.

Benefits

  • Medical, dental, and vision insurance
  • Life insurance and 401(k) matching with immediate vesting
  • Onsite café(s) and workout facilities
  • Paid maternity and paternity leave
  • Paid time off (PTO) + 2 personal holidays and 10 regular holidays
  • Wellness incentives and MORE
  • Eligible full-time employees (salaried or hourly) may also receive MBO bonuses based on company, division, and individual performance.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service