Senior AI Platform Engineer Windows

Pennymac | Westlake Village, CA
1d | $95,000 - $155,000 | Onsite

About The Position

We’re looking for an experienced, forward-thinking engineer to strengthen our Platform Engineering capabilities across AWS and Windows environments. In this role, you’ll drive the design and evolution of scalable, secure, and automated infrastructure to support our Infrastructure and Application stack. You’ll work closely with development teams to streamline CI/CD pipelines, embed security best practices, and champion infrastructure-as-code. If you’re passionate about automation, cloud-native patterns, and making systems run smarter and faster, we want to hear from you.

Requirements

  • Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience).
  • 5+ years of experience in a Platform Engineering, DevOps, or Site Reliability Engineering (SRE) role.
  • Extensive hands-on experience with Amazon Web Services (AWS).
  • Solid understanding of Windows Server administration and integration with cloud environments.
  • Proven experience with infrastructure-as-code (IaC) tools, specifically Terraform (OpenTofu) and AWS CDK.
  • Strong experience designing and implementing CI/CD pipelines using GitLab CI/CD.
  • Experience deploying and managing .NET applications in cloud environments.
  • Deep understanding of security best practices and their implementation in cloud infrastructure and CI/CD pipelines.
  • Solid understanding of networking principles (TCP/IP, DNS, load balancing, firewalls) in cloud environments.
  • Experience with monitoring and logging tools (e.g., New Relic, CloudWatch, Cloud Logging, Prometheus).
  • Strong scripting skills (e.g., Python, Ruby, PowerShell, Bash).
  • Experience with the configuration management tool Chef.
  • Excellent problem-solving and troubleshooting skills.
  • Strong communication and collaboration skills.

Nice To Haves

  • Experience with containerization & orchestration technologies (e.g., Docker, Kubernetes) is a plus.
  • Relevant AWS and/or GCP certifications are a plus.
  • Strong understanding of PowerShell, Python, and Ruby (Chef) scripting.
  • Strong background with AWS EC2 features and services (Auto Scaling and warm pools).
  • Understanding of the Windows Server build process using tools like Chocolatey for packages and Packer for AMI/image generation.
  • Solid experience with the Windows Server operating system and server tools such as IIS.
  • Chef: Configuration management with a cookbook-centric approach for Windows environments, specifically versions 17 and 18, and Chef Supermarket.
  • SQL: Database clustering for high availability, configuration management, and automation of database tasks, plus experience running SQL instances on RDS (Relational Database Service) and EC2 (Elastic Compute Cloud), with an emphasis on automation.
  • DNS/Networking: Domain Name System management with Microsoft DNS and AWS Route 53, plus AWS VPC experience (Transit Gateway, routing, endpoints).
  • Active Directory: Security best practices, administration, and an understanding of domains, forests, trust relationships, and Public Key Infrastructure (PKI).

Responsibilities

  • Design, implement, and manage scalable and resilient infrastructure on AWS.
  • Architect and maintain Windows/Linux based environments, ensuring seamless integration with cloud platforms.
  • Develop and maintain infrastructure-as-code (IaC) using both Terraform (OpenTofu) and AWS CloudFormation/CDK.
  • Develop and maintain configuration management for Windows servers using Chef.
  • Design, build, and optimize CI/CD pipelines using GitLab CI/CD for .NET applications.
  • Implement and enforce security best practices across the infrastructure and deployment processes.
  • Collaborate closely with development teams to understand their needs and provide Platform Engineering expertise.
  • Troubleshoot and resolve infrastructure and application deployment issues.
  • Implement and manage monitoring and logging solutions to ensure system visibility and proactive issue detection.
  • Contribute to the development of Platform standards and best practices, and document them clearly and concisely.
  • Stay up-to-date with the latest industry trends and technologies in cloud computing, Platform Engineering, and security.
  • Provide mentorship and guidance to junior team members.

Benefits

  • Comprehensive Medical, Dental, and Vision
  • Paid Time Off Programs including vacation, holidays, illness, and parental leave
  • Wellness Programs, Employee Recognition Programs, and onsite gyms and cafe-style dining (select locations)
  • Retirement benefits, life insurance, 401(k) match, and tuition reimbursement
  • Philanthropy Programs including matching gifts, volunteer grants, charitable grants and corporate sponsorships