Platform Engineer Lead - AWS and DevOps

QualcommSan Diego, CA
49dOnsite

About The Position

We are seeking a highly skilled and experienced Platform Engineer Lead to join our global team, working in a 24x7 support model for business-critical applications deployed in a hybrid environment - AWS and on-premise datacenter. The Lead will play a pivotal role in ensuring the reliability, scalability, and performance of our applications. The role requires full-time onsite work in San Diego, CA (5 days per week).

Requirements

  • Bachelor's degree in Information Technology, Computer Science, or related field.
  • Certifications: CKA, AWS Solutions Architect or SysOps Administrator.
  • Expertise in DevSecOps, K8s administration, Jenkins CI/CD.
  • Proficiency in container orchestration (Docker, Git, JFrog, Maven, SonarQube, Secrets manager).
  • Experience in triaging and resolving AWS security vulnerabilities.
  • Strong critical thinking skills to debug complex technical issues independently.
  • Experience working with multi-cloud environments (AWS and Azure).
  • Development skills in Java, Python, AngularJS, ReactJS, PowerShell.
  • Strong communication skills and ability to prioritize tasks effectively.

Nice To Haves

  • Experience with AWS ML Platform solutioning using tools such as AWS Bedrock, SageMaker Studio, Data Lake, EKS, Carpenter, Airflow, S3/Glacier, Aurora MySQL, EMR, Airflow, Lambda, EC2, Event Bridge, SQS/SNS and Network configs (VPC, PrivateLink, IAM roles, SG, ALB, WAF, IP policies), CloudFormation, CDK, RDS, Data Lake - Snowflake.
  • Machine Learning & AI - Bedrock, Terraform IAC, Azure.
  • Working knowledge of AWS deployment architecture (IAM/Security best practices, EKS, Networking).
  • Good understanding of server, storage, and Well Architected Frameworks (WAF).
  • Able to execute multiple projects with good communication and prioritization with service owners.
  • Strong communication - will take part in Agile Scrum/PI planning for project/task prioritizations and lead discussions for SRE project updates, dependencies, technical roadmap, and infrastructure cost forecasting.

Responsibilities

  • Collaborate closely with service owners to design scalable platform architecture with resiliency.
  • Drive business projects with clear objectives to the SRE team that align with business roadmaps, task delegation, and implement Agile practices.
  • Foster a culture of accountability by leading by example and driving blameless retrospectives for debug post-mortems.
  • Oversee application deployments, infrastructure provisioning, issue debugging, and timely execution using JIRA agile methodology.
  • Ensure high-quality documentation, including detailed SRE operations, configuration steps, custom procedures/scripts, testing/validation processes, and knowledge base articles.
  • Drive capacity and cost optimization across all applications.
  • Administer applications running on hybrid infrastructure (on-premise & cloud).
  • Define SLI/SLO and implement APM dashboards (Datadog, Splunk).
  • Automate complex manual tasks using scripting tools (Python, Shell/PS) and configuration management tools (Ansible).
  • Debug and perform database tuning (MySQL, MongoDB, RDS, SQL Server).
  • Implement general security best practices (Azure AD, MFA/SSO, Audit & Log event tracking, RBAC).
  • Provision automations with Terraform IAC and DevOps tools for CI/CD framework.
  • Utilize AWS Bedrock for deployments.
  • Work with data ingestion, transformation, warehousing, machine learning, and data analytics concepts using AWS Redshift, Data Lakes, and Snowflake.
  • Develop automation using Lambda, EventBridge, SQS/SNS and manage EMR clusters.
  • Integrate PowerBI for data visualization.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Industry

Computer and Electronic Product Manufacturing

Number of Employees

5,001-10,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service