NTT America-posted about 2 months ago
Full-time • Mid Level
Austin, TX
5,001-10,000 employees

This Lead AWS Public Senior Cloud Engineer is responsible for advanced technical support, administration, and optimization of managed customer cloud environments spanning AWS, Azure, Google Cloud Platform (GCP), and Oracle Cloud Infrastructure (OCI) with primary focus on AWS. This position demands deep multi-cloud expertise, a strong understanding of managed services operations, and a proactive, problem-solving outlook. The Senior Cloud Engineer will also participate in automation initiatives, incident and change management, and mentor junior team members. Minimal Travel expectation 10% to Austin, TX.

  • Support Customer Self-Provision cloud instances across AWS, Azure, GCP, and OCI with security guardrail and backend deployment.
  • Monitor, troubleshoot, and resolve incidents, performance issues, and service outages in production and staging environments.
  • Implement and maintain monitoring, alerting, and logging solutions to ensure high availability and reliability.
  • Lead root cause analysis and post-mortem documentation for major incidents.
  • Execute patch management, upgrades, and regular maintenance activities.
  • Develop and maintain backup, disaster recovery, and failover strategies and operations.
  • Participate in on-call rotation and after-hours support as required.
  • Develop and maintain Infrastructure as Code (IaC) templates using tools such as Terraform, CloudFormation, ARM, or OCI Resource Manager.
  • Use scripting (e.g., Python, Bash, PowerShell) to automate repetitive tasks and operational processes.
  • Champion the use of configuration management tools and assist in DevOps pipeline integrations.
  • Recommend and implement cost optimization, resource utilization, and rightsizing strategies.
  • Ensure adherence to security best practices, including least-privilege access, encryption, and network segmentation.
  • Implement and manage identity and access management (IAM) policies and roles.
  • Monitor, identify, and remediate security vulnerabilities reported by scanning tools or external advisories.
  • Support compliance efforts related to customer and regulatory requirements (TxRAMP, ISO, SOC2, etc.).
  • Work closely with application, security, and network teams for solution delivery and support.
  • Mentor junior engineers and provide technical guidance as needed.
  • Create and update technical documentation, runbooks, and SOPs.
  • Participate in client calls to provide technical input when required.
  • Must have Primary Skll: AWS SME
  • Secondary Skill : GCP/OCI
  • 10+ years of hands-on experience in cloud engineering, operations, and/or support.
  • 8+ years multi-cloud experience (must have hands-on in at least 5 of AWS/Azure/GCP/OCI; familiarity in all is preferred; AWS and GCP/OCI cloud are mandatory)
  • Must have hands-on experience in architecture, deployment, monitoring, and troubleshooting in major public cloud platforms (AWS, Azure, GCP, OCI).
  • Bachelor's degree (or equivalent experience) in Computer Science, IT, Engineering, or a related field.
  • At least two of the following certifications (or equivalent experience): AWS Certified Solutions Architect / SysOps Administrator Microsoft Certified: Azure Administrator Associate or Solutions Architect Expert Google Professional Cloud Architect / Engineer Oracle Cloud Infrastructure Architect Associate/Professional
  • Direct experience in managed services/NOC/SOC/MSP environments is a plus.
  • In-depth expertise with provisioning, configuring, securing, supporting, and optimizing cloud-native and hybrid workloads in AWS, Azure, GCP, and/or OCI.
  • Administration of compute, storage, networking, database, and PaaS services across supported platforms.
  • Experience with CI/CD pipelines, containerization (Docker, Kubernetes), and automation tools (Terraform, CloudFormation, ARM templates, etc.).
  • Familiarity with cloud-native security best practices (IAM, network security, data encryption, etc.).
  • Proficiency in scripting languages (Python, Bash, PowerShell, etc.).
  • Expert in ServiceNow ITSM and familiar with integration with AWS, Azure, GCP and OCI in service catalog.
  • Expert in Cloud Cost Optimization. Familiar with Apptio Cloudability.
  • Strong expertise in multi-cloud disaster recovery.
  • Familiarity in AppGate SDP, Qualys TotalCloud, Qualys Patch Management, Qualys CSAM, CrowdStrike, Palo Alto NGFW, etc.
  • Ability to analyze logs and monitor performance using native tools (CloudWatch, Azure Monitor, Stackdriver, OCI Monitoring, etc.)
  • Strong understanding of backup strategy, disaster recovery, and high-availability architecture.
  • Be able to support customer remote file service in Azure File Share in Azure Gov Cloud.
  • Have strong expertise in multi-cloud security compliance, data encryption, network security, user access control, private endpoint setup, etc.
  • Have strong expertise in GitHub and Repository management
  • Be able to set up rules/thresholds in Azure Monitor, AWS CloudWatch, GCP Monitoring and OCI Monitoring to generate alerts and connect with ServiceNow Incident Ticketing
  • Be able to connect multi-cloud VMs and instances with Microsoft Sentinel SIEM
  • Be able to support customer self-provision cloud instances with required security (guardrail) via Azure Blueprints, AWS Control Tower, etc.
  • (Preferred) DevOps or automation certifications (e.g., Kubernetes, Terraform).
  • (Preferred) ITIL Foundation or other support framework knowledge.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service