POSITION SUMMARY: We are seeking a hands-on Lead Architect to design, implement, and maintain scalable, secure, reliable, and cost-efficient cloud infrastructure and DevOps/SRE solutions. The ideal candidate will have deep technical expertise in Azure (and optionally GCP), infrastructure automation, and modern DevOps/SRE practices. This role combines architectural leadership with hands-on execution, ideal for someone who enjoys designing systems and building reliable solutions ESSENTIAL FUNCTIONS: Architect, build, and manage highly available, resilient, and scalable cloud infrastructure across Azure (and optionally GCP). Develop Infrastructure as Code (IaC) using Terraform and Ansible for repeatable, version-controlled deployments. Design, deploy, and operate containerized applications using Docker and Kubernetes. Build and maintain CI/CD pipelines with GitHub Actions for efficient and secure software delivery. Implement and maintain monitoring, logging, and alerting using OpenTelemetry (OTel), Elastic Cloud, and other observability tools. Automate operational tasks using Python and Bash scripting. Architect and manage networking, load balancing, and DNS configurations across multi-region cloud environments. Design and implement high-availability, active-active, and multi-region architectures leveraging global load balancers. Collaborate with application teams to ensure performance, scalability, and cost efficiency. Drive cloud governance and cost optimization strategies. Provide technical mentorship to DevOps and engineering teams. ALL OTHER DUTIES AS ASSIGNED EXPERIENCE/QUALIFICATIONS: Minimum Degree Required: Bachelor's Degree 7+ years of experience in cloud infrastructure design and deployment. Proven expertise in Azure; exposure to GCP is a plus. Advanced proficiency with Terraform, Ansible, Docker, Kubernetes, and GitHub Actions. Strong scripting skills in Python and Bash. Solid understanding of networking concepts (VNets, subnets, routing, firewalls, DNS, load balancing). Hands-on experience with high-availability, distributed, multi-region architectures. Experience with monitoring and observability stacks (OpenTelemetry, Elastic Cloud, Prometheus, Grafana). Demonstrated ability in cloud cost optimization and governance best practices. Strong problem-solving, analytical, and communication skills.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Number of Employees
5,001-10,000 employees