Position Description: Builds and monitors distributed and scalable full-stack Web applications in Amazon Web Services (AWS) Cloud environments. Performs re-hosting, re-platforming, and re-architecting for existing legacy on-prem applications migrated to AWS. Designs, develops, and deploys highly resilient and highly available multi-tier application platforms written in Python, and supports Shell scripting languages. Automates infrastructure provisioning, deployment workflows, performance monitoring, and operational tasks using the scripting languages Python, Golang, PowerShell, and Bash. Executes system provisioning, configuration management, and environment setup using Infrastructure as Code (IaC) tools (Terraform and Ansible). Applies Cloud Computing and DevOps concepts, including Continuous Integration and Continuous Delivery (CI/CD) pipelines and Kubernetes, to efficiently and reliably deploy systems at scale.

Primary Responsibilities: Participates in architecture design teams. Defines and implements application-level architecture. Gathers business requirements and translates them into detailed technical specifications and modular components for implementation by the development team. Develops applications on complex projects, components, and subsystems for the division. Recommends development testing tools and methodologies, and reviews and validates test plans. Ensures the quality and readiness of software deliverables. Implements automated testing, code reviews, and validation processes prior to release. Develops comprehensive documentation for multiple applications or subsystems. Establishes full project life cycle plans for complex projects across multiple platforms. Ensures timely project completion and budgetary adherence. Advises on risk assessment and risk management strategies for projects. Plans and coordinates project schedules and assignments for multiple projects. Acts as a primary liaison for business units to resolve various project/technology issues.
Provides technology solutions to daily issues and technical evaluation estimates on technology initiatives. Advises senior management on technical strategy. Mentors junior team members. Performs independent and complex technical and functional analysis for multiple projects supporting several divisional initiatives. Develops original and creative technical solutions to ongoing development efforts.

Education and Experience: Bachelor’s degree in Computer Science, Information Science, Engineering, Information Technology, Information Systems, or a closely related field (or foreign education equivalent) and five (5) years of experience as a Principal Software Engineer/Developer (or closely related occupation) designing, automating, and supporting Web-based applications in AWS and on-premise environments, using Cloud technologies and Application Performance Monitoring (APM) tools. Or, alternatively, Master’s degree in Computer Science, Information Science, Engineering, Information Technology, Information Systems, or a closely related field (or foreign education equivalent) and three (3) years of experience as a Principal Software Engineer/Developer (or closely related occupation) designing, automating, and supporting Web-based applications in AWS and on-premise environments, using Cloud technologies and Application Performance Monitoring (APM) tools.

Skills and Knowledge: Candidate must also possess:

Demonstrated Expertise (“DE”) building and maintaining full-stack applications using Node.js and React.js integrated with backend services via REST Application Programming Interfaces (APIs); and improving engineering productivity by providing custom User Interface (UI) based interfaces for incident tracking, configuration management, and change approvals.
DE automating business and technology operations, including Kubernetes cluster rehydration, application recycling, patching, disaster recovery, and IT Service Management (ITSM) reporting, using Jenkins, uDeploy, RunDeck, Ansible, and AWS; designing and optimizing CI/CD pipelines with automated testing, security controls, and infrastructure provisioning, using Jenkins, Serverless Framework, Terraform, AWS CloudFormation, and SonarQube; performing unit testing and integration testing, using PyTest and unit test libraries; developing configuration management scripts using Ansible; and writing complex SQL/PL-SQL queries in SQL Server, Snowflake, and Oracle.

DE researching performance bottlenecks to provide tuning recommendations, ensure rapid root cause analysis, and reduce Mean Time to Detect (MTTD) and Mean Time to Respond (MTTR), using APM tools (Dynatrace, New Relic, Splunk, and Datadog); and performing capacity planning to recommend CPU sizing, memory allocation, and process configuration management, using APM tools.

DE instrumenting advanced observability practices at scale across Cloud-native and hybrid environments, and defining and tracking Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to ensure reliability and performance metrics, using Python automation and IaC methodologies, Datadog, Splunk, Dynatrace, Grafana, and ELK stack; developing custom dashboards, alerting rules, and automated incident response workflows to proactively detect and resolve performance degradations, using Datadog, Catchpoint, Grafana, ELK, and CloudWatch; and enabling actionable insights through trace-level correlation of end-to-end (E2E) user journeys and system behaviors, using Dynatrace, Splunk, Draw.io, and Miro.
Job Type
Full-time
Career Level
Mid Level
Number of Employees
5,001-10,000 employees