About the position
We are seeking an InfraOps Engineer with experience in delivering and operating IT infrastructure for both private and public clouds. The ideal candidate will have expertise in Linux, network integration, storage, and virtualization, as well as a strong understanding of DevOps and SRE approaches. Responsibilities include ensuring system availability, performance, and efficiency, as well as change management and capacity planning. The InfraOps team plays a crucial role in providing automation and monitoring infrastructure to enhance the overall experience of Nubankers. Desired skills include Unix expertise, scripting languages, web development, version control systems, and knowledge of cloud deployment and monitoring systems. Fluency in English and Spanish, along with experience working with global teams, is also required.
Responsibilities
- Delivery and operation of large-scale (US$10M / >1000 VMs) server infrastructure in private clouds (VMware and/or OpenStack) and public clouds (AWS, Azure, Google Cloud)
- Automation at the operational level and monitoring of infrastructure
- Participation in project high-level and low-level design (HLD/LLD); hardware, IaaS, PaaS, and SaaS installation; application commissioning; network integration; interoperability tests; and acceptance test procedures (ATPs)
- Operation and maintenance of large Linux-based IT infrastructure in production environments during the commercial stage
- Working with network, storage, and virtualization infrastructure under a DevOps culture and an SRE approach
- Responsibility for service SLOs and for system availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning
- Delivering automation at the operational level and monitoring of infrastructure to make the experience of Nubankers more comfortable
- Gathering metrics to confirm that decisions are sound, and providing tailored solutions
Requirements
- Experience in delivery and operation of large-scale (US$10M / >1000 VMs) server infrastructure in private clouds (VMware and/or OpenStack) and public clouds (AWS, Azure, Google Cloud)
- Experience in automation at the operational level and monitoring of infrastructure
- Participation in project high-level and low-level design (HLD/LLD); hardware, IaaS, PaaS, and SaaS installation; application commissioning; network integration; interoperability tests; and acceptance test procedures (ATPs)
- Experience in operation and maintenance of large Linux-based IT infrastructure in production environments during the commercial stage
- Experience with infrastructure network, storage, and virtualization
- DevOps culture and SRE approach
- Experience being responsible for service SLOs and for system availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning
- Advanced Unix expertise, with hands-on experience in any Unix-derived CLI environment (any Linux distribution or macOS)
- Basic to intermediate experience reading and writing scripts in languages such as Bash/shell and Python to automate tasks
- Advanced experience reading and writing web development languages such as PHP, CSS, JavaScript, and HTML
- Intermediate to advanced experience reading and writing code with Python libraries used in web development, such as Django, Flask, Pandas, and Requests
- Intermediate to advanced experience building and optimizing scalable web services
- Intermediate to advanced experience deploying and managing web servers such as Apache or Nginx
- Intermediate to advanced experience working with and building production-ready REST APIs
- Intermediate to advanced experience working with and building production-ready, highly secure web applications and services for internal and external customers
- Strong knowledge of version control systems like Git (GitHub, GitLab)
- Knowledge of Apple MDM tools (Mosyle, Jamf)
- Knowledge of Infrastructure as Code methodologies and configuration management tools (Ansible, Puppet, Chef)
- Knowledge of high-level programming and scripting languages (e.g., Python, Go, Clojure, Bash)
- Knowledge of automated installation and configuration of end-user settings (Ubuntu and macOS)
- Knowledge of deploying and configuring applications in AWS and VMware
- Knowledge of monitoring systems (Prometheus, Telegraf, Grafana, Checkmk, Splunk, Alertmanager, APIs, etc.)
- Knowledge of Docker (containers, clusters, orchestrators)
- Knowledge of microservices and CI/CD pipelines
- Knowledge of SQL and NoSQL databases (e.g., MySQL, MariaDB, Datomic, DynamoDB, PostgreSQL)
- Advanced knowledge of a security-first approach to designing solutions
- Experience working with global teams and supporting customers remotely
- Experience working with high-availability and highly scalable systems in a production environment
- Effective communication in English and Spanish
- Knowledge of Agile methodologies (Scrum, Kanban)
- Experience working with Atlassian tools like Confluence, Jira, Opsgenie
- Enjoyment of tackling big technical challenges with quality solutions, and a sense of urgency when prioritizing problems
Benefits
- Opportunity to earn equity at Nubank
- Extended maternity and paternity leaves
- Health and life insurance
- NuCare - Our mental health and wellness assistance program
- Nucleo - Our learning platform of courses
- NuLanguage - Our language learning program
- Holiday bonus ("Aguinaldo") equal to 30 days of pay per year
- 17 days of paid vacation with 25% vacation bonus
- Gym partnership