About the position
We are looking for a highly skilled DevOps Engineer to build and operate big data infrastructures, fast-changing web apps, real-time APIs, and handle a lot of traffic. The ideal candidate will provide expertise in the design, implementation, and operation of scalable distributed systems, develop new methods and tools to automate and self-heal production environments, and apply software development workflows to operational environments. They will also be responsible for testing and tuning newly developed systems to prepare them for production deployment and ensuring maximum performance at minimum cost. The role requires a full understanding of GitOps, container orchestration, and the key pillars of observability. The successful candidate will have a Bachelor's degree or higher in Computer Science and at least 6 years of professional experience with automated unit and integration tests, Continuous Integration/Continuous Deployment (CI/CD), and experience with at least one programming language.
Responsibilities
- Provide expertise in the design, implementation, and operation of scalable distributed systems to assist development teams in making the right decisions early
- Develop new methods and tools to automate and self-heal production environments
- Apply software development workflows to operational environments
- Test and tune newly developed systems to prepare them for production deployment and ensure maximum performance at minimum cost
- Automate packaging, deployment, and configuration of internally developed applications
- Be the expert in our internal applications, from the high-level architecture down to the code
- Implement new application features, especially features supporting operational excellence: stability, scalability, redundancy, etc.
- Build tools that make your colleagues more effective
- Lead, coach and grow our GitDevSecOps practice by bridging development, Ops and QA
- Become and stay an expert in current and emerging technologies and tools
- Contribute to Open Source solutions and communities we use wherever you can
- Measure everything, providing critical operational insight into our applications
Requirements
- Bachelor's degree or higher in Computer Science
- Experience in the early-mid stages of a fast-growing company
- 6+ years of professional experience with automated unit and integration tests and Continuous Integration/Continuous Deployment (CI/CD)
- Full understanding of GitOps: CI (Github Actions), CD (ArgoCD, Flux), IaC (Terraform) and experience with Container Orchestration (Docker, Kubernetes, Helm), the key pillars of Observability (Logging, Metrics and Tracing), using tools like OpenTelemetry, Prometheus, DataDog etc, Monitoring and Alerting using Kubernetes Prometheus stack (Prometheus, Grafana, AlertManager), and at least one programming language (preferably Go, Python, Scala, and/or Java, but any language will do)