About the position
OpenPhone is seeking a skilled individual to join their team in a high-impact role focused on creating and maintaining polished policies, procedures, and technical documentation. The successful candidate will collaborate closely with the Engineering team to ensure compliance and peak performance of critical systems through automated processes. They will also be responsible for managing and monitoring diverse Kubernetes clusters, with a goal of achieving an impressive 99.99% service availability. This role offers the opportunity to work with cutting-edge technology and contribute to the development of zero-downtime solutions for highly available services and disaster recovery.
Responsibilities
- Collaborate with the DevOps team and work independently to create high-quality policies, procedures, and technical documentation.
- Ensure infrastructure compliance with specifications and maintain business-critical systems through automated processes.
- Manage and monitor diverse Kubernetes clusters in various infrastructures, striving for 99.99% service availability.
- Design and implement zero-downtime solutions for highly available services and disaster recovery across different regions.
- Contribute to capacity planning, anticipate performance bottlenecks, and facilitate environment scaling.
- Monitor, troubleshoot, and resolve issues in all environments, including production incidents.
- Implement CI/CD pipelines using GitHub Actions for efficient software delivery.
- Promote security awareness, produce supporting materials, and address current threat landscape concerns.
- Take a risk-based approach to information security and manage third-party risks and mitigation processes.
- Stay updated on ecosystem challenges, exploits, and developments to proactively address potential vulnerabilities.
Requirements
- Skilled Linux user with experience in Source Code and Document Management Systems
- Proficient coding/scripting in at least one modern language for application development or utilities
- Knowledgeable in building, running, and securing Docker containers
- Familiarity with configuring, securing, and orchestrating containers and microservices, especially using Kubernetes
- Ability to analyze data sets and generate reports using tools like SQL, POSIX stream processing, spreadsheets, ODBC, and Python
- In-depth understanding of information security and risk management
- Experience with CI/CD pipelines using GitHub Actions for efficient software delivery
- Strong problem-solving and troubleshooting skills
- Excellent written and verbal communication skills
Benefits
- High impact, high volume, and pivotal projects in cutting-edge technology
- Collaboration with the DevOps team and independent work
- Creation of high-quality policies, procedures, and technical documentation
- Infrastructure compliance and maintenance of business-critical systems through automated processes
- Management and monitoring of diverse Kubernetes clusters with a focus on service availability
- Design and implementation of zero-downtime solutions for highly available services and disaster recovery
- Contribution to capacity planning and environment scaling
- Monitoring, troubleshooting, and resolution of issues in all environments
- Implementation of CI/CD pipelines for efficient software delivery
- Promotion of security awareness and addressing current threat landscape concerns
- Risk-based approach to information security and management of third-party risks
- Opportunity to stay updated on ecosystem challenges, exploits, and developments
- Fully remote work environment with asynchronous collaboration
- Inclusive and diverse work environment without discrimination based on various factors