Principal Site Reliability Engineer

Fidelity InvestmentsSmithfield, RI
1dHybrid

About The Position

Are you interested in joining a dynamic, innovation-focused, and high impact team? Do you enjoy building new products and services in a collaborative, fast-paced environment? Join Fidelity Labs, our in-house new business incubator, as one of the founding Principal Site Reliability Engineers for an early-stage SaaS platform targeting the Charitable sector. In this role, you will be responsible for working within the development team in collaboration with product teams to build and scale a modern SaaS solution. The Software Engineering team delivers next-generation software application enhancements and new products for a changing world. Working at the cutting edge, we design and develop software for platforms, applications and diagnostics — all with the most advanced technologies, tools, software engineering methodologies and the collaboration of internal and external partners. Role Overview As a Senior Site Reliability Engineer, you will ensure the reliability, performance, and scalability of our new SaaS application. You will design and maintain cloud infrastructure, implement DevOps pipelines, and collaborate with enterprise DBA and cloud teams to support our YugabyteDB database infrastructure. This role combines hands-on technical work with strategic reliability engineering, requiring expertise in cloud platforms, networking, and database systems while maintaining compliance with financial regulations.

Requirements

  • 5+ years of Site Reliability Engineering or DevOps experience with cloud platforms (AWS, Azure, or GCP) including compute, storage, networking, and managed services
  • Proficiency with Infrastructure as Code tools (AWS CDK, Terraform, CloudFormation) and scripting languages (TypeScript for CDK, Python, Bash, PowerShell)
  • Experience with database administration concepts and distributed databases, preferably YugabyteDB or similar (PostgreSQL, CockroachDB)
  • Experience with Liquibase, Flyway or similar tools for managing database schema changes
  • Strong understanding of cloud networking, security groups, VPCs, load balancers, and DNS management
  • Experience building and maintaining CI/CD pipelines using Jenkins, GitLab CI, GitHub Actions, or Azure DevOps
  • Knowledge of monitoring and observability tools (Prometheus, Grafana, Datadog, CloudWatch) and incident management practices
  • Strong Linux system administration skills and containerization experience (ECS, Docker, Kubernetes)
  • Excellent problem-solving skills with ability to troubleshoot complex distributed systems and work independently

Nice To Haves

  • Experience with Spring Boot (Java) or Angular (TypeScript) application deployment and optimization
  • Experience with FinOps automation for cost-effective resource usage
  • Cloud certifications (AWS Solutions Architect, Azure Solutions Architect, CKA/CKAD)
  • Experience in financial services or regulated industries with compliance requirements
  • Knowledge of chaos engineering, fault injection, and reliability testing practices

Responsibilities

  • Design and maintain highly available cloud infrastructure using Infrastructure as Code (AWS CDK, Terraform, CloudFormation)
  • Build and optimize CI/CD pipelines for automated testing, deployment, and monitoring of applications
  • Implement and manage containerized applications using ECS, Docker, and Kubernetes with focus on reliability and performance
  • Monitor system performance, availability, and security across all environments using observability tools
  • Collaborate with enterprise DBA teams to support YugabyteDB database operations, performance tuning, and disaster recovery
  • Automate operational tasks, implement backup/disaster recovery procedures, and establish SLAs/SLOs
  • Participate in on-call rotation, incident response, and post-mortem analysis to drive continuous improvement
  • Ensure compliance with financial regulations and security best practices while mentoring team members
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service