Being an Intern in our DevOps/Site Reliability Engineering (SRE) Team provides highly motivated and qualified students the opportunity to gain hands‑on experience in modern software delivery, platform reliability, and operational excellence. This role offers first‑hand exposure to both DevOps /SRE practices, including how we design, build, deploy, operate, and continuously improve reliable systems across on‑premises platforms and cloud‑based applications. Interns will learn how DevOps and SRE work together to enable fast, safe software delivery while maintaining high standards for reliability, scalability, performance, and security. The intern will collaborate closely with development, operations, and platform teams to learn and apply best practices for CI/CD automation, infrastructure as code, monitoring, observability, and reliability engineering. They will gain exposure to enterprise‑scale systems and interact with engineers, managers, and senior leaders to better understand the intersection of technology, operations, and business within the insurance industry. Through this experience, the intern will build a strong understanding of the end‑to‑end technology stack - including networking, storage, operating systems, virtualization, databases, applications, and cloud services—to observe, monitor, troubleshoot, and automate activities within the Berkley environment. Key functions will include but are not limited to: Support DevOps and SRE monitoring, observability, and automation through scripting, SQL, and tooling. Assist in defining and tracking reliability metrics, including SLIs, SLOs, and error budgets. Help implement and maintain monitoring and alerting to proactively identify performance and reliability issues. Contribute to CI/CD pipelines, deployment automation, and operational tooling that support reliable software delivery. Assist with incident response, troubleshooting, and performance analysis to minimize downtime and user impact. Collaborate with development and operations teams to embed reliability, resiliency, and observability into systems. Research emerging DevOps and SRE tools and practices and share findings with the team. Document standards, configurations, issues, and resolutions to support operational consistency. Demonstrate strong communication, prioritization, and multitasking skills in a fast‑paced environment.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Career Level
Intern
Education Level
No Education Listed