Senior Cloud Engineer

Brookhaven National Laboratory
Ridge, NY
Remote

About The Position

Brookhaven National Laboratory is seeking a senior information technology cloud engineer with strong experience in Microsoft Azure-based infrastructures to join our Information Technology Division (ITD). ITD's mission is to deliver safe, efficient operations that ensure the delivery of the Lab's research mission by developing and deploying state-of-the-art information and computing systems; providing a reliable, secure, high-speed network infrastructure; developing a scientific computing infrastructure; supporting scientific and administrative programs; providing a cost-effective, highly reliable, secure, and standardized computing infrastructure; promoting best practices; and providing cost-effective information services. ITD includes experts in information technology (IT) infrastructure and operations, cyber security, business systems, data information, and information services.
The ideal candidate is an excellent communicator who is resourceful and innovative, and who understands that collaboration with stakeholders from across the Laboratory is critical to ensuring the right expertise and technology are in place to advance our mission of bringing science solutions to the world.
As a senior cloud engineer, you will help lead the design, development, implementation, and automation of cloud infrastructure services and solutions, primarily in the Microsoft Azure environment. This is a senior individual contributor role focused on establishing and maturing infrastructure engineering, automation, and platform development. You will modernize and improve our cloud toolset, processes, and practices while improving platform reliability and scalability. As a member of a small group of senior engineers, you will be expected to contribute significantly to the maturity of our IT infrastructure cloud engineering department.
Your experience in partnering with developers, infrastructure teams, and stakeholders to align infrastructure with business and research needs will be crucial to your success. Because we are a mixed on-premises and cloud-based IT organization, your ability to build and influence the direction of strategic IT roadmaps is critical. Additionally, fostering an infrastructure-as-code mindset while raising the team's overall maturity will be important to long-term success in the role. This role also supports and enables the Lab's evolving AI initiatives by providing secure, scalable infrastructure for AI-enabled applications. You will partner with our application, data, and AI teams to deliver reusable platform capabilities that support both traditional and AI-driven workloads.

Requirements

  • BA/BS degree, preferably in Information Technology or a related discipline, or equivalent certifications or experience.
  • 10+ years in cloud engineering, DevOps, and/or infrastructure roles demonstrating progressive growth in scope, responsibility, and complexity.
  • Strong expertise with Microsoft Azure services, including infrastructure, networking, computing, storage, security, and identity.
  • Strong platform orchestration tool experience across Linux and Windows devices.
  • Experience utilizing CI/CD tools and platforms.
  • Strong communication and documentation skills.

Nice To Haves

  • Experience with common security/compliance frameworks (ISO, SOC 2, NIST).
  • Experience in research and/or academic IT environments.
  • Experience with FinOps tools for usage and billing simplification.
  • Experience owning and implementing platforms with a product-centric view, including lifecycle management and continuous improvement.
  • A strong understanding of core technologies used by partner organizations, including application deployment tools, database platforms, networking, cyber security, and monitoring.
  • Interest in or experience with common AI/LLM platforms, including potential integration with other applications.
  • Ability to move products from ideation to delivery and operation, drawing on deep technical experience and strong business acumen.
  • Strong communication and presentation skills with the ability to influence organizational priorities.

Responsibilities

  • Cloud services lifecycle management:
      • Design, deploy, manage, and support Azure SaaS, PaaS, and IaaS services throughout their full lifecycle.
      • Ensure solutions are secure, scalable, resilient, and aligned with Laboratory standards and best practices.
  • Automation & Infrastructure-as-Code (IaC):
      • Automate provisioning and operational workflows using common scripting and configuration tools.
      • Integrate infrastructure into CI/CD pipelines.
      • Continuously automate and eliminate manual processes.
  • Reliability, Security & Operations:
      • Implement monitoring, logging, and alerting.
      • Troubleshoot infrastructure and application issues.
      • Participate in incident response and continuous improvement efforts.
      • Ensure infrastructure meets security, compliance, and governance standards.
      • Support cost optimization and performance tuning.
  • AI Platform Integration:
      • Support infrastructure for AI-enabled applications (e.g., API-based model access, inference endpoints).
      • Establish scalable and secure patterns for integrating AI services into cloud environments.
      • Partner with engineering teams to enable safe and efficient use of AI capabilities within platform guardrails.
  • Mentorship and team development:
      • Mentor junior staff and colleagues.
      • Promote knowledge sharing.
      • Contribute to a collaborative, high-performing team culture.

Benefits

  • Comprehensive employee benefits program