Systems And Storage Engineer

CriticalTiltAtlanta, GA
21h

About The Position

CriticalTilt blends 25+ years of specialized experience with a lean, responsive approach, delivering tailored solutions to government agencies and private sector clients. From navigating complex networks to adapting to new compliance demands, we understand our customers’ challenges and are primed to tilt the board towards success for their projects. Position Overview CriticalTilt, Inc. is seeking a highly skilled Systems and Storage Engineer to design, implement, and support enterprise-class compute and storage infrastructure for new datacenter deployments. This role is responsible for leading the technical configuration, integration, and validation of compute and storage systems following established engineering standards and project requirements. The Systems and Storage Engineer will work in close collaboration with Project Managers, Datacenter Technicians, and customer engineering teams to ensure successful, secure, and optimized deployments across multiple locations. This role requires deep technical knowledge of server, storage, and virtualization platforms, particularly in high-performance environments. The ideal candidate will have hands-on experience integrating systems such as VMware, DataDirect Networks (DDN), NetApp, and WekaIO (Weka), and the ability to translate design intent into fully functioning infrastructure.

Requirements

  • U.S. Citizenship required.
  • Bachelor’s degree in Computer Science, Information Systems, Engineering, or a related field; or equivalent experience.
  • Minimum 5 years of experience in systems engineering, storage architecture, or infrastructure deployment roles.
  • Proven expertise with VMware virtualization technologies (ESXi, vCenter, cluster configuration).
  • Strong knowledge of enterprise storage systems, including DataDirect Networks (DDN), NetApp, and WekaIO, or comparable platforms.
  • Proficiency with Ethernet and InfiniBand networking, including switch configuration, topology design, and performance troubleshooting.
  • Experience managing and integrating GPU-based compute systems into virtualized and storage environments.
  • Strong documentation, communication, and cross-functional collaboration skills.
  • Ability to work independently in complex, fast-paced project environments with frequent travel.

Nice To Haves

  • Familiarity with automation, scripting, or configuration management tools (e.g., Ansible, PowerShell, Python) preferred.
  • Understanding of air and liquid cooling systems as they relate to high-performance compute environments.

Responsibilities

  • Lead and execute system and storage configuration, integration, and validation for new datacenter deployments.
  • Perform detailed setup and management of VMware, DataDirect Networks (DDN), NetApp, and WekaIO environments, including cluster formation, storage provisioning, and network integration.
  • Collaborate with Datacenter Technicians to guide and verify rack, stack, cabling, and base configuration activities.
  • Manage Ethernet and InfiniBand networking for storage and compute systems, ensuring proper connectivity and performance.
  • Implement and maintain virtualization, storage, and data management best practices across systems and platforms.
  • Conduct performance tuning, troubleshooting, and system optimization in support of operational readiness.
  • Develop and maintain configuration documentation, diagrams, and playbooks to ensure repeatable, standardized deployments.
  • Support hardware and software lifecycle management, including patching, firmware updates, and capacity planning.
  • Collaborate with Project Managers to define and meet project milestones, ensuring on-time and compliant delivery.
  • Interface with customer engineers to integrate solutions into broader system environments and provide Tier 3 escalation support.
  • Ensure all systems comply with applicable security, data protection, and compliance requirements.
  • Provide technical mentorship to Datacenter Technicians and contribute to continuous improvement initiatives.
  • Utilize Datacenter Infrastructure Management (DCIM) and IP Address Management (IPAM) tools for documentation and system oversight.
  • Travel frequently to customer sites for system deployment, validation, and testing activities.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service