Modern Government Solutions • Posted 6 days ago
Full-time • Mid Level
Onsite • Tysons, VA

Modern Government Solutions (MGS) is seeking a Senior Platform Storage Engineer to design, deploy, and maintain enterprise storage infrastructure that powers large-scale Artificial Intelligence (AI) workloads. This role focuses on building secure, high-performance, and scalable storage solutions across Linux-based and hybrid environments. The ideal candidate will have deep expertise in SAN/NAS technologies, NVMe systems, and automation tools, working collaboratively with cross-functional teams to ensure reliability, efficiency, and data security within mission-critical platforms.

  • Design, deploy, and maintain enterprise storage solutions supporting large-scale AI and high-performance computing environments.
  • Configure and manage SAN/NAS infrastructure using technologies such as iSCSI, Fibre Channel, NFS, and NVMe.
  • Optimize storage performance, capacity, and reliability across Linux-based systems and hybrid environments.
  • Implement data replication, backup, and disaster recovery solutions to ensure business continuity.
  • Automate provisioning, monitoring, and maintenance tasks using scripting and Infrastructure-as-Code tools (e.g., Ansible, Chef, Puppet).
  • Integrate and secure storage systems with encryption at rest and in transit.
  • Monitor performance metrics with tools like Prometheus, Grafana, or Zabbix to identify and resolve bottlenecks.
  • Design storage architectures spanning on-premises and cloud platforms (AWS S3, Azure Blob, GCP Storage).
  • Collaborate with cross-functional teams to deliver scalable, secure, and compliant storage infrastructure.
  • Mentor junior engineers and lead initiatives to enhance system performance and operational efficiency.
  • Travel up to 20% for on-site installations, maintenance, and troubleshooting at customer or datacenter locations.
  • Must possess an active Department of Defense (DoD) TS/SCI with Counterintelligence (CI) Polygraph.
  • Bachelor's or Master's degree in Computer Science, Engineering, or related field (or equivalent experience).
  • Current IAM Level II certification (e.g., Security+ CE, CAP, CASP) per DoD 8570 IAT requirements.
  • 8+ years of hands-on experience designing, deploying, and managing enterprise storage infrastructure.
  • 5+ years of experience with SAN/NAS technologies such as iSCSI, Fibre Channel, FCoE, NFS, SMB/CIFS, NVMe/TCP, and NVMe/RoCE.
  • Advanced proficiency with RAID configuration, disk management, and file systems (ext4, XFS, Btrfs).
  • Experience designing hybrid or multi-cloud storage solutions, including AWS S3, Azure Blob, or Google Cloud Storage.
  • Experience in designing and implementing disaster recovery and data replication strategies (synchronous/asynchronous).
  • Demonstrated experience managing enterprise storage platforms from Dell EMC, NetApp, or HPE Nimble, including firmware and performance optimization.
  • Experience implementing and using monitoring and performance tools (Prometheus, Grafana, Zabbix) to analyze I/O and optimize throughput.
  • Proficiency in scripting (Python, Bash, or similar) to automate configuration, monitoring, and reporting tasks.
  • Strong understanding of data encryption at rest and in transit, and data tiering strategies for performance and cost efficiency.
  • Excellent problem-solving and communication skills with the ability to lead technical initiatives and mentor junior engineers.
  • Experience with Infrastructure-as-Code tools (Ansible, Chef, or Puppet) for storage automation and configuration management.
  • Experience administering Linux systems (RHEL, CentOS, Ubuntu) including user management, security hardening, and troubleshooting.
  • Experience optimizing for NVMe-based systems and managing flash memory performance.
  • Experience designing solutions that span both on-premises and cloud environments.
  • Familiarity with Ceph, Swift, or other object storage platforms.
  • Knowledge of Lustre or GPFS/Spectrum Scale, which are critical for high-performance computing environments.
  • Understanding of how storage integrates with job schedulers like Slurm or PBS.