Senior Manufacturing Infrastructure Manager

d-MatrixSanta Clara, CA
Hybrid

About The Position

At d-Matrix, we are focused on unleashing the potential of generative AI to power the transformation of technology. We are at the forefront of software and hardware innovation, pushing the boundaries of what is possible. Our culture is one of respect and collaboration. We value humility and believe in direct communication. Our team is inclusive, and our differing perspectives allow for better solutions. We are seeking individuals passionate about tackling challenges and are driven by execution. Ready to come find your playground? Together, we can help shape the endless possibilities of AI.

Requirements

  • Advanced Linux proficiency (RHEL, CentOS, Ubuntu) including system installation, configuration, and optimization
  • Command-line expertise with shell scripting (Bash, Python) for automation
  • System monitoring, performance tuning, and capacity planning
  • Package management and dependency resolution
  • File system management, storage solutions, and backup strategies
  • PostgreSQL administration including installation, configuration, and optimization
  • Database performance monitoring, query optimization, and indexing strategies
  • Backup and recovery procedures, including point-in-time recovery
  • High availability setups (replication, clustering)
  • Database security, user management, and access controls
  • Experience with database migrations and schema changes
  • Containerization technologies (Docker, Kubernetes) for application deployment
  • Configuration management tools (Ansible, Puppet, Chef)
  • Infrastructure as Code (Terraform, CloudFormation)
  • CI/CD pipeline setup and maintenance (Jenkins, GitLab CI, GitHub Actions)
  • Version control systems (Git) and branching strategies
  • Monitoring and logging solutions (Prometheus, Grafana, ELK stack)
  • Scripting languages: Python, Bash, PowerShell
  • Infrastructure automation and orchestration
  • Understanding data pipeline creation and ETL processes
  • High availability system design with 99.9%+ uptime requirements
  • Disaster recovery planning and implementation
  • Network troubleshooting and optimization for manufacturing environments
  • Understanding of industrial networking (Ethernet/IP, Profinet)
  • Experience with redundant systems and failover procedures
  • Cloud platform experience (AWS, Azure, GCP) for hybrid manufacturing solutions
  • Edge computing deployment for factory floor applications
  • VPN setup and management for secure remote access
  • Load balancing and auto-scaling configurations
  • Strong troubleshooting methodology and root cause analysis
  • Ability to work under pressure during production outages
  • Clear communication with both technical and non-technical stakeholders
  • Documentation skills for procedures, runbooks, and system architecture
  • 24/7 on-call availability for critical production systems
  • Ability to work in industrial environments (factory floors, clean rooms)
  • Cross-functional collaboration with production, quality, and engineering teams

Nice To Haves

  • Minimum 7 yrs of Industry Experience - Preferred 3+ years in manufacturing, industrial automation, or similar environments
  • Linux certifications (RHCE, LPIC)
  • PostgreSQL certifications (PostgreSQL CE)
  • Cloud provider certifications (AWS Solutions Architect, Azure Administrator)

Responsibilities

  • Design, implement, and maintain scalable infrastructure supporting manufacturing operations
  • Ensure system reliability, security, and performance for production-critical applications
  • Troubleshoot complex technical issues with minimal downtime impact
  • Collaborate with manufacturing teams to optimize system integration and data flow
  • Develop and maintain automation tools, monitoring systems, and recovery procedures
  • Stay current with emerging technologies and manufacturing industry best practices
  • Advanced Linux proficiency (RHEL, CentOS, Ubuntu) including system installation, configuration, and optimization
  • Command-line expertise with shell scripting (Bash, Python) for automation
  • System monitoring, performance tuning, and capacity planning
  • Package management and dependency resolution
  • File system management, storage solutions, and backup strategies
  • PostgreSQL administration including installation, configuration, and optimization
  • Database performance monitoring, query optimization, and indexing strategies
  • Backup and recovery procedures, including point-in-time recovery
  • High availability setups (replication, clustering)
  • Database security, user management, and access controls
  • Experience with database migrations and schema changes
  • Containerization technologies (Docker, Kubernetes) for application deployment
  • Configuration management tools (Ansible, Puppet, Chef)
  • Infrastructure as Code (Terraform, CloudFormation)
  • CI/CD pipeline setup and maintenance (Jenkins, GitLab CI, GitHub Actions)
  • Version control systems (Git) and branching strategies
  • Monitoring and logging solutions (Prometheus, Grafana, ELK stack)
  • Scripting languages: Python, Bash, PowerShell
  • Infrastructure automation and orchestration
  • Understanding data pipeline creation and ETL processes
  • High availability system design with 99.9%+ uptime requirements
  • Disaster recovery planning and implementation
  • Network troubleshooting and optimization for manufacturing environments
  • Understanding of industrial networking (Ethernet/IP, Profinet)
  • Experience with redundant systems and failover procedures
  • Cloud platform experience (AWS, Azure, GCP) for hybrid manufacturing solutions
  • Edge computing deployment for factory floor applications
  • VPN setup and management for secure remote access
  • Load balancing and auto-scaling configurations
  • Strong troubleshooting methodology and root cause analysis
  • Ability to work under pressure during production outages
  • Clear communication with both technical and non-technical stakeholders
  • Documentation skills for procedures, runbooks, and system architecture
  • 24/7 on-call availability for critical production systems
  • Ability to work in industrial environments (factory floors, clean rooms)
  • Cross-functional collaboration with production, quality, and engineering teams
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service