Join NVIDIA as a Solution Architect on the Infrastructure Specialists team. Help redefine deep learning and data analytics, and power data centers worldwide, using NVIDIA products. Collaborate on building the world's largest and fastest AI Factories and supercomputers.

We are seeking a candidate who can lead the planning and deployment of large-scale AI data centers, focusing on infrastructure buildout, including power and cooling systems, telemetry and control systems, and large-scale design, construction, and delivery processes.

In this role, your main focus will be to support customers in the planning, design, construction, and deployment of large-scale AI factories. You will be part of the team building the capabilities to design, construct, and deliver large AI factories based on NVIDIA's reference designs. This includes architectural systems, power distribution, cooling systems, integration of telemetry and control systems, and all other physical infrastructure. Collaboration with product and engineering teams, customers, and the partner/provider ecosystem will be crucial to achieving successful deployments.

What you will be doing:

NVIS Data Center deployment planning: Collaborate with product and engineering teams to understand NVIDIA’s reference architectures for data center infrastructure, including power distribution, cooling systems, controls and monitoring, and network/cabling architecture. Support customers and partners in quickly turning this architecture into advanced and reliable data center designs.

Building process capabilities: Collaborate across the org to build processes, partner relationships, and workflows to deliver and deploy large AI factories at speed of light (SOL).

Design and construction oversight: Review and appraise customers' and partners' infrastructure design plans, verifying their compliance with NVIDIA reference architecture, industry standards, and regulatory requirements. Deliver guidance, expertise, and suggestions to optimize performance, scalability, and cost-effectiveness. Ensure alignment with our customers and partners on reference architecture, guidelines, and processes to make their deployments successful. Assess the operational efficiency, reliability, and readiness of data center infrastructure components before AI/HPC clusters are deployed. Develop and implement comprehensive audit plans, and conduct pre-deployment audits to identify potential issues, risks, and areas for improvement.

Partner and vendor ecosystem: Develop and sustain a strong ecosystem of manufacturers, service providers, and partners as needed, to ensure customers can deploy NVIDIA solutions rapidly and reliably. Be the key liaison for customers and partners on matters of data center infrastructure. Act as the NVIS mentor, providing guidance, mentorship, and support to ensure the team's success in their respective roles.

Quality Assurance: Develop and implement quality assurance processes to ensure that deployments meet established specifications and performance benchmarks. Conduct detailed bring-up, testing, and commissioning to validate the functionality and reliability of infrastructure components.

Continuous Improvement: Drive continuous improvement initiatives to improve data center infrastructure reliability, resilience, and sustainability. Find opportunities to streamline processes, automate repetitive tasks, and apply new technologies to optimize infrastructure operations.

Collaboration and Communication: Collaborate and communicate across internal teams, external vendors, and customers to facilitate the flawless integration of data center infrastructure solutions. Serve as a domain authority and point of contact for infrastructure-related inquiries and critical issues.
Job Type
Full-time
Career Level
Mid Level