About The Position

AI Solution Principal Systems Development Engineer Our customers’ system requirements are usually highly complex. Bringing together hardware and software systems design, Systems Development Engineering operates at the very cutting edge of technology to meet them. We design and develop electronic and electro-mechanical or systems-orientated products, conduct feasibility studies on engineering proposals and prepare installation, operation and maintenance specifications and instructions. We’re proud to deliver programs and products to the highest standards, on time and within budget. Join us to do the best work of your career and make a profound social impact as an AI Solution Principal Systems Development Engineer on our AI & HPC Solutions Engineering Team in Austin Texas, United States . What you’ll achieve As an AI Solution Principal Systems Development Engineer , you will design, define and implement complex system requirements for customers and prepare studies and analyses of existing systems. You will: Deliver assigned system interfaces with design and process across extended teams Prepare documentation for inspection/testing procedures Design, develop and implement cost effective methods of testing and troubleshooting systems and equipment Prepare test and diagnostic programs, design test fixtures and equipment and complete specifications and procedures for new products Take the first step towards your dream career Every Dell Technologies team member brings something unique to the table. Here’s what we are looking for with this role:

Requirements

  • 6–12 years’ experience with advanced understanding of modern system architectures and deep knowledge of hardware–software interactions to diagnose and resolve complex system issues.
  • Hands-on experience with NVIDIA GPUs, including strong familiarity with NVIDIA’s hardware, software, and management stack, as well as experience with Dell servers, storage, networking, and related system software.
  • Strong background in Core AI, GenAI, and ML technologies, including large‑scale model customization methods (training, inferencing, RAG, etc.) and integrating AI workflows with modeling/simulation (Mod/SIM) and data analysis pipelines.
  • Practical expertise with modern orchestration and cluster‑management platforms, including Kubernetes, OpenShift, Bright Cluster Manager (BCM), and Slurm for scheduling, deployment, and workload management.
  • Demonstrated senior‑level capability in system design and integration, applying architectural insight, debugging skills, and deep technical judgment to lead development, performance tuning, and issue resolution across complex environments.

Nice To Haves

  • Experience on AI tools and Models, including Hugging Face, Meta, ML Frameworks like Pytorch and Jupyter, MLOps such as mlflow, etc
  • Experience with containerized enviroments and programming environments (+ Portability) for automation and scripting

Responsibilities

  • Deliver assigned system interfaces with design and process across extended teams
  • Prepare documentation for inspection/testing procedures
  • Design, develop and implement cost effective methods of testing and troubleshooting systems and equipment
  • Prepare test and diagnostic programs, design test fixtures and equipment and complete specifications and procedures for new products

Benefits

  • Choice of medical coverage
  • Competitive bonus & commission programs
  • Wellness program with medical premium discounts
  • 401(k) Plan with before-tax and Roth contributions
  • Generous Time Off Programs
  • Team member discounts on Dell products
  • Fitness Reimbursement
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service