Senior Lead Software Engineer- AI Platform engineer

JPMorgan Chase & Co.San Francisco, CA

About The Position

Be an integral part of an agile team that's constantly pushing the envelope to enhance, build, and deliver top-notch technology products. As a Senior Lead Software Engineer at JPMorgan Chase within the Corporate Sector, Infrastructure Platforms team, you are an integral part of an agile team that works to enhance, build, and deliver trusted market-leading technology products in a secure, stable, and scalable way. Drive significant business impact through your capabilities and contributions, and apply deep technical expertise and problem-solving methodologies to tackle a diverse array of challenges that span multiple technologies and applications.

Requirements

  • Formal training or certification on software engineering concepts and 5+ years applied experience
  • Hands-on experience in system design, application development, testing, and operational stability.
  • Strong experience with Kubernetes and containerization (Docker), including cluster operations and production troubleshooting.
  • Proficiency in at least one programming language, such as Python, Go, Java, or C#.
  • Ability to independently tackle design and functionality problems with minimal oversight.
  • Strong knowledge of cloud computing delivery models (IaaS, PaaS, SaaS) and deployment models (Public, Private, Hybrid Cloud).
  • Foundational understanding of machine learning concepts, including transformer architecture, ML training, and inference.
  • Experience with Infrastructure as Code.
  • Deep understanding of cloud component architecture: Microservices, IaaS, Storage, Security, and routing/switching technologies.

Nice To Haves

  • Foundational understanding of NVIDIA GPU infrastructure software (e.g., DCGM, BCM, Dynamo Inference).
  • Proficiency with observability tools like Prometheus and Grafana.
  • Experience in ML Ops and related tooling, including MLflow.
  • Background in high performance computing and ML frameworks (e.g., vLLM, Ray.io, Slurm).
  • Strong knowledge of network architecture, database programming (SQL/NoSQL), and data modeling.
  • Familiarity with cloud data services, big data processing tools, and Linux environments (scripting and administration).

Responsibilities

  • Provide technical guidance and direction to support business objectives, collaborating with technical teams, contractors, and vendors.
  • Develop secure, high-quality production code, and review and debug code written by others.
  • Influence product design, application functionality, and technical operations through informed decision-making.
  • Advocate for firm wide frameworks, tools, and practices within the Software Development Life Cycle.
  • Promote a culture of diversity, equity, inclusion, and respect within the team.
  • Architect and deploy secure, scalable cloud infrastructure platforms optimized for AI and machine learning workloads.
  • Collaborate with AI teams to translate computational needs into infrastructure requirements.
  • Monitor, manage, and optimize cloud resources for performance and cost efficiency.
  • Design and implement continuous integration and delivery pipelines for machine learning workloads.
  • Develop automation scripts and infrastructure as code to streamline deployment and management tasks.

Benefits

  • comprehensive health care coverage
  • on-site health and wellness centers
  • a retirement savings plan
  • backup childcare
  • tuition reimbursement
  • mental health support
  • financial coaching
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service