About The Position

About the Team The Neuroboros team was recently created to pursue the ambitious goal of leveraging and expanding Generative AI technologies to help customers benefit from the scale and price/performance equation offered by Amazon Machine Learning hardware. The creation of the team in NYC is key to Annapurna Labs’ location strategy, with the goal of creating an additional hub attracting top talent with varied backgrounds to work on challenging problems, using and building state-of-the-art tooling. About Amazon Annapurna Labs: Amazon Annapurna Labs team (our organization within AWS UC) is responsible for building innovation in silicon and software for our AWS customers. We are at the forefront of innovation by combining cloud scale with the world’s most talented engineers. Our team covers multiple disciplines including silicon engineering, hardware design, software and operations. Because of our team’s breadth of talent, we have been able to improve AWS cloud infrastructure in high-performance machine learning with AWS Neuron, Inferentia and Trainium ML chips, in networking and security with products such as AWS Nitro, Enhanced Network Adapter (ENA), and Elastic Fabric Adapter (EFA), and in computing with AWS Graviton and F1 EC2 instances. About AWS Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses. About AWS Neuron: AWS Neuron is the software of Trainium and Inferentia, the AWS Machine Learning chips. Inferentia delivers best-in-class ML inference performance at the lowest cost in the cloud to our AWS customers. Trainium is designed to deliver the best-in-class ML training performance at the lowest training cost in the cloud, and it’s all being enabled by AWS Neuron. Neuron is a Software that include ML compiler and native integration into popular ML frameworks. Our products are being used at scale with external customers like Anthropic and Databricks as well as internal customers like Alexa, Amazon Bedrock, Amazon Robotics, Amazon Ads, Amazon Rekognition and many more. Job Summary You will join a dynamic team working at the cutting edge of the GenAI revolution by applying AI to AI. You will work on building agents, tools, and models to simplify and accelerate customer adoption of Neuron, the software stack supporting Amazon's Machine Learning silicon: Trainium. Partnering with external and internal customers, you will identify key obstacles and opportunities to accelerate their migration to AWS's ML silicon. You will be a key contributor driving impact by building AI agents and tools that simplify AWS Neuron adoption, which is critical to AWS's Generative AI business.

Requirements

  • 3+ years of non-internship professional software development experience
  • 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
  • Experience programming with at least one software programming language
  • Experience in written and verbal communication with the ability to present complex technical information in a clear and concise manner to executives and non-technical leaders
  • Knowledge of software engineering best practices across the development life cycle, including agile methodologies, coding standards, code reviews, source management, build processes, testing, and operations
  • Computer Science core: object-oriented design, data structures, and performance analysis with at least 2 programming languages.
  • Hands-on technical experience working in the Generative AI space
  • Experience in one or more of the following areas: ML compilers, production coding agents, GenAI model architecture, model training, neural network optimization, or alternatively applied math.
  • Passion for customer experience and usability, including successful delivery of customer self-service tools and automated management/optimization of services, and a strong services orientation
  • 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
  • Bachelor's degree or above in computer science or equivalent
  • Knowledge of AWS services
  • 2+ years in machine learning or other computational modeling environments with an emphasis on hosting, building or optimizing models for diverse hardware platforms
  • Proven track record in building AI agents that automate ML workload optimization, ML compiler tuning, distributed inference and training, or ML kernel authoring and optimization
  • Experience working with open-source software communities in the optimization space or related areas
  • Knowledge of the state-of-the-art technology used in the Machine Learning space and its mathematical underpinning

Responsibilities

  • Research implementations that deliver the best possible experiences for customers.
  • Deliver on goals to improve the time and effort it takes to port and optimize Machine Learning workloads on Neuron.
  • Solve challenging technical problems, often ones not solved before, at every layer of the stack
  • Design, implement, test, deploy and maintain innovative software solutions to transform service performance, durability, cost, and security.
  • Build high-quality, highly available, always-on products.
  • Potentially contribute intellectual property through patents
  • Build high-impact solutions to deliver to our large customer base.
  • Participate in design discussions, code review, and communicate with internal and external stakeholders.
  • Work cross-functionally to help drive business decisions with your technical input.
  • Work in a startup-like development environment, where you’re always working on the most important stuff.

Benefits

  • health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
  • 401(k) matching
  • paid time off
  • parental leave

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Entry Level

Number of Employees

5,001-10,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service