DigitalOcean, posted 26 days ago
Full-time • Manager
Remote • Boston, MA
1,001-5,000 employees
Publishing Industries

Dive in and do the best work of your career at DigitalOcean. Journey alongside a strong community of top talent who are relentless in their drive to build the simplest scalable cloud. If you have a growth mindset, naturally like to think big and bold, and are energized by the fast-paced environment of a true industry disruptor, you'll find your place here. We value winning together while learning, having fun, and making a profound difference for the dreamers and builders in the world.

We want people who are passionate about building features that they and their peers will love. DigitalOcean's GradientAI Infrastructure Team is welcoming a new technical engineering manager to support our engineers, grow our culture, and lead a team developing our AI/ML infrastructure products. Upon selection, you will be responsible for guiding the development of a 6-8 person engineering team, facilitating communications, providing clarity of vision and priority, and empowering the team to create innovative solutions for our partners and customers.

This team will be building a new product that will bring our famed DigitalOcean Simplicity to the world of Large Language Model (LLM) hosting, serving, and optimization. If you are someone who shares our passions for technology solutions, healthy services, and being loving service providers, team members, and leaders, we want to meet you!

  • Growing and leading a highly-collaborative engineering team
  • Developing and shepherding complex AI and cloud engineering projects through the entire product development lifecycle (PDLC) - ideation, product definition, experimentation, prototyping, development, testing, release, and operations
  • Helping the team achieve higher standards of performance and product quality
  • Introducing and improving processes for team performance and quality-of-life
  • Collaborating with product owners and cross-functional teams to design idiomatic, feature-rich, and operationally sustainable software solutions
  • Overseeing the design and implementation of scalable, automated systems for DNS provisioning, monitoring, and failover
  • Facilitating transparent, constructive communication and a fair, but growth-oriented, distribution of responsibilities between team members
  • Providing coaching and counseling through mentoring, one-on-one meetings, and similar channels
  • 7+ years of experience in software engineering, including 4+ years of distributed systems development, 2+ years building AI/ML technologies (ideally related to LLM hosting and inference), and 2+ years in a people management or team lead role
  • A passion for leading, coaching, and mentoring software engineers
  • Enduring interest in distributed systems design, AI/ML, and implementation at scale in the cloud
  • Deep expertise in cloud computing platforms and modern AI/ML technologies
  • Experience with modern LLMs, ideally related to hosting, serving, and optimizing such models
  • Experience researching, evaluating, and building with open source technologies
  • Proficiency in programming languages commonly used in cloud development, such as Python and Go
  • Experience with infrastructure as code (IaC) tools like Terraform or Ansible
  • Knowledge of networking concepts (e.g., TCP/IP, VPCs, subnets, routing) and storage systems
  • A strong sense of ownership and a drive to figure out and resolve any issues preventing you and your team from delivering value to your customers
  • An appreciation for process and developing cross-disciplinary collaboration between engineering, operations, support, and product groups
  • Strong project management skills
  • Familiarity with end-to-end quality best practices and their implementation
  • Enthusiasm for staffing, interviewing, growing, and retaining teams
  • Experience coordinating with partner teams across time zones and geographies
  • Experience with GPU platforms from AMD and NVIDIA, and with the associated toolsets for tuning, configuring, and accelerating workloads on them, is ideal but not required
  • We innovate with purpose.
  • We prioritize career development.
  • We care about your well-being.
  • We reward our employees.