Senior Software Engineer, Core Data Resilience

BoxRedwood City, CA
4h$198,000 - $248,000Hybrid

About The Position

Box (NYSE:BOX) is the leader in Intelligent Content Management. Our platform enables organizations to fuel collaboration, manage the entire content lifecycle, secure critical content, and transform business workflows with enterprise AI. We help companies thrive in the new AI-first era of business. Founded in 2005, Box simplifies work for leading global organizations, including JLL, Morgan Stanley, and Nationwide. Box is headquartered in Redwood City, CA, with offices across the United States, Europe, and Asia. By joining Box, you will have the unique opportunity to continue driving our platform forward. Content powers how we work. It’s the billions of files and information flowing across teams, departments, and key business processes every single day: contracts, invoices, employee records, financials, product specs, marketing assets, and more. Our mission is to bring intelligence to the world of content management and empower our customers to completely transform workflows across their organizations. With the combination of AI and enterprise content, the opportunity has never been greater to transform how the world works together and at Box you will be on the front lines of this massive shift. WHY BOX NEEDS YOU Core Data at Box powers the most fundamental layer of Box’s content platform, enabling millions of queries per second across a highly available and consistent relational data tier. Within Core Data, the Resilience team ensures that the services stay healthy, performant, and fault-tolerant - especially under load. We're looking for a Senior Software Engineer to join this Resilience team and help drive system improvements that protect Core Data’s MySQL, Memcached, and Redis-based data tier and evolve how services like Credence (Core Data’s interface layer) scale and self-heal. Collaborating with teams across Core Data and across the organization, you will help build resilient features, automate failure handling, and define best practices around some of our hardest load and resilience-related challenges. You’ll help shape how Box engineers interact with our relational data layer, while supporting our broader technical vision of transforming our relational data tier into a truly self-contained platform. Below are some of the articles describing some facets of Box Core Data that you might find interesting: How We Learned to Stop Worrying and Read from Replicas [medium.com] Strategies Used at Box to Protect MySQL at Scale [medium.com] Cache is the Root of All Evil [medium.com]

Requirements

  • Bachelor's degree in Computer Science, Mathematics, or a related field
  • 4+ years of professional software development experience
  • Proficient in common algorithms, data structures, and code design principles
  • Experience developing high-scale distributed systems
  • We are an AI-first company. This means you approach your work with a growth mindset and find ways to leverage AI to help make faster, smarter decisions that will 10X your impact at Box.

Nice To Haves

  • Familiarity with MySQL internals a huge plus.
  • Experience working with JVM-based services like Scala
  • Passionate about solving scale and performance challenges
  • Strong sense of ownership, persistence, and drive
  • Excellent communication skills

Responsibilities

  • Build and own scalable infrastructure to help database service and its interface layer remain performant and available under heavy traffic and edge conditions
  • Contribute to the design and implementation of new infrastructure components, and help uplift existing systems to meet evolving business needs.
  • Improve system resilience by identifying bottlenecks, designing fallback strategies, and evolving load-routing logic
  • Provide input into the broader Core Data’s technical direction through collaboration and execution.
  • Work in Scala and Python, with a strong focus on services that interact with our MySQL tier
  • Participate in the on-call rotation and take ownership during incidents related to database load, unavailability, or general SLO breaches.
  • Participate in our on-call rotation, available at all times while on-call to help respond to and triage any issues that arise.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service