Senior / Principal C++ Software Engineer, Data Engines
Voltron Data · Posted: July 31, 2023 · Remote
About the position
We are seeking a highly motivated Senior or Principal C++ Software Engineer for Data Engines to work on data execution engines within the Apache Arrow project. The role involves collaborating with Voltron Data development teams to enhance the modularity and composability of data analytics tools. The focus will be on maintaining and optimizing data engines and implementing new features that improve their efficiency, performance, scalability, and stability. This position offers the opportunity to contribute to a new foundation for data analytics and to work within a diverse and inclusive company committed to employee growth.
Responsibilities
- Work directly on data execution engines for the Apache Arrow project
- Collaborate with Voltron Data development teams to develop data analytics tools
- Maintain, optimize, and implement new features with a focus on performance, scalability, and stability
- Learn and understand the Apache Arrow compute primitives, compute intermediate representation, query parser and optimizer, and other foundational components
- Understand the architecture of data processing engines and how different components interact with each other
- Profile query runs for data engines and analyze execution telemetry
- Dive deep into relational algebra algorithms and find ways to improve them
- Fix bugs, implement simple optimizations, and refactor code
- Make improvements to engine components such as resource management, task scheduling, and fault tolerance
- Build new relational algebra components to expand SQL coverage or DataFrame functionality coverage
- Help systems improve by using more modular and composable components
- Propose and implement core architecture improvements to engine components
- Work on challenging tasks such as language-agnostic user-defined functions, multi-query concurrency, and multi-tenancy
- Integrate engines with other components and features developed by other teams in the company
Requirements
- Highly motivated Senior or Principal C++ Software Engineer
- Experience working on data execution engines
- Familiarity with the Apache Arrow project
- Ability to develop data analytics tools to improve modularity and composability
- Proficiency in maintaining, optimizing, and implementing new features
- Strong focus on performance, scalability, and stability
- Knowledge of data engines' efficiency at all scales
- Series A, venture-backed startup experience
- Experience in driving open source ecosystems, particularly in C++, Python, and R programming
- Commitment to building a diverse and inclusive workplace
- Willingness to learn and embrace the Apache development process
- Ability to profile query runs and analyze execution telemetry
- Understanding of relational algebra algorithms and ability to improve them
- Bug fixing, optimization, and code refactoring skills
- Familiarity with resource management, task scheduling, and fault tolerance
- Ability to build new relational algebra components to expand SQL coverage or DataFrame functionality
- Ability to propose and implement core architecture improvements to engine components
- Readiness to work on challenging tasks such as language-agnostic user-defined functions, multi-query concurrency, and multi-tenancy
- Ability to integrate engines with other components and features developed by other teams
Benefits
- Work from Anywhere - Payroll and Benefits in 150+ Countries
- Unlimited PTO
- Medical, Dental, and Vision
- Retirement [USA Only]
- Home Office Budget
- Continuing Education Budget