About the position
Arena is seeking a Senior Data Engineer to design, develop, and maintain data management systems for their conversation AI platform. The role involves building robust data pipelines, ensuring data governance and security, and collaborating with cross-functional teams. The ideal candidate should have proven experience as a Data Engineer, strong programming skills, and familiarity with cloud-based data platforms. They should also possess knowledge of data modeling, data governance, and security best practices. This is an opportunity to work in a fast-paced environment and contribute to the success of a tech company revolutionizing the conversation AI industry.
Responsibilities
- Design, develop, and maintain data management systems that power the platform
- Build robust, reliable, and scalable data pipelines from scratch
- Identify and integrate relevant data sources
- Design data extraction, transformation, and loading processes
- Implement data quality and validation checks
- Collaborate with cross-functional teams to meet the data infrastructure needs
- Establish data governance policies and procedures
- Ensure compliance with data privacy, security, and integrity regulations and best practices
- Implement mechanisms for data lineage, metadata management, and access controls
- Ensure system security, performance, and up-to-date status
- Take accountability for the quality of work
- Write clean, efficient, and maintainable code
- Participate in code reviews, architecture discussions, and engineering activities
- Collaborate with other teams to understand their data needs and provide guidance
- Share expertise, promote data engineering best practices, and mentor team members
- Have a Bachelor's or Master's degree in Computer Science, Engineering, or a related field
- Have proven experience as a Data Engineer, ideally in a startup or fast-paced environment
- Possess strong programming skills in Python, R, Java, or Scala
- Proficiency in SQL is essential
- Experience in building data pipelines, data warehouses, and ETL processes from scratch
- Familiarity with cloud-based data platforms and services (e.g., AWS, GCP) and their associated data tools
- Knowledge of data modeling, data governance, and security best practices
- Understanding of big data technologies and distributed computing frameworks is a plus
- Be self-driven and resourceful with the ability to work independently and take ownership of projects
- Have strong problem-solving skills and the ability to adapt to changing priorities in a dynamic startup environment
- Possess excellent communication and collaboration skills
Requirements
- Bachelor's or Master's degree in Computer Science, Engineering, or a related field
- Proven experience as a Data Engineer, ideally in a startup or fast-paced environment
- Strong programming skills in languages such as Python, R, Java, or Scala. Proficiency in SQL is essential
- Experience building data pipelines, data warehouses, and ETL processes from scratch
- Familiarity with cloud-based data platforms and services (e.g., AWS, GCP) and their associated data tools
- Knowledge of data modeling, data governance, and security best practices
- Understanding of big data technologies and distributed computing frameworks is a plus
- Self-driven and resourceful with the ability to work independently and take ownership of projects
- Strong problem-solving skills and the ability to adapt to changing priorities in a dynamic startup environment
- Excellent communication and collaboration skills
Benefits
- Work remotely with opportunities to visit the HQ in San Francisco, California
- Competitive salary package and benefits
- Company equity
- 4 weeks of paid time-off
- Generous Learning and Development budget
- Key moment to join Arena in terms of growth and opportunities
- Ability to put your stamp on an innovative product
- Fast-learning environment, entrepreneurial and strong team spirit
- Multiple nationalities: cosmopolite and multi-cultural mindset
- Work-life balance is important at Arena