Senior Data Engineer

VIA Science•Somerville, MA

10d•$120,000 - $160,000•Hybrid

About The Position

VIA is making an impact, and so can you. At VIA, our mission is to make communities cleaner, safer, and more equitable. We believe that by working across organizational boundaries, we can achieve greater collective good than we can individually. VIA overcomes digital barriers to collective action by providing the world’s most secure and simple data and identity protection solutions. VIA is trusted by the U.S. Department of Defense and Fortune 100 companies around the globe to solve their toughest data and identity protection challenges. Using our Web3, quantum-resistant, passwordless technologies (19 issued patents), VIA protects data against theft, manipulation, and misuse.

An impressive mission requires an equally impressive Senior Data Engineer. As a Senior Data Engineer at VIA, you will play a pivotal role in the growth of our solutions. You will build the foundation that empowers our customers to harness AI for human-centric, data-driven decision-making. You will work cross-functionally with a high-performing team of data professionals, developers, DevOps, and Client Delivery specialists who are already pushing the boundaries of what’s possible with AI. Individuals who excel in this role are motivated by solving complex data accessibility challenges, holding a high bar for data quality and availability, and improving performance. Are you ready to join us?

Requirements

Education: Bachelor’s degree or higher in Computer Science, Engineering, or Data Science
Experience: 5+ years of professional experience in data engineering or a related role
Core engineering: A strong foundation in Python (or equivalent), including testing frameworks (e.g., pytest) and ORMs (e.g., SQLAlchemy). You understand modularity and how to define clear scopes and responsibilities within a large codebase
Data architecture: Proven experience architecting scalable relational and non-relational (SQL/NoSQL) schemas. You manage the end-to-end database lifecycle, from initial design to production maintenance
Performance engineering: Expertise in maximizing system performance through advanced query tuning, strategic indexing, and execution plan analysis to eliminate technical bottlenecks
Cloud infrastructure: Experience with one or more cloud-based databases (e.g., AWS RDS, Azure Database). You are comfortable configuring compute resources, backups, and geolocation requirements
Data orchestration: Experience building resilient pipelines using frameworks such as Dagster or Apache Airflow. You have a track record of maintaining data health for both real-time streaming and batch processing
Systems thinking: A strong understanding of how data infrastructure integrates into the broader application architecture
Professional standards: Experience with modern software development practices, including version control (Git), CI/CD pipelines, and a commitment to high-quality, maintainable code

Nice To Haves

Streaming and edge tech: Experience working with streaming data (e.g., Kafka) or running data models on the edge (e.g., Raspberry Pi, IoT devices)
DevOps tools: Familiarity with containerization and orchestration tools such as Docker and Kubernetes
API design: Experience architecting and consuming scalable RESTful APIs using standardized design principles and robust authentication protocols
Web3 and privacy: Familiarity with blockchain data indexing or privacy-preserving data processing techniques
Leadership: Experience mentoring junior engineers or leading technical projects within a high-performing team

Responsibilities

Architect secure solutions: Design and implement robust, cloud-based data storage solutions, optimizing schemas for multi-tenant environments while ensuring data accessibility, security, and a high standard of trust and transparency
Engineer data pipelines: Develop, deploy, and maintain resilient ETL/ELT pipelines for both real-time streaming and batch processing, ensuring seamless data flow from raw ingestion to production-ready applications
Facilitate data accessibility: Build and manage data access layers, including REST APIs and streaming services, to empower downstream users
Drive data governance and best practices: Contribute across teams to recommend tools, processes, and best practices for maintaining data health, integrity, and security
Operationalize AI models: Support AI operations (MLOps) by managing versioning, containerization, and deployment of AI models
Monitor and optimize infrastructure: Build monitoring and alerting systems to track data health and system performance, proactively identifying and remediating bottlenecks

Benefits

A salary range of $120,000 - $160,000
A fully funded, top-tier health benefits plan, fully covered from day one, including vision and dental coverage for your whole family
A flexible vacation policy with no set annual limit or accrual period, Summer Fridays, and an extended holiday period in December
401(k) plan with up to 5% employer contribution
Paid parental leave, supporting new parents and families
A dedicated wellness advisor to help you navigate the programs and opportunities available at VIA
Ability to enjoy the best of both worlds with flexibility to work from home as needed, as well as access to four well-located offices, designed for collaboration and stocked with everything you could need
Opportunities to work remotely from eligible locations for up to 2 months per year
Individualized growth opportunities, including internal and external mentorship panels, custom goals and feedback sessions, and/or access to learning and development programs, including VIA’s unrivaled leadership program
Transit benefits to support commuting costs
In-person events to foster team bonding and collaboration across different teams