Senior Data Engineer

VIA ScienceSomerville, MA
10d$120,000 - $160,000Hybrid

About The Position

VIA is making an impact, and so can you. At VIA, our mission is to make communities cleaner, safer, and more equitable. We believe that by working across organizational boundaries, we can achieve greater collective good than we can individually. VIA overcomes digital barriers to collective action by providing the world’s most secure and simple data and identity protection solutions. VIA is trusted by the U.S. Department of Defense and Fortune 100 companies around the globe to solve their toughest data and identity protection challenges. Using our Web3, quantum-resistant, passwordless technologies (19 issued patents), VIA protects data against theft, manipulation, and misuse. An impressive mission requires an equally impressive Senior Data Engineer. As a Senior Data Engineer at VIA, you will play a pivotal role in the growth of our solutions. You will build the foundation that empowers our customers to harness AI for human-centric, data-driven decision-making. You will work cross-functionally with a high-performing team of data professionals, developers, DevOps, and Client Delivery specialists who are already pushing the boundaries of what’s possible with AI. Individuals who excel in this role are motivated by solving complex data accessibility challenges, holding a high bar for data quality and availability, and improving performance. Are you ready to join us?

Requirements

  • Education: Bachelor’s degree or higher in Computer Science, Engineering, or Data Science
  • Experience: 5+ years of professional experience in data engineering or a related role
  • Core engineering: A strong foundation in Python (or equivalent), including testing frameworks (e.g., pytest) and ORMs (e.g., SQLAlchemy) You understand modularity and how to define clear scopes and responsibilities within a large codebase
  • Data architecture: Proven experience architecting scalable relational and non-relational (SQL/noSQL) schemas You manage the end-to-end database lifecycle, from initial design to production maintenance
  • Performance engineering: Expertise in maximizing system performance through advanced query tuning, strategic indexing, and execution plan analysis to eliminate technical bottlenecks
  • Cloud infrastructure: Experience with one or more cloud-based databases (e.g., AWS RDS, Azure Database) You are comfortable configuring compute resources, backups, and geolocation requirements
  • Data orchestration: Experience building resilient pipelines using frameworks such as Dagster or Apache Airflow You have a track record of maintaining data health for both real-time streaming and batch processing
  • Systems thinking: A strong understanding of how data infrastructure integrates into the broader application architecture
  • Professional standards: Experience with modern software development practices, including version control (Git), CI/CD pipelines, and a commitment to high-quality, maintainable code

Nice To Haves

  • Streaming and edge tech: Experience working with streaming data (e.g., Kafka) or running data models on the edge (e.g., Raspberry Pi, IoT devices)
  • DevOps tools: Familiarity with containerization and orchestration tools such as Docker and Kubernetes
  • API design: Experience architecting and consuming scalable RESTful APIs using standardized design principles and robust authentication protocols
  • Web3 and privacy: Familiarity with blockchain data indexing or privacy-preserving data processing techniques
  • Leadership: Experience mentoring junior engineers or leading technical projects within a high-performing team

Responsibilities

  • Architect secure solutions: Design and implement robust, cloud-based data storage solutions, optimizing schemas for multi-tenant environments while ensuring data accessibility and security and a high standard of trust and transparency
  • Engineer data pipelines: Develop, deploy, and maintain resilient ETL/ELT pipelines for both real-time streaming and batch processing, ensuring seamless data flow from raw ingestion to production-ready applications
  • Facilitate data accessibility: Build and manage data access layers, including REST APIs and streaming services, to empower downstream users
  • Drive data governance and best practices: Contribute across teams to recommend tools, processes, and best practices for maintaining data health, integrity, and security
  • Operationalize AI models: Support AI operations (MLOps) by managing versioning, containerization, and deployment of AI models
  • Monitor and optimize infrastructure: Build monitoring and alerting systems to track data health and system performance, proactively identifying and remediating bottlenecks

Benefits

  • A salary range of $120,000 - $160,000
  • A fully funded, top-tier health benefits plan, fully covered from day one, including vision and dental coverage for your whole family
  • Flexible Vacation Policy with no set annual limit or accrual period, Summer Fridays, and an extended holiday period in December
  • 401(k) plan with up to 5% employer contribution
  • Paid parental leave, supporting new parents and families
  • A dedicated wellness advisor to help you navigate the programs and opportunities available at VIA
  • Ability to enjoy the best of both worlds with flexibility to work from home as needed, as well as access to four well-located offices, designed for collaboration and stocked with everything you could need
  • Opportunities to work remotely from eligible locations for up to 2 months per year
  • Individualized growth opportunities, including internal and external mentorship panels, custom goals and feedback sessions, and/or access to learning and development programs, including VIA’s unrivaled leadership program
  • Transit benefits to support commuting costs
  • In-person events to foster team bonding and collaboration across different teams
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service