Principal Engineer - Databases, Data Platform, MLOps

Abbott Laboratories•Alameda, CA

20d•$128,000 - $256,000

About The Position

At Lingo, we’re building a groundbreaking health platform that combines continuous biosensor data, real-time analytics, and personalized insights to help people live fuller, longer, and healthier lives. Our systems ingest millions of sensor readings daily, powering experiences for consumers and partners worldwide, with the reliability and scalability of cloud-native, enterprise-grade platforms. THE OPPORTUNITY We’re looking for a Principal Engineer to lead the technical vision for our database infrastructure and enterprise data platform. You will own architecture decisions that enable large-scale, reliable, and secure data systems spanning batch, streaming, and real-time workloads. While AI and MLOps are emerging areas for us, you’ll play a key role in laying a foundation for future machine learning capabilities. This role combines hands-on engineering with tech strategic leadership across databases and data pipelines. This is an opportunity to shape the foundation of a platform that will support millions of users globally.

Requirements

10+ years of software engineering experience with 7+ years focused on databases, data engineering, or ML infrastructure
Deep knowledge of relational and NoSQL databases, query optimization, replication, and sharding.
Experience building and operating large-scale data pipelines
Strong SQL and data modeling skills (dimensional, 3NF, Data Vault).
Familiarity with cloud-native data services (AWS, Azure, GCP).
Proven ability to lead cross-functional technical initiatives.

Nice To Haves

Exposure to vector databases (pgvector, Pinecone, Milvus) and generative AI retrieval patterns.
Knowledge of regulated industry requirements (HIPAA, SOC2, GDPR).
Experience with Kubernetes and containerized data workloads.
Understanding of FinOps and data infrastructure cost optimization.

Responsibilities

Design and maintain database systems across relational, NoSQL, and time-series technologies
Implement performance tuning, indexing, sharding, replication, and failover strategies
Develop standards for schema design, migrations, and capacity planning
Architect and optimize data pipelines using Spark, Flink, Kafka, Airflow, or equivalent
Evolve our data platform to support batch, streaming, and real-time needs as well as define backups, restore testing, patching, and security controls to ensure compliance
Drive data quality, observability, lineage tracking, and efficient resource usage
Coordinate with stakeholders to define and enforce data modeling practices
Design our MLOps foundations and practices such as data preparation, reproducibility, and monitoring
Drive integration points for future ML model deployment and vector database use cases.
Serve as a technical anchor across data and ML domains, providing architectural guidance and code reviews for complex systems
Author technical RFCs, architecture decision records, and engineering standards adopted across teams
Mentor engineers, raise the bar on engineering excellence and best practices
Collaborate with Product, Security, Compliance, and Infrastructure teams to align data strategy with business objectives
Represent the organization in vendor evaluations, open-source community engagement, and industry forums