Data Scientist III - Lead Data Architect

Astreya
Oakland, CA
Hybrid

About The Position

We are seeking a Data Modeling Expert with a strong AI/analytics focus to enable next-generation data platforms supporting predictive analytics, machine learning, and intelligent automation. This role will design and optimize data models that power use cases such as grid reliability, predictive maintenance, wildfire risk modeling, customer analytics, and AI-driven operations.

Requirements

  • 8+ years in data modeling, data architecture, or analytics engineering
  • 3+ years of utility/energy domain experience (smart grid, AMI, SCADA systems) supporting electric, gas, and/or water utilities
  • Strong expertise in:
      • Dimensional modeling for analytics (star/snowflake schemas)
      • Data modeling for machine learning pipelines
      • SQL and data transformation frameworks (dbt preferred)
  • Experience designing data models for:
      • Data lakes / lakehouse architectures (Delta Lake, Iceberg, etc.)
      • Structured + semi-structured data (JSON, Parquet)
  • Proven experience supporting AI/ML workloads in production environments

Nice To Haves

  • Experience with cloud AI ecosystems: AWS (SageMaker, Redshift), Azure (Synapse, Azure ML), GCP (BigQuery, Vertex AI)
  • Familiarity with time-series and streaming platforms (Kafka, Spark Streaming)
  • Knowledge of feature stores (Feast, Tecton)
  • Experience with MLOps tools (MLflow, Kubeflow)
  • Understanding of LLM data preparation, vector databases, and embeddings

Responsibilities

  • Design AI-ready data models to support machine learning, advanced analytics, and real-time decisioning
  • Build and maintain feature-ready datasets for data science teams (feature engineering support)
  • Develop semantic and analytical data layers for BI, AI, and self-service analytics
  • Collaborate with data scientists to translate ML use cases into scalable data structures
  • Model and integrate high-volume time-series and IoT data (e.g., smart meters, sensors, grid telemetry)
  • Enable real-time / near-real-time data pipelines for AI-driven insights
  • Ensure data models support MLOps frameworks (model training, validation, deployment pipelines)
  • Implement data lineage, observability, and quality frameworks to support trusted AI outcomes
  • Optimize data structures for lakehouse architectures and distributed compute environments
  • Align with data governance, privacy, and regulatory compliance requirements

Benefits

  • Medical provided through UHC (PPO, HSA, Surest options) or through Kaiser (HMO option only, for California employees only)
  • Dental provided through UHC Nationwide
  • Vision provided by UHC
  • Flexible Spending Account for Health & Dependent Care
  • Pre-Tax Account for Commuter Benefit/Parking & Transit (location-specific)
  • Continuing Education and Professional Development via various integrated platforms, e.g. Udemy and Coursera
  • Corporate Wellness Program provided by Goomi Group
  • Employee Assistance Program
  • Wellness Days
  • 401k Plan
  • Basic and Supplemental Life Insurance
  • Short Term & Long Term Disability
  • Critical Illness, Critical Hospital, and Voluntary Accident Insurance
  • Tuition Reimbursement (available 6 months after start date, capped)
  • Paid Time Off (accrued and prorated, maximum of 120 hours annually)
  • Paid Holidays
  • Any other statutory leaves, paid time, or other ancillary benefits required under state and federal law