Microsoft-posted 3 days ago
Full-time • Mid Level
Redmond, WA
5,001-10,000 employees

Own end-to-end delivery: Lead the full modeling lifecycle for security scenarios, from data ingestion and curation to training, evaluation, deployment, and monitoring. Problem framing, literature review, model design, offline evaluation, online experimentation, and production deployment. Implement and optimize models: Design and implement privacy-preserving data workflows, including anonymization, templating, synthetic augmentation, and quantitative utility measurement. Develop and maintain fine-tuning and adaptation recipes for transformer models, including parameter-efficient methods and reinforcement learning from human or synthetic feedback. Establish objective benchmarks, metrics, and automated gates for accuracy, robustness, safety, and performance, enabling repeatable model shipping. Productionize AI & ML Collaborate with engineering and product teams to productionize models, harden pipelines, and meet service-level objectives for latency, throughput, and availability. Develop fine-tuning techniques for transformer models and establish benchmarks for accuracy, robustness, and performance to ensure reliable model delivery. Drive MLOps best practices: CI/CD, model registry, feature store, model serving, monitoring/drift. Champion Responsible AI: fairness, explainability, privacy (GDPR/CCPA) and security considerations in model design and deployment. Operational excellence: code quality, tests, observability (logs/metrics/traces), on-call ownership for ML services, and SLA adherence. Collaborate cross‑functionally: write design docs/RFCs, partner with PMs and engineers, and drive execution towards predictable outcomes and timelines.

  • Lead the full modeling lifecycle for security scenarios, from data ingestion and curation to training, evaluation, deployment, and monitoring.
  • Design and implement privacy-preserving data workflows, including anonymization, templating, synthetic augmentation, and quantitative utility measurement.
  • Develop and maintain fine-tuning and adaptation recipes for transformer models, including parameter-efficient methods and reinforcement learning from human or synthetic feedback.
  • Establish objective benchmarks, metrics, and automated gates for accuracy, robustness, safety, and performance, enabling repeatable model shipping.
  • Collaborate with engineering and product teams to productionize models, harden pipelines, and meet service-level objectives for latency, throughput, and availability.
  • Develop fine-tuning techniques for transformer models and establish benchmarks for accuracy, robustness, and performance to ensure reliable model delivery.
  • Drive MLOps best practices: CI/CD, model registry, feature store, model serving, monitoring/drift.
  • Champion Responsible AI: fairness, explainability, privacy (GDPR/CCPA) and security considerations in model design and deployment.
  • Operational excellence: code quality, tests, observability (logs/metrics/traces), on-call ownership for ML services, and SLA adherence.
  • Collaborate cross‑functionally: write design docs/RFCs, partner with PMs and engineers, and drive execution towards predictable outcomes and timelines.
  • Doctorate in Data Science, Mathematics, Statistics, Econometrics, Economics, Operations Research, Computer Science, or related field AND 5+ years data-science experience (e.g., managing structured and unstructured data, applying statistical techniques and reporting results) OR Master's Degree in Data Science, Mathematics, Statistics, Econometrics, Economics, Operations Research, Computer Science, or related field AND 7+ years data-science experience (e.g., managing structured and unstructured data, applying statistical techniques and reporting results) OR Bachelor's Degree in Data Science, Mathematics, Statistics, Econometrics, Economics, Operations Research, Computer Science, or related field AND 10+ years data science experience (e.g., managing structured and unstructured data, applying statistical techniques and reporting results) OR equivalent experience.
  • GenAI experience: fine‑tuning, instruction tuning, RAG pipelines, evaluation harnesses (e.g., task-specific metrics, human-in-the-loop).
  • Strong software engineering: Python + one systems language (C++/Java/Go/Rust), data structures/algorithms, code reviews, testing.
  • MLOps expertise: CI/CD (GitHub Actions/Azure DevOps), containers (Docker), orchestration (Kubernetes), model registry/feature store, monitoring & drift detection.
  • Experimentation: A/B testing design, statistical rigor; metrics for model quality and business impact (precision/recall, ROC/AUC, NDCG/MAP, uplift).
  • Data engineering: SQL, Spark/Databricks, data modeling, data quality and reproducibility.
  • Communication & execution: clear writing, design docs, stakeholder alignment, and consistent delivery to milestones.
  • Proven track record of shipping ML systems to production at scale (not just prototypes); portfolio or references welcome.
  • Applied research skills: reading SOTA literature, rapid replication, hypothesis-driven iteration, and practical adaptation to product constraints.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service