About The Position

Speechify is seeking a skilled Software Engineer to join the Data team within its AI department. The role is central to all aspects of data collection that support model training. The team builds high-quality, petabyte-scale datasets at low cost by tightly integrating infrastructure, engineering, and research. The successful candidate will contribute to this effort by improving data collection processes and infrastructure.

Requirements

  • BS/MS/PhD in Computer Science or a related field.
  • 5+ years of industry experience in software development.
  • Proficiency with bash/Python scripting in Linux environments.
  • Proficiency in Docker and Infrastructure-as-Code concepts.
  • Professional experience with at least one major Cloud Provider (GCP preferred).
  • Ability to handle multiple tasks and adapt to changing priorities.
  • Strong communication skills, both written and verbal.

Nice To Haves

  • Experience with web crawlers.
  • Experience with large-scale data processing workflows.

Responsibilities

  • Find new sources of audio data and integrate them into the ingestion pipeline.
  • Operate and extend the cloud infrastructure for the ingestion pipeline, currently on GCP and managed with Terraform.
  • Collaborate with Scientists to improve the cost, throughput, and quality of data delivery for next-generation models.
  • Collaborate with the AI Team and Speechify Leadership to define the AI Team’s dataset roadmap for future consumer and enterprise products.

Benefits

  • Competitive salaries
  • Bonus
  • Equity
  • Friendly and laid-back atmosphere
  • Commitment to building a great asynchronous culture