Speechify is seeking a skilled Software Engineer to join the Data side of their AI team. This role is responsible for all aspects of data collection to support model training operations. The team is capable of building high-quality datasets at petabyte-scale and low cost through a tight integration of infrastructure, engineering, and research work. The engineer will help find new sources of audio data, bring it into the ingestion pipeline, and operate and extend the cloud infrastructure for this pipeline, which currently runs on GCP and is managed with Terraform. Collaboration with Scientists and AI Team leadership is key to defining the dataset roadmap and improving the cost/throughput/quality frontier for next-generation models and products.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior