Aqua Voice is building the voice input layer for the age of AI. We train our own models and build deep OS integrations because doing voice well requires controlling the entire stack. The way people work is changing. IC work is over; you manage AI agents now. This type of work is wonderfully suited for voice. We aren't building conversational agents. We believe the most natural way to interact with your computer is voice in text out (VITO). We believe that voice belongs at a level above the application, and that a small company can win by pushing the envelope. We are applying relentless energy to this opportunity and the results so far have been good. We hope you will join us. The Role We're a small team and growing. You'll touch every part of the system. Recent projects include: - Real-time transcription server handling thousands of concurrent audio streams - Custom speech recognition model training and deployment - Native macOS and Windows integrations using deep system APIs Stack - Frontend : TypeScript, React, Next.js, Electron - Backend : Python, real-time server (Bun/Node.js), WebSockets - Native : Swift (macOS), C# (Windows) - ML : Custom speech recognition models, inference pipelines - Infra : Terraform, Stripe
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Education Level
No Education Listed