Speechify is seeking a talented Software Engineer specializing in Data Infrastructure & Acquisition to join our AI team in Mumbai, India. In this pivotal role, you will be instrumental in building and optimizing the data collection infrastructure essential for our AI model training operations. Our mission is to construct high-quality, petabyte-scale datasets cost-effectively through seamless integration of infrastructure, engineering, and research.
Responsibilities:
- Proactively identify and integrate new sources of audio data into our ingestion pipeline.
- Manage and enhance our cloud-based ingestion infrastructure, currently running on GCP and managed with Terraform.
- Collaborate closely with AI Scientists to improve data throughput, quality, and cost-efficiency, enabling the development of next-generation models.
- Partner with the AI Team and Speechify Leadership to define the dataset roadmap, supporting the evolution of our consumer and enterprise products.
Qualifications:
- BS/MS/PhD in Computer Science or a related discipline.
- Minimum of 5 years of professional software development experience.
- Proficiency in bash/Python scripting within Linux environments.
- Strong understanding of Docker and Infrastructure-as-Code principles, with practical experience using a major cloud provider (GCP preferred).
- Familiarity with web crawlers and large-scale data processing workflows is advantageous.
- Demonstrated ability to manage multiple priorities in a dynamic environment.
- Excellent written and verbal communication skills.
Why Join Speechify?
- Contribute to a rapidly growing company and product.
- Be part of an entrepreneurial team that encourages innovation and initiative.
- Benefit from a supportive, hands-off management style.
- Make a significant impact in a transformative industry.
- Receive competitive compensation, a positive work environment, and a commitment to asynchronous work culture.
- Work on a product that profoundly impacts users' lives, especially those with learning differences such as dyslexia, ADD, low vision, and autism.
- Operate at the cutting edge of AI and audio technology.
We encourage you to share your portfolio and LinkedIn profile, along with your motivation for applying.