Software Engineer
Full Job Description
Speechify seeks a Software Engineer for its Data Infrastructure & Acquisition team in Ahmedabad, India, to support AI model training operations. The role is central to building high-quality, petabyte-scale datasets cost-effectively by combining infrastructure, engineering, and research. The engineer will be responsible for all aspects of data collection.
What You'll Do:
- Identify and integrate new audio data sources into the ingestion pipeline.
- Maintain and improve the ingestion pipeline's cloud infrastructure, using GCP and Terraform.
- Collaborate with AI Scientists to optimize the cost, throughput, and quality of data for next-generation models.
- Contribute to defining the AI team's dataset roadmap for Speechify's consumer and enterprise products.
Ideal Candidate Profile:
- Bachelor's/Master's/PhD in Computer Science or a related field.
- Minimum 5 years of professional software development experience.
- Proficiency in bash/Python scripting within Linux environments.
- Expertise in Docker and Infrastructure-as-Code, with practical experience in a major cloud provider (GCP preferred).
- Experience with web crawlers and large-scale data processing workflows is a plus.
- Proven ability to manage multiple tasks and adapt to evolving priorities.
- Excellent written and verbal communication skills.
Speechify offers a dynamic, fast-growing environment with a focus on empowering employees and fostering an entrepreneurial spirit. Benefits include competitive salaries, a supportive culture, and the chance to work on a transformative product impacting millions, particularly individuals with learning differences.
Company
Speechify
Speechify is a globally distributed company dedicated to removing reading as a barrier to learning. With over 50 million users, Speechify's text-to-speech products transform various reading materials ...