Open language resources and strategies for building speech systems in under-resourced languages
By Shekhar Nayak
October 18, 2024
Summary
Open Language Resources: Use open resources such as the Common Voice project, Hugging Face's pre-trained models, and the Zero Resource Speech Challenge for multilingual speech recognition and synthesis data.
Data Augmentation Strategies: Techniques such as time stretching and pitch shifting generate additional training examples from existing recordings, which makes augmentation a valuable approach for building voice AI systems in under-resourced languages.
Building Voice AI Systems: Dr. Shekhar Nayak shares insights on open language resources and strategies for building speech systems in under-resourced languages, including data augmentation and multilingual acoustic modeling techniques.
Data Scarcity Challenges: Data scarcity remains a significant challenge in building voice AI systems, particularly for under-resourced languages, so addressing it is crucial.
Generated using GPT-4o-mini.
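The two augmentation techniques named in the summary, time stretching and pitch shifting, can be illustrated with a minimal sketch. This is not the method presented in the talk. It is a deliberately naive overlap-add implementation in plain Python, written only to show the idea. The function names (time_stretch_ola, resample_linear, pitch_shift, augment), the frame size, and the default rates and semitone steps are all assumptions chosen for illustration. A real pipeline would more likely rely on an audio library, which handles phase far better than this sketch does.

```python
import math


def time_stretch_ola(samples, rate, frame=512):
    """Naive overlap-add time stretch (hypothetical helper, illustration only).

    rate > 1 shortens the signal (faster speech), rate < 1 lengthens it.
    Pitch is roughly preserved because frames are copied, not resampled.
    """
    hop_out = frame // 2                    # synthesis hop (50% overlap)
    hop_in = hop_out * rate                 # analysis hop scales with rate
    window = [0.5 - 0.5 * math.cos(2 * math.pi * j / frame) for j in range(frame)]
    padded = list(samples) + [0.0] * frame  # lets the last frames run past the end
    n_out = int(round(len(samples) / rate))
    out = [0.0] * (n_out + frame)
    norm = [0.0] * (n_out + frame)
    k = 0
    while True:
        start_in = int(round(k * hop_in))
        start_out = k * hop_out
        if start_in + frame > len(padded) or start_out >= n_out:
            break
        for j in range(frame):
            out[start_out + j] += window[j] * padded[start_in + j]
            norm[start_out + j] += window[j]
        k += 1
    # Divide by the summed window weights so the overall level stays stable.
    return [o / w if w > 1e-8 else 0.0 for o, w in zip(out[:n_out], norm[:n_out])]


def resample_linear(samples, factor):
    """Linear-interpolation resampling; output length is about len / factor."""
    if not samples:
        return []
    n_out = max(1, int(round(len(samples) / factor)))
    last = len(samples) - 1
    out = []
    for i in range(n_out):
        pos = min(i * factor, last)
        lo = int(pos)
        hi = min(lo + 1, last)
        frac = pos - lo
        out.append((1.0 - frac) * samples[lo] + frac * samples[hi])
    return out


def pitch_shift(samples, semitones, frame=512):
    """Shift pitch by stretching in time, then resampling back to the original length."""
    factor = 2.0 ** (semitones / 12.0)
    stretched = time_stretch_ola(samples, 1.0 / factor, frame)  # longer if factor > 1
    return resample_linear(stretched, factor)                   # back to ~original length


def augment(samples, rates=(0.9, 1.1), semitone_steps=(-2, 2)):
    """Return labelled variants of one utterance (default values are assumptions)."""
    variants = [("stretch_%.2f" % r, time_stretch_ola(samples, r)) for r in rates]
    variants += [("pitch_%+d" % s, pitch_shift(samples, s)) for s in semitone_steps]
    return variants
```

Calling augment on a list of float samples from one recording returns four labelled variants with the default settings, each of which can be paired with the original transcript as an extra training example. Plain overlap-add introduces audible artifacts on real speech, so the sketch is best read as an explanation of the technique, not as production code.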