RESPIN-S1.0: A read speech corpus of 10000+ hours in dialects of nine Indian Languages
#1803 · Saurabh Kumar, Abhayjeet Singh, DEEKSHITHA G, Amartya veer, Jesuraj Bandekar, Savitha Murthy, Sumit Sharma, Sandhya Badiger, Sathvik Udupa, Amala Nagireddi, Srinivasa Raghavan K M, Rohan Saxena, Jai Nanavati, Raoul Nanavati, Janani Sridharan, Arjun Mehta, Ashish S, Sai Mora, Prashanthi Venkataramakrishnan, Gauri Date, Karthika P, Prasanta Ghosh
We present RESPIN-S1.0, a large-scale, dialect-rich corpus of over 10,000 hours of read speech across nine major Indian languages: Bengali, Bhojpuri, Chhattisgarhi, Hindi, Kannada, Magahi, Maithili, Marathi, and Telugu.