Please fill in your name
Mobile phone format error
Please enter the telephone
Please enter your company name
Please enter your company email
Please enter the data requirement
Successful submission! Thank you for your support.
Format error, Please fill in again
The data requirement cannot be less than 5 words and cannot be pure numbers
A major problem with speech recognition datasets on the market is that they focus on European languages or English. For the realization of uncommon language speech recognition, due to the great differences between different languages, artificial intelligence manufacturers need to model separately according to different language characteristics. In order to ensure the effect of speech recognition, high-quality speech recognition dataset in different languages are needed for model optimization. However, the scarcity of high-quality uncommon language speech recognition dataset has become a major bottleneck in speech recognition.
As the world's leading AI data service provider, Datatang currently has pre-labeled speech recognition dataset in more than 30 uncommon languages, which can meet the needs of speech recognition in most uncommon languages. Datatang strictly abides by the relevant regulations, and all the collected speech recognition datasets have been authorized by the person being collected.
Datatang Uncommon Language Speech Recognition Dataset
1751 Vietnamese native speakers participated in the recording with authentic accent. The recorded script is designed by linguists and covers a wide range of topics including generic, interactive, on-board and home. The text is manually proofread with high accuracy. It matches with mainstream Android and Apple system phones.
Thai Speech Recognition Dataset (reading) is collected from 498 Thailand native speakers and is recorded in quiet environment. The recording is rich in content, covering multiple categories such as economics, entertainment, news, figure, and oral. Around 400 sentences for each speaker. The valid data volume is 292 hours. All texts are manual transcribed with high accuracy.
The data is 759 hours long and was recorded by 1,425 Indian native speakers. The accent is authentic. The recording text is designed by language experts and covers general, interactive, car, home and other categories. The text is manually proofread, and the accuracy is high. Recording devices are mainstream Android phones and iPhones. Hindi Speech Recognition Dataset can be applied to speech recognition, machine translation, and voiceprint recognition.
156 Speakers - Mobile Telephony Malay Speech Recognition Dataset_Reading is recorded by native Malay speakers in the quiet environment. The recording is rich in content, covering multiple categories such as economy, entertainment, news, oral language, numbers, and letters. Around 450 sentences for each speaker. The effective time is 134 hours. All texts are manually transcribed to ensure high accuracy.
1285 Indonesian native speakers participated in the recording with authentic accent. The recorded script is designed by linguists and cover a wide range of topics including generic, interactive, on-board and home. The text is manually proofread with high accuracy. It matches with mainstream Android and Apple system phones. Indonesian Speech Recognition Dataset can be applied for automatic speech recognition, and machine translation scenes.
If you want to know more details about the speech recognition datasets or how to acquire, please feel free to contact us: [email protected].
In the past ten years, driven by deep learning, speech recognition technology and applications have achieved rapid development. Related products and services equipped with speech recognition technology, such as voice search, voice input method, smart speaker, smart TV, smart wearable, intelligent customer service, robots, etc. have been widely used in all aspects of our lives
According to Deloitte statistics, it is estimated that by 2030, China's smart voice consumer and enterprise application markets will exceed 70 billion Yuan and 100 billion Yuan respectively. From a global perspective, the scale of the global intelligent voice industry will reach US$35.12 billion in 2022, maintaining a high growth rate of 33.1%.