Please fill in your name
Mobile phone format error
Please enter the telephone
Please enter your company name
Please enter your company email
Please enter the data requirement
Successful submission! Thank you for your support.
Format error, Please fill in again
The data requirement cannot be less than 5 words and cannot be pure numbers
Recently, scientists has performed a speech recognition capability test on some voice assistant on the market. Researchers found voice assistants including Amazon Echo, Google Home and other devices had recognition errors in the scene of interacting with children.
Different from adults, children’s voices have natural technical difficulties due to their voice and pronunciation characteristics. More importantly, children are not good at interacting with the voice assistant with the way that machines can understand. Whether it is a more friendly interactive interface or a more intelligent voice assistant, the recognition effect is not satisfactory.
The importance of high-quality children speech data is evident, in order to train a smarter voice assistant. As a professional AI data services provider, Datatang has accumulated 4,000 hour high-quality children speech data, to supports the research and application of children voice interactive products.
Mobile phone captured audio data of Chinese children, with total duration of 3,255 hours. 9,780 speakers are children aged 6 to 12, with accent covering seven dialect areas; the recorded text contains common children languages such as essay stories, numbers, and their interactions on cars, at home, and with voice assistants, precisely matching the actual application scenes.
Children read English audio data, covering ages from preschool (3–5 years old) to post-school (6–12 years old), with children’s speech features; content accurately matches children’s actual scenes of speaking English. It provides data support for children’s smart home, automatic speech recognition and oral assessment in intelligent education scene.
It is recorded by 219 American children native speakers. The recording texts are mainly storybook, children’s song, spoken expressions, etc. 350 sentences for each speaker. Each sentence contain 4.5 words in average. Each sentence is repeated 2.1 times in average.
It collects 201 British children. The recordings are mainly children textbooks, storybooks. The average sentence length is 4.68 words and the average sentence repetition rate is 6.6 times. This data is recorded by high fidelity microphone.
If the above data cannot meet the needs of your current research, Datatang also provides data customization services for specific groups of people, specific scenarios, and specific languages to meet customers’ diversified data needs.
If you need data services, please feel free to contact us: firstname.lastname@example.org
<p class="iq ir hl is b it iu hp iv iw ix ht iy iz ja jb jc jd je jf jg jh ji jj jk jl dm ii" data-selectable-paragraph="" id="6858">The Scale.up 360 Sensor & Radar Systems Europe 2021 Conference was held online from November 17 to 18, 2021. The digital event is involved about the latest advances and technologies of Sensor Fusion — Camera, Radar and LiDAR in ADAS & High Level Autonomous Driving.</p>
<p class="iq ir hl is b it iu hp iv iw ix ht iy iz ja jb jc jd je jf jg jh ji jj jk jl dm ii" data-selectable-paragraph="" id="5038">With the expansion of AI applications, dialect recognition has received increasing attention. However, due to the huge difference between Chinese dialects and Mandarin, the speech recognition of Chinese dialects is much more complicated.</p><p class="iq ir hl is b it iu hp iv iw ix ht iy iz ja jb jc jd je jf jg jh ji jj jk jl dm ii" data-selectable-paragraph="" id="5de4">Generally speaking, the speech data collection is to record commonly used sentences and words through text, phonetic…</p>