1,260 Hours - Italian(Italy) Scripted Monologue Smartphone speech dataset

Italian voice data
mobile phone voice data
voice acquisition data

Italian(Italy) Scripted Monologue Smartphone speech dataset, recorded as monologues read from given prompts. It covers the oral, human-machine interaction, smart home command, in-car command, numbers, and news domains. Each recording is transcribed with its text content. The data was collected from a broad, geographically diverse pool of 3,109 native speakers, which helps improve model performance on real, complex tasks. Quality has been tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, safeguarding user privacy and legal rights throughout data collection, storage, and use. All our datasets comply with GDPR, CCPA, and PIPL.

Paid Datasets
This is a paid dataset for commercial use, research, and more. Licensed, ready-made datasets help jump-start AI projects.
Specifications
Format: 16 kHz, 16-bit, uncompressed WAV, mono channel
Content category: oral; human-machine interaction; smart home command and in-car command; numbers; news
Recording condition: Low background noise (indoor)
Recording device: Android smartphone, iPhone
Country: Italy (ITA)
Language (Region) Code: it-IT
Language: Italian
Speaker: 3,109 people from Italy, 48% male and 52% female
Features of annotation: Transcription text
Device: Android mobile phone, iPhone
Accuracy rate: Word Accuracy Rate (WAR) 95%
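
The Format entry above (16 kHz, 16-bit, mono, uncompressed WAV) can be checked on a delivered file. The following is a minimal sketch, assuming the audio arrives as standard RIFF WAV files readable by Python's built-in `wave` module. The function name `check_wav_format` and the constant names are hypothetical and are not part of any vendor tooling.

```python
import wave

# Expected values taken from the Format entry: 16 kHz, 16-bit, mono, uncompressed.
EXPECTED_RATE_HZ = 16000
EXPECTED_SAMPLE_WIDTH_BYTES = 2  # 16 bits per sample
EXPECTED_CHANNELS = 1            # mono


def check_wav_format(path):
    """Return a list of mismatches between a WAV file and the listed format.

    An empty list means the file header matches the specification.
    """
    problems = []
    with wave.open(path, "rb") as wav:
        if wav.getframerate() != EXPECTED_RATE_HZ:
            problems.append(f"sample rate is {wav.getframerate()} Hz")
        if wav.getsampwidth() != EXPECTED_SAMPLE_WIDTH_BYTES:
            problems.append(f"sample width is {8 * wav.getsampwidth()} bit")
        if wav.getnchannels() != EXPECTED_CHANNELS:
            problems.append(f"channel count is {wav.getnchannels()}")
        # The wave module reports "NONE" for uncompressed PCM data.
        if wav.getcomptype() != "NONE":
            problems.append(f"compression type is {wav.getcomptype()}")
    return problems
```

Calling `check_wav_format("some_recording.wav")` (a placeholder path) returns an empty list when the header matches. This only inspects header fields and does not assess noise level or transcription accuracy.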
Sample
  • Audio

    le undici e ventitre

  • Audio

    Com'è la primavera in DA QAIDAM ?

  • Audio

    Paolino, tua madre ti parla mai di me?

  • Audio

    Via Lattea, finalmente sappiamo com'è distribuita l'energia luminosa della galassia

  • Audio

    Mi diminuiresti la velocità dell'aria condizionata

Recommended Datasets
2,028 Hours - Mandarin(China) Scripted Monologue Smartphone speech dataset

Mandarin(China) Scripted Monologue Smartphone speech dataset, recorded as monologues read from given scripts. It covers the generic domain, human-machine interaction, smart home command and control, in-car command and control, numbers, and other domains. Recordings are transcribed with text content and other attributes. The data was collected from a broad, geographically diverse pool of speakers, which helps improve model performance on real, complex tasks. Quality has been tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, safeguarding user privacy and legal rights throughout data collection, storage, and use. All our datasets comply with GDPR, CCPA, and PIPL.

mandarin speech data Scripted Monologue speech data chinese speech data
11,010 People - Mandarin(China) Digital Smartphone speech dataset

Mandarin(China) Digital Smartphone speech dataset, in which each speaker reads 30 sentences, each consisting of a 4- to 8-digit number. Quality has been tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, safeguarding user privacy and legal rights throughout data collection, storage, and use. All our datasets comply with GDPR, CCPA, and PIPL.

Mandarin Digital voice print
18 Hours - English(Brazil) Scripted Monologue Smartphone speech dataset

English(Brazil) Scripted Monologue Smartphone speech dataset, recorded as monologues read from given scripts. It covers the generic domain, human-machine interaction, smart home command and control, in-car command and control, numbers, and other domains. Recordings are transcribed with text content and other attributes. The data was collected from a geographically diverse pool of speakers (55 people in total), which helps improve model performance on real, complex tasks. Quality has been tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, safeguarding user privacy and legal rights throughout data collection, storage, and use. All our datasets comply with GDPR, CCPA, and PIPL.

Accent English Brazil English
207 Hours - English(Japan) Scripted Monologue Smartphone speech dataset

English(Japan) Scripted Monologue Smartphone speech dataset, recorded as monologues read from given scripts. It covers the generic domain, human-machine interaction, smart home command and control, in-car command and control, numbers, and other domains. Recordings are transcribed with text content and other attributes. The data was collected from a geographically diverse pool of speakers (464 people in total), which helps improve model performance on real, complex tasks. Quality has been tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, safeguarding user privacy and legal rights throughout data collection, storage, and use. All our datasets comply with GDPR, CCPA, and PIPL.

Accent English Japanese Japan English
207 Hours - English(Canada) Scripted Monologue Smartphone speech dataset

English(Canada) Scripted Monologue Smartphone speech dataset, recorded as monologues read from given scripts. It covers the generic domain, human-machine interaction, smart home command and control, in-car command and control, numbers, and other domains. Recordings are transcribed with text content and other attributes. The data was collected from a geographically diverse pool of speakers (466 people in total), which helps improve model performance on real, complex tasks. Quality has been tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, safeguarding user privacy and legal rights throughout data collection, storage, and use. All our datasets comply with GDPR, CCPA, and PIPL.

Canada English Accent English asr datasets
199 Hours - English(Australia) Scripted Monologue Smartphone speech dataset

English(Australia) Scripted Monologue Smartphone speech dataset, recorded as monologues read from given scripts. It covers the generic domain, human-machine interaction, smart home command and control, in-car command and control, numbers, and other domains. Recordings are transcribed with text content and other attributes. The data was collected from a geographically diverse pool of speakers (402 people in total), which helps improve model performance on real, complex tasks. Quality has been tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, safeguarding user privacy and legal rights throughout data collection, storage, and use. All our datasets comply with GDPR, CCPA, and PIPL.

Australia casual speech data Australia casual data Australia casual dataset Australia casual conversation Australia casual conversation data Australia casual conversation dataset Australia casual chat data
201 Hours - English(Singapore) Scripted Monologue Smartphone speech dataset

English(Singapore) Scripted Monologue Smartphone speech dataset, recorded as monologues read from given scripts. It covers the generic domain, human-machine interaction, smart home command and control, in-car command and control, numbers, and other domains. Recordings are transcribed with text content and other attributes. The data was collected from a geographically diverse pool of speakers (452 people in total), which helps improve model performance on real, complex tasks. Quality has been tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, safeguarding user privacy and legal rights throughout data collection, storage, and use. All our datasets comply with GDPR, CCPA, and PIPL.

English speech data singaporean speech dataset singapore english speech data
198 Hours - English(Malaysia) Scripted Monologue Smartphone speech dataset

English(Malaysia) Scripted Monologue Smartphone speech dataset, recorded as monologues read from given scripts. It covers the generic domain, human-machine interaction, smart home command and control, in-car command and control, numbers, and other domains. Recordings are transcribed with text content and other attributes. The data was collected from a geographically diverse pool of speakers (423 people in total), which helps improve model performance on real, complex tasks. Quality has been tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, safeguarding user privacy and legal rights throughout data collection, storage, and use. All our datasets comply with GDPR, CCPA, and PIPL.

Accent English Malaysia English