Speech Recognition Datasets

Instantly enhance AI model performance with high quality off-the-shelf datasets.

Language

All (7)
Arabic (4)
Burmese (2)
Chinese Dialects (13)
English (45)
French (11)
German (9)
Hindi (6)
Indonesian (8)
Italian (9)
Japanese (8)
Korean (13)
Malay (5)
Mandarin (11)
Others (49)
Portuguese (12)
Russian (6)
Spanish (14)
Thai (8)
Vietnamese (7)

Data Type

All (7)
Dialogue (117)
Read (111)

In-Car Noise Dataset – 531 Hours of Cabin Recordings

This dataset contains 531 hours of in-car ambient noise recordings captured using microphones and mobile phones across various vehicle models, road types, speeds, and cabin conditions such as windows open or closed. The noise was recorded at six distinct points inside each vehicle to reflect spatial diversity and better support vehicle sound modeling. The dataset captures real-world driving environments, engine hums, road interactions, wind noise, and cabin reverberation. It is ideal for use cases such as noise suppression, automatic speech recognition (ASR) in cars, in-vehicle audio enhancement, and sound source separation. Validated by leading AI companies, the dataset complies fully with global data privacy regulations including GDPR, CCPA, and PIPL, making it suitable for both research and commercial applications.
in-car noise dataset, vehicle interior sound, car ambient noise, automotive audio dataset, cabin noise data, car sound modeling, speech enhancement training data, vehicle noise cancellation dataset

In-Bus Ambient Noise Dataset - 19 Hours of Real-World Audio Recordings

This dataset contains 19 hours of real-world ambient noise recordings captured in bus environments using the Tascam DR-07x voice recorder. The audio was collected from both in-bus scenes and bus platform areas under various real-life conditions. The dataset reflects authentic public transportation soundscapes including engine hum, passenger chatter, door movement, platform announcements, and other background noises. It is suitable for a wide range of applications such as acoustic scene classification, noise suppression, automatic speech recognition (ASR) in noisy environments, and audio enhancement models. The dataset has been tested and validated by leading AI companies and adheres strictly to global data protection standards including GDPR, CCPA, and PIPL, ensuring compliance and safe use in commercial and research applications.
bus noise dataset, in-bus ambient sound, public transport audio data, vehicle noise dataset, DR-07x noise recording, transportation ambient sound, real-world noise dataset, speech enhancement noise dataset

1,297 Hours Environmental Noise Dataset – Multi-Scenario Real-World Audio by Voice Recorder

This dataset contains 1,297 hours of environmental noise recordings collected using voice recorders across diverse real-world scenarios, including subways, supermarkets, restaurants, roads, and more. All recordings are annotated with timestamps and relevant metadata, making the data ideal for training AI models in noise reduction, environmental sound classification, audio preprocessing, and speech enhancement tasks. The dataset has been validated by leading AI companies and complies with global data protection regulations including GDPR, CCPA, and PIPL.
real-world noise data, voice recorder audio, subway noise dataset, supermarket audio dataset, noise reduction training data, sound classification dataset, AI training audio data

101 Hours Environmental Noise Audio Dataset – Multi-Scene Ambient Sounds Recorded by Voice Recorder

This dataset contains 101 hours of environmental noise recordings captured using high-quality voice recorders across a wide range of real-world locations, including subways, supermarkets, dining halls, city streets, airports, cinemas, expressways, and high-speed trains. It offers diverse ambient sound conditions ideal for tasks such as noise reduction, environmental sound classification, and audio preprocessing in speech or sound recognition systems. The data has been validated by major AI companies and complies with data protection regulations including GDPR, CCPA, and PIPL.
environmental noise dataset, ambient sound recordings, voice recorder audio dataset, background noise dataset, subway noise, city soundscapes, airport audio data, sound recognition dataset, AI training audio, multi-scene noise dataset

Far-Field In-Home Noise Dataset – 10 Hours from Microphone Arrays

This 10-hour Far-Field In-Home Noise Dataset was collected using multiple types of microphone arrays installed in real family home environments. Each mic array setup offers varied spatial capture perspectives, making the dataset ideal for AI tasks such as far-field automatic speech recognition (ASR), voice enhancement, smart speaker training, and multi-microphone signal processing. All data has undergone rigorous quality validation and complies with global privacy regulations including GDPR, CCPA, and PIPL.
far-field audio dataset, in-home noise dataset, mic array audio, microphone array dataset, household sound recordings, smart home dataset, far-field ASR dataset, voice enhancement data, indoor noise audio dataset, multi-mic speech dataset

Radio Frequency Noise Dataset – 20 Hours Indoor Microphone Audio

This dataset contains 20 hours of radio frequency noise audio recorded via high-quality microphones in 66 different rooms, with 2–4 recording points per room and multiple recording angles per point. The setup simulates real-world RF interference and ambient indoor noise scenarios, supporting tasks like sound source localization, acoustic modeling, and noise classification. The dataset has been validated by leading AI firms and complies with all major privacy regulations, including GDPR, CCPA, and PIPL.
radio frequency noise dataset, RF noise audio, microphone noise data, ambient noise dataset, indoor noise audio, sound interference dataset, noise detection dataset, sound source localization dataset, acoustic modeling data, background noise recordings

205 People - Mandarin Chinese (China) Noisy Monologue Smartphone Speech Dataset_Guiding

This Mandarin Chinese (China) noisy monologue smartphone speech dataset (Guiding) was collected from monologues based on given prompts. It covers generic domains such as in-car, smart home, and voice assistant scenarios, and was recorded in noisy conditions. The recordings are transcribed with text content and other attributes. The data was collected from a large, geographically diverse pool of 205 speakers, which helps improve model performance on real-world, complex tasks. Quality has been tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, protecting user privacy and legal rights throughout data collection, storage, and usage; all of our datasets comply with GDPR, CCPA, and PIPL.
noise voice, accent, Mandarin, mobile phone to collect voice data, guide voice

Tailor Your Data Now

Why off-the-shelf Datasets

  • Copyright

    Clear copyright and ready to check
  • Security

    Properly authorized and secure to use
  • Professional

    Designed and produced by AI data experts
  • Diversity

    Collected from a variety of real scenes
  • Cost Effective

    More cost-efficient than tailored data
  • Efficiency

    Ready to go and delivered in seconds