en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

English Speech Emotion Dataset – Labeled Audio from 20 Native Speakers with 10 Emotions

emotional speech dataset
English speech emotion dataset
speech emotion recognition
SER dataset
emotional audio data
labeled emotional speech
affective computing dataset
native English speakers
AI speech dataset

This English Emotional Speech Dataset features recordings from 20 native American English speakers. Each participant performed scripted monologues expressing 10 distinct emotions, including anger, happiness, sadness, fear, disgust, and others, simulating real-world scenarios. The recordings were captured via high-quality microphones and are accompanied by accurate transcriptions and relevant metadata.The dataset is ideal for training and evaluating speech emotion recognition (SER) systems, emotional TTS, affective computing, and conversational AI applications. Its geographic and speaker diversity enhances generalizability in real-life environments.All data was collected in compliance with international data privacy laws including GDPR, CCPA, and PIPL, ensuring legal and ethical use in both research and commercial settings. The dataset has been validated by multiple AI companies for performance benchmarking.

Paid Datasets
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
SpecificationsSpecifications
Format
44.1kHz,16bit, uncompressed wav, mono channel;
Recording condition
Low background noise(indoor), without echo ;
Content category
10 types of emotional scripts;
Recording device
Hi-Fi microphone;
Speaker
20 American, 50% male and 50% female;
Country
the United States(USA);
Language(Region) Code
en-US;
Language
English;
Features of annotation
Transcription text;
Accuracy Rate
Sentence Accuracy Rate (SAR) 95%
Sample Sample
  • Audio

    I am so happy{-laughter=mmm-} to hear that!

  • Audio

    I was overcome with joy{-laughter=haha-} when hearing the news.

  • Audio

    {-laughter=yeah-}I'm okay.

  • Audio

    This game is amazing.

  • Audio

    I was so happy{-laughter=aww-}to hear the good news.

Recommended DatasetsRecommended Dataset
Tell Us Your Special Needs

By submitting, I agree to the Privacy Protection

1ac6c542-2631-4800-8c66-1606d01b0123

5d128ad7-19e2-4ef2-958e-c1053014dbd3