en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

m.nexdata.datatang.com

144 Hours Arabic Speech Dataset with Transcriptions for Speech Recognition

arabic speech dataset
arabic voice dataset
arabic speech corpus
arabic audio dataset
speaker diarization dataset

This dataset contains 144 hours of Arabic conversational speech recorded through spontaneous dialogues using smartphones. It includes high-quality transcriptions, speaker IDs, gender, and additional metadata. The recordings were collected from speakers across diverse geographic regions and demographic backgrounds, helping improve model performance in real-world conversational speech applications. The dataset has been quality-validated by multiple AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

Paid Datasets
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
SpecificationsSpecifications
Format
16kHz, 16 bit, wav, mono channel;
Content category
Recorders in free conversation without a set topic;
Recording condition
Low background noise (indoor);
Recording device
Android smartphone, iPhone;
Language
Arabic;
Features of annotation
Transcription text, timestamp, speaker ID, gender.
Accuracy Rate
Word Accuracy Rate (WAR) 97%
Sample Sample
  • Audio

    هل في طريقة أزيد فيها مستوى التأمين على حسابي؟

  • Audio

    وإذا كنت أحتاج مستند يوضح تفاصيل التأمين لحساباتي، كيف أقدر أحصله؟

  • Audio

    طيب وش الإجراءات اللي تتم في حال صار أي خلل في البنك، كيف أقدر استرجع فلوسي؟

  • Audio

    يعني ما يحتاج أقدم طلب وأتابع الموضوع بنفسي؟

  • Audio

    تمام، بخصوص الحسابات اللي مسجل فيها أكثر من مستفيد، كيف يتم التعامل معها في التأمين؟

Recommended DatasetsRecommended Dataset
Tell Us Your Special Needs

Current Project Maturity

Early exploration (no concrete specs yet)
Defined goals, need professional guidance
Active development or optimization phase
Data & labeling experts with clear specifications

By submitting, I agree to the Privacy Protection

fd5a4222-494c-46b5-94a2-e0d61485280e

0a205e6b-1e5a-448b-ba52-3ae1249e43e0