en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

The Crucial Role of Pronunciation Dictionaries in Speech Recognition

From:Nexdata Date:2023-09-19

A voice assistant is an intelligent application that helps users solve problems through smart conversations and instant question-and-answer interactions. Common voice assistants in daily life include "Siri" and "Xiao Du." These voice assistants come equipped with corresponding pronunciation dictionaries, which contain all the speech they can recognize.


A pronunciation dictionary is a dictionary that stores the pronunciation of all words and indicates their pronunciation. By using the pronunciation dictionary, a mapping relationship is established between the acoustic modeling units and the language modeling units, connecting the acoustic model and the language model. This creates a search state space that can be used by the decoder for decoding.


A sentence can be formed by combining several words, and the pronunciation dictionary allows us to obtain the phoneme sequence of the pronunciation of each word. The transition probabilities between adjacent words can be obtained through a language model, while the probability model of phonemes is mainly obtained through an acoustic model. This results in a probability model for a sentence.


In a speech recognition system, the larger the amount of data covered by the pronunciation dictionary, the higher the accuracy of speech recognition. When encountering new vocabulary, these words and their corresponding phonetic transcriptions can be added to the pronunciation dictionary, continuously expanding the vocabulary within it. It can be said that the three main factors for measuring the quality of a pronunciation dictionary are vocabulary size, phonetic labeling, and the accuracy of proofreading.


Currently, due to the need for professional control over the collection, labeling, and cleaning of pronunciation dictionaries, the performance of a speech recognition system can be impacted if there is not a large number of accurate pronunciation dictionaries that cover a wide range of vocabulary.


Nexdata Pronunciation Dictionary Corpus


80,279 Cantonese Pronunciation Dictionary

This pronunciation dictionary collects words with dialect characteristics in Guangdong cantonese regions. Each entry consists of three parts: words, pinyin and tones. The dictionary can be used to provide pronunciation reference for sound recording personnel, research and development of pronunciation recognition technology, etc.


101,702 Japanese Pronunciation Dictionary

The data contains 101,702 entries. All words and pronunciations are produced by Japanese linguists. It can be used in the research and development of Japanese ASR technology.


500,113 English Pronunciation Dictionary

The data contains 500,113 entries. All words and pronunciations are produced by English linguists. It can be used in the research and development of English ASR technology.


444,202 Korean Pronunciation Dictionary

The data contains 444,202 entries. All words and pronunciations are produced by Korean linguists. It can be used in the research and development of Korean ASR technology.

10106ee2-6108-4208-9ffb-d46ecd795092