You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
DaCiDian is an open-sourced lexicon for Chinese Automatic Speech Recognition(ASR)
Design
In mainstream ASR system, lexicon is a core component, that maps word into acoustic modeling units(such as phone). In DaCiDian, we break the mapping into 2 independent layers:
word --> PinYin syllable --> phoneme
The purpose of this design is as follows:
Anyone who is familiar with PinYin (basically every mandarin speaker), can enrich DaCiDian's vocabulary, by adding new entry(word) into the layer-1 mapping.
ASR system developers can easily adapt DaCiDian to their own phone set by defining their own layer-2 mapping.
Layer-1: Word -> Syllable Mapper (word_to_pinyin.txt)
word and its pronunciation(s) are seperated by tab
multiple pronunciations are seperated by ;
each pronunciation is a sequence of Chinese PinYin syllables, seperated by space
pinyin_to_phone is a user-defined mapping from PinYin syllables to target phone set
Take traditional PinYin's Initial-Final structure for example, a mapping should be defined as follows:
A $0 a
AI $0 ai
AN $0 an
ANG $0 ang
AO $0 ao
BA b a
BAI b ai
BAN b an
BANG b ang
BAO b ao
...
...
...
ZONG z ong
ZOU z ou
ZU z u
ZUAN z uan
ZUI z ui
ZUN z un
ZUO z uo
syllable and its phoneme representation are seperated by tab
$0 is the NULL-Initial phone.
Notes
English word in Chinese context is a problem. An upgraded version covering both English and Chinese can be found here BigCiDian
About
DaCiDian is an open-sourced chinese mandarin lexicon for automatic speech recognition(ASR)