AISHELL-3 高保真中文语音数据库

AISHELL-3 Open Source HI-FI Mandarin Speech Corpus

      希尔贝壳中文普通话语音数据库AISHELL-3的语音时长为85小时88035句,可做为多说话人合成系统。录制过程在安静室内环境中, 使用高保真麦克风(44.1kHz,16bit)。218名来自中国不同口音区域的发言人参与录制。专业语音校对人员进行拼音和韵律标注,并通过严格质量检验,此数据库音字确率在98%以上。

 

 

AISHELL-3 is a large-scale and high-fidelity multi-speaker Mandarin speech corpus which could be used to train multi-speaker Text-to-Speech (TTS) systems. The corpus contains roughly 85 hours of emotion-neutral recordings spoken by 218 native Chinese mandarin speakers and total 88035 utterances. Their auxiliary attributes such as gender, age group and native accents are explicitly marked and provided in the corpus. Accordingly, transcripts in Chinese character-level and pinyin-level are provided along with the recordings. The  word & tone transcription accuracy rate is above 98%, through professional speech annotation and strict quality inspection for tone and prosody. 

语音合成实验

Text-To-Speech (TTS) Systems

85小时 | 85 Hours

88035 句 | 88035 Utterances

218 人 | 218 Speakers

开源TTS系统应用

Open Source

Non-Open Source

 

       数据使用申请                Company:bd@aishelldata.com      

 

Service  Application          Academic Institution:aishell.foundation@gmail.com     

数据下载

 

论 文

 

License: Apache License v.2.0

Dataset
arxiv

基线系统

 

相关课程

 

AISHELL-3

语音合成实战

 

了解课程详情
Recipe