开源数据产品              AISHELL-3


共享数据,助力人工智能发展。




AISHELL-3 高保真中文语音数据库

AISHELL-3 Open Source HI-FI Mandarin Speech Corpus


       希尔贝壳中文普通话语音数据库AISHELL-3的语音时长为85小时88035句,可做为多说话人合成系统。录制过程在安静室内环境中, 使用高保真麦克风(44.1kHz,16bit)。218名来自中国不同口音区域的发言人参与录制。专业语音校对人员进行拼音和韵律标注,并通过严格质量检验,此数据库音字确率在98%以上。(支持学术研究,未经允许禁止商用。)


AISHELL-3 is a large-scale and high-fidelity multi-speaker Mandarin speech corpus which could be used to train multi-speaker Text-to-Speech (TTS) systems. The corpus contains roughly 85 hours of emotion-neutral recordings spoken by 218 native Chinese mandarin speakers and total 88035 utterances. Their auxiliary attributes such as gender, age group and native accents are explicitly marked and provided in the corpus. Accordingly, transcripts in Chinese character-level and pinyin-level are provided along with the recordings. The  word & tone transcription accuracy rate is above 98%, through professional speech annotation and strict quality inspection for tone and prosody. ( This database is free for academic research, not in the commerce, if without permission. )



85小时 | 85 Hours

88035 句 | 88035 Utterances

218 人 | 218 Speakers


语音合成实验

Text-To-Speech (TTS) Systems


开源TTS系统应用

Open Source


    • 客服
    • 电话:010-80225006
    • 邮箱:bd@aishelldata.com

Download


  数据使用申请                  Company:bd@aishelldata.com      


Service  Application          Academic Institution:aishell.foundation@gmail.com     


arxiv
aishell3 System
Readme

License: Apache License v.2.0

Sample
Netdisk
本网站由阿里云提供云计算及安全服务