VAIS-1000: A Vietnamese Speech Synthesis Corpus
This data consists of 1000 studio-quality audios and their transcription for Vietnamese northern accent.
Each utterance has a length of 14-18 words and is spoken by a single speaker.
The corpus can be used to create a Vietnamese speech synthesis system. A tutorial also available at https://vais.vn/vi/tai-ve/hts_for_vietnamese.
Instruction can be found at this website