How to convert a Tacotron 2 dataset for RHVoice that uses HTS? #632
Unanswered
rmcpantoja
asked this question in
Q&A
Replies: 1 comment 3 replies
-
Hi,
It might succeed, as there are 280 sentences.
Can you send me your foma and fst language data files for Spanish to test this on my setup first?
My mail:
***@***.***
You will need to send me your data-only language files.
From: Mateo Cedillo
Sent: Saturday, October 1, 2022 5:26 AM
To: RHVoice/RHVoice
Subject: [RHVoice/RHVoice] How to convert a Tacotron 2 dataset for RHVoice that uses HTS? (Discussion #632)
Hello,
I'm starting to create a voice to contribute. I have read the voice creation page on the wiki, but I don't really understand how the dataset is supposed to be built. I know the audio must be .raw files in a wav folder, and I assume the transcripts go in a txt file...
The Tacotron 2 dataset is similar, except that it has a wavs folder in which every audio file is WAV, 22050 Hz, 16-bit, mono, and a list.txt file that contains the transcripts.
Here is an example dataset for Tacotron 2 <https://drive.google.com/drive/folders/1BQXLRkjATVZWMXB0zU25J08PHdC429AA?usp=sharing>.
What do I need to do with this dataset to turn it into a dataset for RHVoice?
Thanks.
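For anyone with the same question, here is a rough sketch of the mechanical part of such a conversion: stripping the WAV headers to get headerless .raw files in a wav folder, and reading the transcripts out of list.txt. It is only an illustration under assumptions that are not confirmed in this thread: list.txt is assumed to use the common `path|text` layout, the function names are made up for this example, and no resampling is done, so check the wiki for the sample rate and transcript format RHVoice actually expects.

```python
import wave
from pathlib import Path


def wav_to_raw(wav_path, raw_path):
    """Copy the PCM samples of a WAV file into a headerless .raw file."""
    with wave.open(str(wav_path), "rb") as wav:
        channels = wav.getnchannels()
        sample_width = wav.getsampwidth()  # bytes per sample, 2 means 16-bit
        frames = wav.readframes(wav.getnframes())
    if (channels, sample_width) != (1, 2):
        raise ValueError(f"{wav_path}: expected 16-bit mono audio")
    Path(raw_path).write_bytes(frames)


def read_transcripts(list_path):
    """Parse 'path|text' lines (ASSUMED layout) into {file stem: text}."""
    transcripts = {}
    for line in Path(list_path).read_text(encoding="utf-8").splitlines():
        if not line.strip():
            continue  # skip blank lines
        audio, _, text = line.partition("|")
        transcripts[Path(audio.strip()).stem] = text.strip()
    return transcripts


def convert_dataset(src_dir, dst_dir):
    """Convert src_dir/wavs/*.wav to dst_dir/wav/*.raw; return the transcripts."""
    src, dst = Path(src_dir), Path(dst_dir)
    (dst / "wav").mkdir(parents=True, exist_ok=True)
    transcripts = read_transcripts(src / "list.txt")
    for stem in transcripts:
        wav_to_raw(src / "wavs" / f"{stem}.wav", dst / "wav" / f"{stem}.raw")
    return transcripts
```

The returned dictionary maps each file name (without extension) to its sentence, so the transcripts can then be written out in whatever layout the voice creation page asks for. This does not touch the language data files (foma/fst) discussed in the reply.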