You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I would like to know the estimated final GST value for each utterance in the training data, and the estimated GST value from the reference audio during synthesis.
What part of the script do I need to change to make it output to a log or file?
I looked at the script and thought it might be stored in the value style_embs, but I wasn't sure if it was stored in the model.
Which script is doing the data input/output and modeling?
The dataset contains two speech styles and uses the acoustic model fastspeech2 (no special parameters other than GST such as sid, single model like LJSpeech)
I would like to change the speech style by manually adjusting the GST in the future, so I think understanding the GST value would be helpful.
Sorry for my lack of knowledge, but could someone please help me?
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
Hi all,
I would like to know the estimated final GST value for each utterance in the training data, and the estimated GST value from the reference audio during synthesis.
What part of the script do I need to change to make it output to a log or file?
I looked at the script and thought it might be stored in the value style_embs, but I wasn't sure if it was stored in the model.
Which script is doing the data input/output and modeling?
The dataset contains two speech styles and uses the acoustic model fastspeech2 (no special parameters other than GST such as sid, single model like LJSpeech)
I would like to change the speech style by manually adjusting the GST in the future, so I think understanding the GST value would be helpful.
Sorry for my lack of knowledge, but could someone please help me?
Beta Was this translation helpful? Give feedback.
All reactions