Kurdish language support #660
Replies: 5 comments 5 replies
-
Hi @Olga-Yakovleva @alex19EP, we would be very happy to see the Kurdish language in RHVoice. Best regards
-
Hi
-
Greetings everyone! As the previous comment shows, developing support for a new language in RHVoice is a rather involved task. Our voices follow one of two development models: funded voices and community voices. Examples of the first are the Macedonian and Albanian voices developed last year (see https://louderpages.org). Examples of the second are the Polish voices developed by @grzezlo and @zstanecic. The second path needs one or more developers able and willing to dedicate time and effort to the development process. One of the resources they would need is a speech corpus. Unfortunately, RHVoice doesn't implement support for building voices from multispeaker corpora such as the one linked to here. We are still using the old way of recording enough speech from a professional speaker in a high-quality recording environment. I'm sorry not to be able to give easy answers. I understand that this is one of the languages that truly needs a TTS voice and has been overlooked by the large players. But this is never simple for any language, I'm afraid.
-
Hi Olga and all,
I have a Python script which can process and sort the large dataset mentioned in the discussion. The script is also able to generate SSML from it.
Kurdish, especially the Kurmanji variety, is written in Latin script. Leave Sorani aside, as it is written in Arabic script, which is rather tough to process.
The dataset mentioned is for kmr, that is, the Kurdish spoken in Rojava, Turkey and parts of Iraq.
Best,
Zvonimir
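For illustration only, since the script itself was not posted in this thread: a minimal sketch of the kind of processing described above, namely grouping a multispeaker corpus by speaker, sorting speakers by how much text each contributed, and wrapping one speaker's sentences in SSML. The `(speaker_id, sentence)` input shape, the function names, and the sample rows are all assumptions made up for this example; they are not the actual script or the real dataset layout.

```python
from collections import defaultdict
from xml.sax.saxutils import escape


def group_by_speaker(rows):
    """Group (speaker_id, sentence) pairs by speaker, largest group first.

    The input shape is hypothetical; a real corpus would need its own loader.
    """
    groups = defaultdict(list)
    for speaker_id, sentence in rows:
        groups[speaker_id].append(sentence)
    # Sort speakers by how many sentences each contributed, descending.
    return sorted(groups.items(), key=lambda item: len(item[1]), reverse=True)


def to_ssml(sentences, lang="kmr"):
    """Wrap sentences in a minimal SSML 1.0 document, one <s> per sentence."""
    body = "\n".join(f"  <s>{escape(s)}</s>" for s in sentences)
    return (
        '<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" '
        f'xml:lang="{lang}">\n{body}\n</speak>'
    )


if __name__ == "__main__":
    # Made-up sample rows, not taken from the dataset under discussion.
    sample = [
        ("spk_a", "Rojbaş"),
        ("spk_b", "Spas"),
        ("spk_a", "Tu çawa yî?"),
    ]
    ranked = group_by_speaker(sample)
    top_speaker, top_sentences = ranked[0]
    print(top_speaker, len(top_sentences))
    print(to_ssml(top_sentences))
```

Ranking speakers this way only shows how much material any single speaker contributed; it does not by itself make a multispeaker corpus suitable for the single-speaker recording approach described earlier in the thread.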
-
Hello,
-
Hi,
This is my first time posting in the RHVoice discussions; I use RHVoice from time to time.
I'm not a native Kurdish speaker, but someone recently asked about implementing that language, as they have some voice data that could be used to create voices.
Would it be possible for RHVoice to support Kurdish, whether as separate varieties or as a single language?
Best,
Luis.