You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This is an eccentricity of the Transformer language model we use for translation not an issue with the LibreTranslate code.
This isn't the first time ♪ symbols have been generated incorrectly. I'm guessing the root cause is the OpenSubtitles dataset which is used for traning. OpenSubtitles has a lot of translated subtitles for movies and TV shows and I'm guessing a lot of ♪ characters too.
I used to filter certain special characters from the dataset before training the models, which might help with this issue, but I removed the filtering a while ago because it was slow.
When translating hashtags from Russian to English, the
#
becomes a♪
Russian:
English translation:
Example: https://lor.sh/@skobkin/111977814820972935 as viewed from Hachyderm
The text was updated successfully, but these errors were encountered: