Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Investigate the effect of back-translations #592

Open
Tracked by #216
eu9ene opened this issue May 13, 2024 · 0 comments
Open
Tracked by #216

Investigate the effect of back-translations #592

eu9ene opened this issue May 13, 2024 · 0 comments
Labels

Comments

@eu9ene
Copy link
Collaborator

eu9ene commented May 13, 2024

We should figure out what proportion of back-translated data to use for teacher training.

For example based on this validation curve 70:30 one stage training slightly outperforms 60:40 + fine-tuning on the original corpus.
Screenshot 2024-05-13 at 11 24 45 AM

However, on evaluation we see the opposite: 59.25 vs 59.35 chrF on Flores.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant