Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to use timesteps? #203

Open
blankspark opened this issue Mar 28, 2022 · 1 comment
Open

How to use timesteps? #203

blankspark opened this issue Mar 28, 2022 · 1 comment

Comments

@blankspark
Copy link

blankspark commented Mar 28, 2022

I have noticed the output of ctcdecode includes timesteps, which the description says it can be used as alignment.
But I just get shape (Batchsize,N_beams,N_timesteps). I don't know how to use it.

timesteps - Shape: BATCHSIZE x N_BEAMS

The timestep at which the nth output character has peak probability. Can be used as alignment between the audio and the transcript.

Thanks in advance.

@abarcovschi
Copy link

@blankspark have you ever figured out how to use them? I am looking to get word-level time alignments, but I don't know how to calculate this information from the timesteps returned by ctcdecode.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants