speechscorer: a simple unsupervised spoken utterance scorer.

This package enables you to assign a score to a given spoken utterance (audio). This score can reflect the intelligbility or the fluency/grammaticality of the utterance.

Different models pretrained with different training objectives are proposed: HuBERT, Whisper and WavLM.

How it works

The scoring method relies on the internal model hesitation (entropy) while predicting for the input speech. Depending on the model and its training objective, the entropy is computed differently. You can learn more about how it works by reading the documentation:

Getting start

You will first need to istall the right PyTorch for your computer

Then you can install speechscorer using pip:

pip install git+https://github.com/yaya-sy/speechscorer.git

once installed, you can score your utterance with this command:

speechscore -a <your-audio>

To see more available options, run:

speechscore -h

Demo

You can find a colab notebook in demo/speechscorer.ipynb for an example of use case.

Biases

The models are trained using speech from dominant populations: Western, affluent and white man ways of speaking English. Model results are therefore strongly influenced by the way English is spoken by these dominant populations. Scores can be low for native English but from under-represented populations (Jamaican English, Nigerian English, etc.) and from women.

Citation

To refenrece speechscorer in your own work, please add the following citation.

@article{SY2023,
  url = {https://github.com/yaya-sy/speechscorer},
  year = {2023},
  author = {Yaya SY},
  title = {speechscorer: A simple unsupervised spoken utterances scorer},
  journal = {Journal of Open Source Software}
}

Name		Name	Last commit message	Last commit date
Latest commit History 84 Commits
demo		demo
plots		plots
speechscorer		speechscorer
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

demo

demo

plots

plots

speechscorer

speechscorer

.gitignore

.gitignore

README.md

README.md

requirements.txt

requirements.txt

setup.py

setup.py

Repository files navigation

speechscorer: a simple unsupervised spoken utterance scorer.

How it works

Getting start

Demo

Biases

Citation

About

Releases

Packages

Contributors 2

Languages

yaya-sy/speechscorer

Folders and files

Latest commit

History

Repository files navigation

speechscorer: a simple unsupervised spoken utterance scorer.

How it works

Getting start

Demo

Biases

Citation

About

Topics

Resources

Stars

Watchers

Forks

Languages