Audio driven facial animation

Bachelor thesis project providing an interactive web application with the core functionality of uploading an audio file with human speech and displaying the corresponding lip movements on the provided avatar basing on the output of an LSTM neural network model on the remote server.

Setup

First, prepare the frontend environment using the command:

npm install

Second, prepare the backend virtual environment using commands:

Windows

cd api
python3 -m venv venv
.\venv\Scripts\activate
pip install flask python-dotenv
pip install -r requirements.txt

macOS

cd api
python3 -m venv venv
source venv/bin/activate
pip install flask python-dotenv
pip install -r requirements.txt

Running locally

If you followed the Setup guide, run

python api.py

to start the backend on port 5001.

Alternatively run (depending on your OS)

yarn start-api-win

or

yarn start-api-mac

in the project source.

To start the frontend on port 3000, run:

npm start

in the project source.

Open http://localhost:3000 to view the project in the browser.

Quickstart

Press the icon in the top-left corner to display the menu.
Choose file to upload a wav or mp3 audio file with speech. Press Record to record the speech in real-time.
Press Upload to send the speech recording to the server.
When the model is performing the computations on the server, a loading spinner is displayed. Wait until it disappears.
The player options in the bottom-left corner handle the animation. Change the intensity of the avatar's expression with the slider.

demo.mp4

Public access

~~You can access the web application through the link: https://facialanimation.page/.~~ Not supported!

Use the exemplary files in audio_files folder.

Project structure

See the model preparation in the ml_model_folder.

Authors

Małgorzata Nowicka and Filip Zawadka

Name		Name	Last commit message	Last commit date
Latest commit History 114 Commits
api		api
audio_files		audio_files
ml_model_training		ml_model_training
public		public
src		src
.gitignore		.gitignore
README.md		README.md
facial-animation-app.nginx		facial-animation-app.nginx
facial-animation-app.service		facial-animation-app.service
package-lock.json		package-lock.json
package.json		package.json
yarn.lock		yarn.lock

nowickam/facial-animation

Folders and files

Latest commit

History

Repository files navigation

Audio driven facial animation

Setup

Windows

macOS

Running locally

Quickstart

Public access

Project structure

Authors

About

Topics

Resources

Stars

Watchers

Forks

Languages