numGPT

Have you ever wanted to train a Transformer that's significantly slower, and probably less correct, than the many existing libraries? Well then you've come to the right place. This entire architecture is written purely in numpy, and no other dependency (except pyyaml) is required.

Creating a virtual env

Simply use pipenv install and pipenv shell to create the virtual environment and get started.
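That is, from the repository root:

```sh
pipenv install   # install the dependencies into a virtual environment
pipenv shell     # activate it
```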

Notes

This repository contains all the tools you need to construct a basic transformer using the layers provided. This is for LEARNING PURPOSES ONLY! Please do not try to build a production-ready transformer with this code.
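As a taste of what those layers look like, here is a minimal, self-contained numpy sketch of single-head scaled dot-product self-attention (causal masking omitted for brevity). The names here are illustrative only, not this repository's actual API:

```python
import numpy as np

def softmax(x, axis=-1):
    # Subtract the row max before exponentiating for numerical stability.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(x, w_q, w_k, w_v):
    # Single-head scaled dot-product attention over a (seq_len, d_model) input.
    q, k, v = x @ w_q, x @ w_k, x @ w_v
    scores = q @ k.T / np.sqrt(k.shape[-1])
    return softmax(scores) @ v

rng = np.random.default_rng(0)
seq_len, d_model = 8, 16
x = rng.normal(size=(seq_len, d_model))
w_q, w_k, w_v = (rng.normal(size=(d_model, d_model)) * 0.02 for _ in range(3))
print(self_attention(x, w_q, w_k, w_v).shape)  # (8, 16)
```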

Supporting

Please, if you see any errors in my gradient calculations, or anything that doesn't make sense, PLEASE MAKE A PULL REQUEST! I am almost certain I made some mistakes in my calculations and I would love your help.
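If you want to verify a layer yourself, a standard trick is to compare its analytic gradient against a central-difference estimate. This is a generic sketch of that check, not code from this repository:

```python
import numpy as np

def numerical_grad(f, x, eps=1e-6):
    # Central-difference estimate of df/dx for a scalar-valued f.
    grad = np.zeros_like(x)
    it = np.nditer(x, flags=["multi_index"])
    while not it.finished:
        idx = it.multi_index
        orig = x[idx]
        x[idx] = orig + eps
        f_plus = f(x)
        x[idx] = orig - eps
        f_minus = f(x)
        x[idx] = orig  # restore the original value
        grad[idx] = (f_plus - f_minus) / (2 * eps)
        it.iternext()
    return grad

# Example: the analytic gradient of f(x) = sum(x**2) is 2*x.
x = np.random.randn(3, 4)
numeric = numerical_grad(lambda x: np.sum(x**2), x)
print(np.allclose(2 * x, numeric, atol=1e-5))  # True
```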

How to train a transformer

Use the train.py script and check out the configs folder for a sample training configuration.
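The exact invocation depends on how train.py parses its arguments; something like the following, with a placeholder config path, is the general idea:

```sh
# Hypothetical usage; the config filename is a placeholder — see the
# configs folder for the real sample file.
python train.py configs/sample.yaml
```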

Downloading the vocab and merges files

Just use get_vocab.py to download the merges and vocab files to the specified folder.
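Hypothetically, assuming the destination folder is passed as an argument (check get_vocab.py for the actual interface):

```sh
# Hypothetical usage; the output directory argument is an assumption.
python get_vocab.py data/
```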

Contributing

Please create a pull request if you'd like to contribute to this project. I'm a busy student, but I'll be sure to review it as soon as possible!

Todo

Write clear unit tests for each module (right now each module just has testing code that runs when it is executed independently); see the sketch below.
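As a sketch of what such a test could look like, here are two pytest-style checks against a stand-in softmax (not this repo's actual module):

```python
import numpy as np

def softmax(x, axis=-1):
    # Stand-in implementation; a real test would import the repo's module.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def test_softmax_rows_sum_to_one():
    x = np.random.default_rng(0).normal(size=(4, 7))
    np.testing.assert_allclose(softmax(x).sum(axis=-1), np.ones(4), atol=1e-12)

def test_softmax_is_shift_invariant():
    # Adding a constant to every logit should not change the output.
    x = np.random.default_rng(1).normal(size=(4, 7))
    np.testing.assert_allclose(softmax(x), softmax(x + 100.0), atol=1e-9)
```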
