LLM Training Puzzles

by Sasha Rush - srush_nlp

This is a collection of 8 challenging puzzles about training large language models (or really any NN) on many, many GPUs. Very few people actually get a chance to train on thousands of computers, but it is an interesting challenge and one that is critically important for modern AI. The goal of these puzzles is to get hands-on experience with the key primitives and to understand the goals of memory efficiency and compute pipelining.

I recommend running in Colab. Click here and copy the notebook to get start.

If you are into this kind of thing, this is 6th in a series of these puzzles.

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
.gitignore		.gitignore
Distributed.html		Distributed.html
Distributed.ipynb		Distributed.ipynb
Distributed.py		Distributed.py
LICENSE		LICENSE
README.md		README.md
drawing.py		drawing.py
lib.py		lib.py
puzzles.ipynb		puzzles.ipynb
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.gitignore

.gitignore

Distributed.html

Distributed.html

Distributed.ipynb

Distributed.ipynb

Distributed.py

Distributed.py

LICENSE

LICENSE

README.md

README.md

drawing.py

drawing.py

lib.py

lib.py

puzzles.ipynb

puzzles.ipynb

requirements.txt

requirements.txt

Repository files navigation

LLM Training Puzzles

About

Releases

Packages

Contributors 5

Languages

License

srush/LLM-Training-Puzzles

Folders and files

Latest commit

History

Repository files navigation

LLM Training Puzzles

About

Topics

Resources

License

Stars

Watchers

Forks

Languages