danwilhelm/llmsbook

LLM Foundations (early book draft)

See the book live at https://llms.danwil.com.

by Dan Wilhelm

In this book, we develop a low-level understanding of Large Language Models (LLMs). To understand large circuits, we apply insights from mathematics and from the analysis of small circuits.

This book has an accompanying YouTube channel and project GitHub repo (coming soon).

To get the most out of this book, readers should be fluent in Python and know the basics of NumPy, linear algebra, and machine learning. Our target audience therefore includes researchers, CS students, and software engineers with an interest in LLMs.

Currently, our focus is on analysis rather than training.


Building the site

This site is built using the Rust static-site generator mdBook.

To build the site:

  1. Install mdBook.
  2. Install the katex preprocessor: cargo install mdbook-katex.
  3. Install the mermaid preprocessor: cargo install mdbook-mermaid.
  4. From the project directory, run mdbook build. The output will be in the book directory.
  5. Alternatively, to view the site immediately in watch mode, run mdbook serve --open.
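
The steps above can be sketched as a small shell helper. This is only a sketch under a few assumptions: that Rust's cargo is already on your PATH, and that mdBook itself is installed with cargo install mdbook (the steps above do not say how to install it, and prebuilt binaries work too). The function names missing_tools and build_book are hypothetical and are not part of this repository.

```sh
#!/bin/sh
# Print each named tool that is not currently on PATH, one per line.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
    done
}

# Hypothetical helper: install whatever is missing, then build the site.
# Assumes cargo is available and that each tool is installed as the
# cargo crate of the same name.
build_book() {
    for tool in $(missing_tools mdbook mdbook-katex mdbook-mermaid); do
        cargo install "$tool" || return 1
    done
    # Output is written to the book directory.
    mdbook build
}
```

After sourcing the file from the project directory, calling build_book would run the install and build steps. To preview in watch mode instead, replace the final mdbook build with mdbook serve --open.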

Note: The site also uses a table-of-contents modification that does not require installation.

About

Exploring LLMs and their foundations
