Traveling Words

This is the official repository for the paper "Traveling Words: A Geometric Interpretation of Transformers".

arXiv: https://arxiv.org/pdf/2309.07315.pdf


Installation

To clone the repository and its submodules, run:

git clone --recurse-submodules https://github.com/santiag0m/traveling-words.git

To install requirements (Python 3.10):

pip install -r requirements.txt

Experiments

Measuring the impact of layer normalization parameters on word embeddings

python word_embeddings.py

This script replicates the paper's results on the top-k (k = 5) tokens from the word embedding matrix under four scoring schemes (a rough sketch of how these scores can be computed follows the list):

  • Norm: The original embedding vector norm
  • Scaled Norm: The vector norm after multiplying by the layer norm scaling parameter gamma
  • Bias: Original norm + bias score
  • Scaled and Bias: Scaled norm + bias score
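
For reference, the sketch below (not the repo's code) shows how such scores could be computed by hand, assuming the nanoGPT module layout (transformer.wte for the embedding matrix, transformer.ln_f for the final layer norm), a simple dot-product bias score, and that decode accepts a list of token ids; see word_embeddings.py for the exact definitions used in the paper.

from utils.model_utils import load_model

model, encode, decode = load_model(init_from="gpt2")

E = model.transformer.wte.weight.detach()       # (vocab_size, d_model) word embeddings
gamma = model.transformer.ln_f.weight.detach()  # final layer-norm scale
beta = model.transformer.ln_f.bias.detach()     # final layer-norm bias

scores = {
    "norm": E.norm(dim=-1),                            # ||e_i||
    "scaled_norm": (E * gamma).norm(dim=-1),           # ||gamma * e_i||
    "bias": E.norm(dim=-1) + E @ beta,                 # norm + (assumed) bias score
    "scaled_and_bias": (E * gamma).norm(dim=-1) + (E * gamma) @ beta,
}

for name, score in scores.items():
    top_k = score.topk(5).indices.tolist()
    print(name, [decode([idx]) for idx in top_k])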

Probing query-key and key-value interactions from attention heads

python common_nouns.py

This script loads the 100 most common nouns in the English language and probes the attention heads of transformer blocks 0, 5 and 11 of the 124M-parameter version of GPT-2, following the geometric interpretation of attention as acting upon a high-dimensional ellipsoid containing keys and queries.

To test this with a different model (consistent with the nanoGPT implementation) or on a different block:

from utils.model_utils import load_model
from common_nouns import attention_ellipse

model, encode, decode = load_model(init_from="gpt2")
attention_ellipse(
  model=model, encode=encode, decode=decode, block_idx=0, print_top_k_nearest=5
)

Results will be saved in two separate log files: common_noun_logs_qk.txt and common_noun_logs_vo.txt
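
For a rough idea of the quantities being probed, the sketch below (not the repo's code) builds a single head's virtual matrices W_qk and W_vo directly from a block's attention weights, assuming the nanoGPT module layout (c_attn stacks the query, key and value projections; c_proj is the output projection). The block and head indices are arbitrary, and transpose conventions may differ from the paper, so refer to common_nouns.py for the exact definitions.

from utils.model_utils import load_model

model, encode, decode = load_model(init_from="gpt2")

block_idx, head_idx = 0, 0  # arbitrary choices for illustration
attn = model.transformer.h[block_idx].attn
n_embd, head_dim = attn.n_embd, attn.n_embd // attn.n_head

# c_attn.weight stacks the query, key and value projections: (3 * n_embd, n_embd)
W_q, W_k, W_v = attn.c_attn.weight.detach().split(n_embd, dim=0)
W_o = attn.c_proj.weight.detach()  # output projection: (n_embd, n_embd)

rows = slice(head_idx * head_dim, (head_idx + 1) * head_dim)

# Virtual matrices for this head (biases ignored):
#   query-key scores:     q_i . k_j = x_i^T W_qk x_j
#   value-output mapping: each attended token x_j contributes x_j W_vo to the head output
W_qk = W_q[rows].T @ W_k[rows]       # (n_embd, n_embd)
W_vo = W_v[rows].T @ W_o[:, rows].T  # (n_embd, n_embd)

print(W_qk.shape, W_vo.shape)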

Probing left and right singular vectors from the virtual matrices W_qk and W_vo

python svd_analysis.py

This script computes the Singular Value Decomposition (SVD) of the W_qk and W_vo matrices for attention heads at blocks 0, 5 and 11 of GPT-2, and probes their singular vectors according to the geometric interpretation discussed in the paper.

To test with different models or on a different block:

from utils.model_utils import load_model
from svd_analysis import attention_svd

model, encode, decode = load_model(init_from="gpt2")
attention_svd(model=model, block_idx=0, top_k_nearest=5, plot=False)

Results will be saved in two separate log files: svd_qk_logs.txt and svd_vo_logs.txt
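
As a rough illustration of what the script computes, the sketch below (not the repo's code) takes the SVD of one head's W_qk, rebuilt as in the previous sketch, and looks up the tokens whose embeddings align most strongly with the top left and right singular vectors. It assumes the nanoGPT module layout and that decode accepts a list of token ids; the exact normalization and probing details live in svd_analysis.py.

import torch

from utils.model_utils import load_model

model, encode, decode = load_model(init_from="gpt2")

# Rebuild one head's W_qk (block 0, head 0; see the previous sketch)
attn = model.transformer.h[0].attn
head_dim = attn.n_embd // attn.n_head
W_q, W_k, _ = attn.c_attn.weight.detach().split(attn.n_embd, dim=0)
W_qk = W_q[:head_dim].T @ W_k[:head_dim]

E = model.transformer.wte.weight.detach()  # (vocab_size, n_embd) token embeddings

U, S, Vh = torch.linalg.svd(W_qk)
for i in range(3):  # probe the top singular directions
    top_left = (E @ U[:, i]).topk(5).indices.tolist()
    top_right = (E @ Vh[i]).topk(5).indices.tolist()
    print(f"sigma_{i} = {S[i].item():.2f}")
    print("  left :", [decode([idx]) for idx in top_left])
    print("  right:", [decode([idx]) for idx in top_right])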

Visualizing the trajectory of latent states throughout the transformer

python trajectory.py

This script plots a 3D UMAP projection of the latent states of a given phrase (e.g. "To kill two birds with one stone") as they travel through the transformer blocks.
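
For context, here is a minimal sketch of this kind of plot (not trajectory.py itself): it collects the latent state after every transformer block with forward hooks, projects all states to 3D with umap-learn, and draws one trajectory per token position. It assumes the nanoGPT module layout, that encode/decode follow the nanoGPT conventions, and that umap-learn and matplotlib are installed; the script's actual plotting choices may differ.

import torch
import umap
import matplotlib.pyplot as plt

from utils.model_utils import load_model

model, encode, decode = load_model(init_from="gpt2")
device = next(model.parameters()).device
tokens = torch.tensor(encode("To kill two birds with one stone"), device=device)[None]

# Collect the latent state after each transformer block with forward hooks
latents = []
hooks = [
    block.register_forward_hook(lambda module, inputs, output: latents.append(output.detach()))
    for block in model.transformer.h
]
with torch.no_grad():
    model(tokens)
for hook in hooks:
    hook.remove()

# Stack to (n_blocks * seq_len, d_model) and project to 3D with UMAP
states = torch.cat([h[0] for h in latents], dim=0).cpu().numpy()
points = umap.UMAP(n_components=3).fit_transform(states)

# One trajectory per token position, colored by matplotlib's default cycle
seq_len = tokens.shape[1]
ax = plt.figure().add_subplot(projection="3d")
for pos in range(seq_len):
    path = points[pos::seq_len]  # the same token position across all blocks
    ax.plot(path[:, 0], path[:, 1], path[:, 2], marker="o")
plt.show()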

