rioyokotalab/kenkyu_project

kenkyu_project

TSUBAME setup

Interactive node

qrsh -g tga-hpc-lecture -l f_node=1 -l h_rt=0:50:00 -ar <reservation number>

Job schedule

qsub -g tga-hpc-lecture job.sh "python 00_numpy.py"
qsub -g tga-hpc-lecture job.sh "mpirun -np 4 python 19_regularization.py"
qsub -g tga-hpc-lecture job.sh "wandb agent rioyokotalab/kenkyu_project/ux2akgap"
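The contents of job.sh are not shown in this README. As a rough sketch only, a wrapper that runs the command string passed as its first argument might look like the following. The `#$` resource lines mirror the `f_node` and `h_rt` options from the interactive example, the 10-minute limit is an assumed value, and `run_command` is a hypothetical helper name; the actual job.sh in the repository may differ.

```shell
#!/bin/sh
# Assumed scheduler directives: run in the current directory,
# request one f_node for 10 minutes (values are placeholders).
#$ -cwd
#$ -l f_node=1
#$ -l h_rt=0:10:00

# Load the same modules as in the Modules section, if the module system exists on this host.
if [ -f /etc/profile.d/modules.sh ]; then
    . /etc/profile.d/modules.sh
    module load cuda openmpi nccl cudnn
fi

# Run the command string given as an argument, e.g. "python 00_numpy.py".
run_command() {
    eval "$1"
}

# Do nothing if no command was passed.
run_command "${1:-true}"
```

With a wrapper of this shape, `qsub -g tga-hpc-lecture job.sh "python 00_numpy.py"` would run `python 00_numpy.py` on the allocated node.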

Job monitor (r: running, qw: waiting in queue)

qstat
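To see at a glance how many jobs are running or waiting, the qstat output can be summarized with a small filter. This is a sketch that assumes the usual Grid Engine layout (two header lines, job state in the fifth column); `count_states` is a hypothetical helper, not part of the repository.

```shell
# Count jobs per state from qstat-style output read on stdin.
# Assumes two header lines and the state in column 5.
count_states() {
    awk 'NR > 2 && NF >= 5 { n[$5]++ }
         END { printf "r=%d qw=%d\n", n["r"], n["qw"] }'
}
```

Usage would be `qstat | count_states`. If the column layout on your system differs, the column index needs adjusting.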

Job delete

qdel <job ID>

Modules

echo '' >> ~/.bashrc
echo '# Modules' >> ~/.bashrc
echo 'source /etc/profile.d/modules.sh' >> ~/.bashrc
echo 'module load cuda openmpi nccl cudnn' >> ~/.bashrc
source ~/.bashrc
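Running the `echo ... >> ~/.bashrc` lines more than once appends duplicate entries. One way to avoid that is a small helper that appends a line only when it is missing. This is a sketch; `append_once` is a hypothetical name.

```shell
# Append a line to a file only if that exact line is not already present.
append_once() {
    line=$1
    file=$2
    touch "$file"
    grep -qxF -- "$line" "$file" || printf '%s\n' "$line" >> "$file"
}

# Example with the lines from the Modules setup (commented out so that
# pasting this block does not modify ~/.bashrc by itself):
# append_once 'source /etc/profile.d/modules.sh' ~/.bashrc
# append_once 'module load cuda openmpi nccl cudnn' ~/.bashrc
```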

Install Pyenv Virtualenv

git clone https://github.com/yyuu/pyenv.git ~/.pyenv
git clone https://github.com/yyuu/pyenv-virtualenv.git ~/.pyenv/plugins/pyenv-virtualenv
echo '' >> ~/.bash_profile
echo '# Pyenv Virtualenv' >> ~/.bash_profile
echo 'export PYENV_ROOT="$HOME/.pyenv"' >> ~/.bash_profile
echo 'export PATH="$PYENV_ROOT/bin:$PATH"' >> ~/.bash_profile
echo 'eval "$(pyenv init -)"' >> ~/.bash_profile
echo 'eval "$(pyenv virtualenv-init -)"' >> ~/.bash_profile
source ~/.bash_profile
pyenv install 3.8.6
pyenv virtualenv 3.8.6 pytorch
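The commands above create the `pytorch` virtualenv but do not select it, so a later `pip install` may end up in a different Python. One possible way to select it per directory is `pyenv local`, wrapped here in a hypothetical helper `use_env` that fails early when pyenv is not on PATH. This is a sketch, not a step taken from the original instructions.

```shell
# Select a pyenv virtualenv for the current directory (writes .python-version).
# Returns 1 if pyenv is not on PATH.
use_env() {
    if ! command -v pyenv >/dev/null 2>&1; then
        echo "pyenv not found; check ~/.bash_profile" >&2
        return 1
    fi
    pyenv local "$1"
}
```

For example, running `use_env pytorch` inside the cloned repository directory before `pip install -r requirements.txt`.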

Code

Clone

git clone https://github.com/rioyokotalab/kenkyu_project

Move to folder

cd kenkyu_project

Pip install

pip install -r requirements.txt

Run

python 00_numpy.py
mpirun -npernode 4 -np 8 python 12_distributed.py
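In the second command, `-np 8` is the total number of processes and `-npernode 4` is the number of processes per node, so the run spans two nodes; a single-node allocation such as the `f_node=1` in the interactive example would presumably not be enough. The arithmetic can be sketched with a hypothetical helper:

```shell
# Number of nodes needed for a total process count and a per-node count (rounded up).
nodes_needed() {
    np=$1
    npernode=$2
    echo $(( (np + npernode - 1) / npernode ))
}

nodes_needed 8 4   # prints 2
```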

Update code

git pull

WandB

wandb login

Sweep

wandb sweep sweep.yaml
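`wandb sweep` prints a sweep ID, which is then passed to `wandb agent` as `entity/project/sweep_id`, as in the job schedule example. As a sketch, a hypothetical helper `agent_job` can assemble the corresponding qsub line; the group name and job.sh are taken from the examples above, and the three arguments are placeholders for your own values.

```shell
# Print the qsub line that runs a W&B agent for a given sweep.
# Arguments: entity, project, sweep ID (all placeholders).
agent_job() {
    echo "qsub -g tga-hpc-lecture job.sh \"wandb agent $1/$2/$3\""
}

agent_job rioyokotalab kenkyu_project ux2akgap
```

The function only prints the command, so it can be checked before being run.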
