Skip to content

YichengDWu/MoYe.jl

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MoYe

Stable Dev Build Status Coverage

MoYe.jl is NVIDIA's Cutlass/CuTe implemented in Julia. The primary purpose of developing this library is my desire to learn CuTe.

The name Mo Ye is derived from an ancient Chinese legend of swordsmiths.

The documentation is mostly my learning notes. Please refer to CuTe's documentation for more details.

GEMM essentially faces two main performance hurdles not implemented yet:

  1. Swizzling to prevent bank conflicts.
  2. An efficient epilogue, which involves transferring data from registers to shared memory, followed by a vectorized copy back to global memory.

Since I've sold my computer, I no longer have access to an NVIDIA GPU, thus the development of this library will be put on hold indefinitely.