Blogs Map
Paper Reading
Study on matrix units of AMD and NVIDIA GPUs
Investigation on NVIDIA Tensor cores v.s. AMD Matrix cores
- AMD matrix cores
- NVIDIA tensor cores
Matrix Multiplication Background - Tiled Matrix Multiplication -- CUDA implementation
- Programming tensor cores using nvcuda-wmma
GPU performance optimization
NVIDIA GPU Performance Background
FPChecker
- FPChecker Installation
- FPChecker issue -- cannot link the openMP lib
- FPChecker exploration -- mixed precision