Recent Posts

Mpi processes per gpu

Some guidlines on selecting the number of MPI processes per GPU When using the GPU package, you cannot assign more than one GPU to a single MPI task Mul...

Nvidia profilers

Profiling PyTorch (PyProf) PyProf is a tool that profiles and analyzes the GPU performance of PyTorch models. PyProf aggregates kernel performa...