You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
While OpenCL on NVIDIA GPUs requires CL_DEVICE_WARP_SIZE_NV, the equivalent query on AMD GPUs is CL_DEVICE_WAVEFRONT_WIDTH_AMD.
GPUs from different vendors require different queries. Since the warp size can vary from each GPU vendor or GPU model, collecting the warp size is relevant to organizing the GPU thread hierarchy according to the underlying hardware, improving the portability from the GSParLib code. For instance, a best practice is creating thread blocks where the block size is a multiple of the GPU warp size.
Currently, GSParLib only allows collecting the warp size from NVIDIA GPUs as presented in this code line: https://github.com/GMAP/GSParLib/blob/main/src/GSPar_OpenCL.cpp#L438
While OpenCL on NVIDIA GPUs requires CL_DEVICE_WARP_SIZE_NV, the equivalent query on AMD GPUs is CL_DEVICE_WAVEFRONT_WIDTH_AMD.
GPUs from different vendors require different queries. Since the warp size can vary from each GPU vendor or GPU model, collecting the warp size is relevant to organizing the GPU thread hierarchy according to the underlying hardware, improving the portability from the GSParLib code. For instance, a best practice is creating thread blocks where the block size is a multiple of the GPU warp size.
This link contains a bit of discussion about warp size on NVIDIA, AMD, and Intel GPUs: https://stackoverflow.com/questions/77563895/are-there-any-warp-size-or-wavefront-width-equivalent-on-intel-gpus
The text was updated successfully, but these errors were encountered: