There is an unsaved comment in progress. You will lose your changes if you continue. Are you sure you want to reopen the work item?
Optimize tile_local_reduction using warp information
In tile_local_reduction(), if we can get warp information, we can optimize this further by removing some of the waits and local bounds checks