warp.launch_tiled#
- warp.launch_tiled(*args, **kwargs)[source]#
A helper method for launching a grid with an extra trailing dimension equal to the block size.
For example, to launch a 2D grid, where each element has 64 threads assigned you would use the following:
wp.launch_tiled(kernel, [M, N], inputs=[...], block_dim=64)
Which is equivalent to the following:
wp.launch(kernel, [M, N, 64], inputs=[...], block_dim=64)
Inside your kernel code you can retrieve the first two indices of the thread as usual, ignoring the implicit third dimension if desired:
@wp.kernel def compute() i, j = wp.tid() ...