warp.untile#

warp.untile(a: Tile[Any, tuple[int, ...]]) → Any#

Kernel

Differentiable

Convert a tile back to per-thread values.

This function converts a block-wide tile back to per-thread values.

If the input tile is 1D, then the resulting value will be a per-thread scalar
If the input tile is 2D, then the resulting value will be a per-thread vector of length M

Parameters:: a – A tile with dimensions shape=(M, block_dim)
Returns:: A single value per-thread with the same data type as the tile.

Example

This example shows how to create a linear sequence from thread variables:

@wp.kernel
def compute():
    i = wp.tid()

    # create block-wide tile
    t = wp.tile(i)*2

    # convert back to per-thread values
    s = wp.untile(t)

    print(s)

wp.launch(compute, dim=16, inputs=[], block_dim=16)

0
2
4
6
8
...