warp.tile_load#

warp.tile_load(
a: Array[Any],
shape: tuple[int, ...],
offset: tuple[int, ...],
storage: str,
bounds_check: bool,
) Tile[Any, tuple[int, ...]]#
  • Kernel

  • Differentiable

Loads a tile from a global memory array.

This method will cooperatively load a tile from global memory using all threads in the block.

param a:

The source array in global memory

param shape:

Shape of the tile to load, must have the same number of dimensions as a

param offset:

Offset in the source array to begin reading from (optional)

param storage:

The storage location for the tile: "register" for registers (default) or "shared" for shared memory.

param bounds_check:

Needed for unaligned tiles, but can disable for memory-aligned tiles for faster load times

returns:

A tile with shape as specified and data type the same as the source array

warp.tile_load(
a: Array[Any],
shape: int32,
offset: int32,
storage: str,
bounds_check: bool,
) Tile[Any, tuple[int, ...]]
  • Kernel

  • Differentiable