Interactive Layout Demo - Tilus

Type a layout expression, press Enter, and see how tensor elements are distributed across threads. Each cell shows T<thread_id> and the local index. Cells are color-coded by thread. Hover to highlight all elements owned by the same thread, or click a thread in the legend.

Expression Syntax

Primitives

spatial(d0, d1, ...) — distribute elements across threads
local(d0, d1, ...) — store elements in each thread's local registers
column_spatial(d0, d1, ...) — spatial in column-major order
column_local(d0, d1, ...) — local in column-major order

Chained Product

local(3, 4).spatial(2, 3) — equivalent to product(local(3, 4), spatial(2, 3))

Operations

product(A, B) or A * B — Kronecker-like product of two layouts
divide(A, B) or A / B — divide layout A by B
reduce(layout, [dim0, ...]) — reduce over dimensions (creates replicated threads)
permute(layout, [d0, ...]) — permute dimensions
reshape(layout, [s0, ...]) — reshape to new shape

Try These

local(4, 4) — single thread holds all 16 elements
spatial(4, 8).local(4, 4) — 32 threads with 16 elements each
reduce(spatial(3, 4), [0]) — replicated elements across threads
local(2, 1).spatial(8, 4).local(1, 2) — MMA m16n8k8 output layout