Script.tcgen05.commit¶
- Script.tcgen05.commit(mbarrier, cta_group=1, multicast_mask=None)[source]¶
Commit pending tcgen05 async operations and signal an mbarrier.
Groups all prior uncommitted tcgen05 async operations (e.g.,
copy,mma) and signals the specified mbarrier upon completion. The mbarrier’s tx-count will be decreased when the operations finish.- Parameters:
mbarrier (Expr | RegisterTensor) – The memory barrier to signal upon completion.
cta_group (int) – The CTA group size. Must be 1 or 2.
multicast_mask (Optional[int]) – If provided, signals mbarriers on multiple CTAs in the cluster specified by the bitmask.
- Return type:
None
Notes
Thread group: Must be executed by a single warp (use
self.single_warp()).Hardware: Requires compute capability 10.0+ (sm_100).
PTX:
tcgen05.commit