Script.atomic.global_sub¶
- Script.atomic.global_sub(dst, values, *, sem='relaxed', scope='gpu', output=None)[source]¶
Element-wise
dst[i] = dst[i] - values[i]atomically, on global memory.Lowered to
atom.addwith a negated operand; seeglobal_add().Notes
Thread group: Can be executed by any sized thread group.
Hardware: Requires compute capability 7.0+ (sm_70).
PTX:
atom.{sem}.{scope}.global.add.s32with a negated input.
- Parameters:
dst (GlobalTensor)
values (RegisterTensor)
sem (str)
scope (str)
output (RegisterTensor | None)
- Return type:
RegisterTensor | None