Script.atomic.global_sub

Script.atomic.global_sub

Script.atomic.global_sub(dst, values, *, sem='relaxed', scope='gpu', output=None)[source]

Element-wise dst[i] = dst[i] - values[i] atomically, on global memory.

Lowered to atom.add with a negated operand; see global_add().

Notes

  • Thread group: Can be executed by any sized thread group.

  • Hardware: Requires compute capability 7.0+ (sm_70).

  • PTX: atom.{sem}.{scope}.global.add.s32 with a negated input.

Parameters:
Return type:

RegisterTensor | None