PTX Instructions
- barrier.cluster
- cp.async.bulk
- cp.async.bulk.commit_group
- cp.async.bulk.wait_group
- cp.async.bulk.tensor
- cp.reduce.async.bulk
- cp.reduce.async.bulk.tensor
- fence
- getctarank
- mapa
- mbarrier.init
- mbarrier.arrive
- mbarrier.expect_tx
- mbarrier.test_wait
- mbarrier.try_wait
- red.async
- st.async
- tensormap.replace
- tensormap.cp_fenceproxy
- Special registers
Instructions by section

| Instruction | Available in libcu++ |
|---|---|
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |

| Instruction | Available in libcu++ |
|---|---|
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |

| Instruction | Available in libcu++ |
|---|---|
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |

| Instruction | Available in libcu++ |
|---|---|
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |

| Instruction | Available in libcu++ |
|---|---|
| | No |
| | No |
| | No |
| | No |

| Instruction | Available in libcu++ |
|---|---|
| | No |
| | No |

| Instruction | Available in libcu++ |
|---|---|
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |

| Instruction | Available in libcu++ |
|---|---|
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | CCCL 2.3.0 / CUDA 12.4 |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | CCCL 2.4.0 / CUDA 12.5 |

| Instruction | Available in libcu++ |
|---|---|
| | No |
| | No |
| | No |
| | CCCL 2.4.0 / CUDA 12.5 |
| | CCCL 2.4.0 / CUDA 12.5 |
| | No |
| | CCCL 2.4.0 / CUDA 12.5 |
| | CCCL 2.4.0 / CUDA 12.5 |
| | No |
| | CCCL 2.4.0 / CUDA 12.5 |
| | CCCL 2.4.0 / CUDA 12.5 |
| | CCCL 2.4.0 / CUDA 12.5 |

| Instruction | Available in libcu++ |
|---|---|
| | No |
| | No |
| | No |
| | No |

| Instruction | Available in libcu++ |
|---|---|
| | No |
| | No |
| | No |
| | No |

| Instruction | Available in libcu++ |
|---|---|
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |

| Instruction | Available in libcu++ |
|---|---|
| | No |
| | No |
| | CCCL 2.4.0 / CUDA 12.5 |
| | No |
| | CCCL 2.4.0 / CUDA 12.5 |
| | No |
| | No |
| | CCCL 2.3.0 / CUDA 12.4 |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |

| Instruction | Available in libcu++ |
|---|---|
| | CCCL 2.5.0 / CUDA Future |
| | No |
| | No |
| | No |
| | CCCL 2.3.0 / CUDA 12.4 |
| | No |
| | No |
| | CCCL 2.3.0 / CUDA 12.4 |
| | CCCL 2.3.0 / CUDA 12.4 |
| | No |
| | CCCL 2.4.0 / CUDA 12.5 |

| Instruction | Available in libcu++ |
|---|---|
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |
| | No |

| Instruction | Available in libcu++ |
|---|---|
| | No |
| | No |
| | No |
| | No |
| | No |

| Instruction | Available in libcu++ |
|---|---|
| | No |
| | No |
| | No |

| Instruction | Available in libcu++ |
|---|---|
| | No |
| | No |
| | No |
| | No |

| Instruction | Available in libcu++ |
|---|---|
| | No |
| | No |
| | No |
| | No |

| Instruction | Available in libcu++ |
|---|---|
| | No |
| | No |
| | No |
| | No |
| | No |

| Instruction | PTX ISA | SM Version | Available in libcu++ |
|---|---|---|---|
| | 20 | All | CCCL 2.4.0 / CUDA 12.5 |
| | 20 | All | CCCL 2.4.0 / CUDA 12.5 |
| | 13 | All | CCCL 2.4.0 / CUDA 12.5 |
| | 13 | All | CCCL 2.4.0 / CUDA 12.5 |
| | 20 | 20 | CCCL 2.4.0 / CUDA 12.5 |
| | 20 | All | CCCL 2.4.0 / CUDA 12.5 |
| | 20 | All | CCCL 2.4.0 / CUDA 12.5 |
| | 13 | All | CCCL 2.4.0 / CUDA 12.5 |
| | 20 | 20 | CCCL 2.4.0 / CUDA 12.5 |
| | 30 | 30 | CCCL 2.4.0 / CUDA 12.5 |
| | 78 | 90 | CCCL 2.4.0 / CUDA 12.5 |
| | 78 | 90 | CCCL 2.4.0 / CUDA 12.5 |
| | 78 | 90 | CCCL 2.4.0 / CUDA 12.5 |
| | 78 | 90 | CCCL 2.4.0 / CUDA 12.5 |
| | 78 | 90 | CCCL 2.4.0 / CUDA 12.5 |
| | 78 | 90 | CCCL 2.4.0 / CUDA 12.5 |
| | 78 | 90 | CCCL 2.4.0 / CUDA 12.5 |
| | 20 | 20 | CCCL 2.4.0 / CUDA 12.5 |
| | 20 | 20 | CCCL 2.4.0 / CUDA 12.5 |
| | 20 | 20 | CCCL 2.4.0 / CUDA 12.5 |
| | 20 | 20 | CCCL 2.4.0 / CUDA 12.5 |
| | 20 | 20 | CCCL 2.4.0 / CUDA 12.5 |
| | 10 | All | CCCL 2.4.0 / CUDA 12.5 |
| | 20 | 20 | CCCL 2.4.0 / CUDA 12.5 |
| | | | No |
| | | | No |
| | | | No |
| | 31 | 31 | CCCL 2.4.0 / CUDA 12.5 |
| | | | No |
| | 41 | 20 | CCCL 2.4.0 / CUDA 12.5 |
| | 81 | 90 | CCCL 2.4.0 / CUDA 12.5 |
| | 41 | 20 | CCCL 2.4.0 / CUDA 12.5 |
| | 80 | 50 | CCCL 2.4.0 / CUDA 12.5 |