cuda-bindings 13.3.0 Release notes#
Highlights#
Bugfixes#
Fixed a use-after-free in
cudaGraphGetEdges,cudaGraphNodeGetDependencies,cudaGraphNodeGetDependentNodes,cudaStreamGetCaptureInfo, and their driver-API counterparts (cuGraphGetEdges,cuGraphNodeGetDependencies,cuGraphNodeGetDependentNodes,cuStreamGetCaptureInfo). The returnedcudaGraphEdgeData/CUgraphEdgeDatawrappers were backed by a scratch buffer that was freed before the call returned, leaving every wrapper holding a dangling pointer. The returned wrappers now own deep copies of the edge data. (Issue #1804)Fixed a double-free in the generated setters for list-valued struct members (e.g.
CUlaunchConfig.attrs,CUDA_MEM_ALLOC_NODE_PARAMS.accessDescs, external-semaphore and batch-mem-op node parameter arrays, and their runtime counterparts). Assigning an empty list freed the internal buffer but left the cached pointer non-NULL, so a subsequent assignment or__dealloc__would callfree()again on the dangling pointer.
Miscellaneous#
Known issues#
Updating from older versions (v12.6.2.post1 and below) via
pip install -U cuda-pythonmight not work. Please do a clean re-installation by uninstallingpip uninstall -y cuda-pythonfollowed by installingpip install cuda-python.nvml.system_get_process_nameon WSL can return incorrect values. To work around this, set the locale to “C” before callingnvml.device_get_compute_running_processes_v3(which sets the process names) and before callingnvml.system_get_process_name.cuda_coredoes this automatically, but users of the raw NVML API will need to do this manually.