Synchronize the calling CPU thread with any outstanding CUDA work on the specified device
This function allows the host application code to ensure that all kernel launches
and memory copies have completed on the device.
- Parameters:
device (Device | str | None) – Device to synchronize.