Extended API# Bit Execution model Host threads Device threads CUDA APIs Memory model Thread Scopes Synchronization primitives Atomicity Data Races Example: Message Passing Thread Groups Data Members Member Functions Notes Example Synchronization Primitives Asynchronous Operations Memory access properties Functional Fancy Iterators constant_iterator counting_iterator discard_iterator permutation_iterator shuffle_iterator strided_iterator tabulate_output_iterator transform_input_output_iterator transform_iterator transform_output_iterator zip_iterator zip_function Iterators Type traits Numeric Memory Streams Memory Resources Math Mdspan Warp Utility Work stealing Example