cuDecomp
  • Overview
    • What is cuDecomp?
    • Decomposition Layout
  • Basic Usage Guide
    • Starting up cuDecomp
    • Creating a grid descriptor
    • Allocate pencil memory
    • Working with pencil data
      • C/C++
      • Fortran
    • Allocating workspace
    • Transposing the data
    • Updating halo regions
    • Cleaning up and finalizing the library
    • Building and running the example
  • Autotuning
    • Autotuning process
    • Autotuning usage
      • Creating a grid descriptor with autotuning enabled
      • Autotuner output and querying results
  • Working with NVSHMEM-enabled builds
    • Controlling the symmetric heap size
    • MPI compatibility
  • cuDecomp API
    • cuDecomp C API
      • Types
        • Internal types
        • Grid Descriptor Configuration
        • Pencil Information
        • Communication Backends
        • Additional Enumerators
      • Functions
        • Library Initialization/Finalization
        • Grid Descriptor Management
        • Workspace Management
        • Helper Functions
        • Transposition Functions
        • Halo Exchange Functions
    • cuDecomp Fortran API
      • Types
        • Internal types
        • Grid Descriptor Configuration
        • Pencil Information
        • Communication Backends
        • Additional Enumerators
      • Functions
        • Library Initialization/Finalization
        • Grid Descriptor Management
        • Workspace Management
        • Helper Functions
        • Transposition Functions
        • Halo Exchange Functions
  • Environment Variables
    • CUDECOMP_ENABLE_NCCL_UBR
    • CUDECOMP_ENABLE_CUMEM
    • CUDECOMP_ENABLE_CUDA_GRAPHS
    • CUDECOMP_ENABLE_PERFORMANCE_REPORT
    • CUDECOMP_PERFORMANCE_REPORT_DETAIL
    • CUDECOMP_PERFORMANCE_REPORT_SAMPLES
    • CUDECOMP_PERFORMANCE_REPORT_WARMUP_SAMPLES
    • CUDECOMP_PERFORMANCE_REPORT_WRITE_DIR
    • CUDECOMP_USE_COL_MAJOR_RANK_ORDER
cuDecomp
  • Search


© Copyright 2022-2025, NVIDIA Corporation.


Documentation built from commit 12c1bb845cb9a8d3c91f968e57fb45217dd6b7f4.
Built with Sphinx using a theme provided by Read the Docs.