cast.h

Functions to cast to/from FP8.

Functions

void nvte_fp8_quantize(const NVTETensor input, NVTETensor output, cudaStream_t stream)

Cast tensor to FP8.

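Parameters

input – Input tensor to be cast.
output – Output FP8 tensor.
stream – CUDA stream used for the operation.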

void nvte_fp8_dequantize(const NVTETensor input, NVTETensor output, cudaStream_t stream)

Cast tensor from FP8.

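Parameters

input – Input FP8 tensor to be cast.
output – Output tensor.
stream – CUDA stream used for the operation.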
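To make the two casts concrete, the following host-side sketch emulates what a cast to FP8 and back could look like numerically. It is not the library's implementation and calls nothing from cast.h. It assumes the E4M3 format (3 mantissa bits, largest finite value 448) and a single per-tensor scale factor; the library may use a different FP8 format or scaling scheme. The helper names round_to_e4m3, quantize_ref and dequantize_ref are hypothetical, and the "FP8" values are kept in float rather than in 8-bit storage.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Assumption: E4M3 format, largest finite magnitude 448, smallest normal 2^-6.
constexpr float kE4M3Max = 448.0f;

// Hypothetical helper: round a float to the nearest E4M3-representable value,
// saturating at +/-448. The result is still stored in a float.
float round_to_e4m3(float v) {
    if (v == 0.0f || std::isnan(v)) return v;
    float a = std::fmin(std::fabs(v), kE4M3Max);
    int e = 0;
    std::frexp(a, &e);                  // a = m * 2^e with m in [0.5, 1)
    int exp2 = e - 1;                   // floor(log2(a))
    if (exp2 < -6) exp2 = -6;           // subnormal range shares one spacing
    float quantum = std::ldexp(1.0f, exp2 - 3);  // spacing with 3 mantissa bits
    float r = std::nearbyint(a / quantum) * quantum;
    return std::copysign(r, v);
}

// Hypothetical reference for a cast to FP8: scale, then round and saturate.
std::vector<float> quantize_ref(const std::vector<float>& in, float scale) {
    assert(scale > 0.0f);
    std::vector<float> out(in.size());
    for (std::size_t i = 0; i < in.size(); ++i) {
        out[i] = round_to_e4m3(in[i] * scale);
    }
    return out;
}

// Hypothetical reference for a cast from FP8: multiply by the inverse scale.
std::vector<float> dequantize_ref(const std::vector<float>& q, float scale_inv) {
    std::vector<float> out(q.size());
    for (std::size_t i = 0; i < q.size(); ++i) {
        out[i] = q[i] * scale_inv;
    }
    return out;
}
```

Under these assumptions, quantize_ref({0.5f, -2.0f}, 4.0f) followed by dequantize_ref(..., 0.25f) returns the original values, because 2.0 and -8.0 are exactly representable in E4M3. A value such as 0.3 with a scale of 1 rounds to 0.3125, and any magnitude above 448 saturates to 448, which is why the choice of scale matters for the round-trip error.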