cast.h¶
Functions to cast to/from FP8.
Functions
- 
void nvte_fp8_quantize(const NVTETensor input, NVTETensor output, cudaStream_t stream)¶
- Cast tensor to FP8. - Parameters
- input – [in] Input tensor to be cast. 
- output – [inout] Output FP8 tensor. 
- stream – [in] CUDA stream used for the operation. 
 
 
- 
void nvte_fp8_dequantize(const NVTETensor input, NVTETensor output, cudaStream_t stream)¶
- Cast tensor from FP8. - Parameters
- input – [in] Input tensor to be cast. 
- output – [out] Output tensor. 
- stream – [in] CUDA stream used for the operation.