nvfp4_tensor
Implements NVFP4 quantization for efficient tensor storage and computation.
Classes
- class NVFP4QTensor
Bases:
BaseQuantizedTensor
Implements NVFP4 quantization on tensors for more efficient storage and computation.
- quantized_data
The quantized data stored as a packed uint8 tensor.
- Type:
torch.Tensor
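Since `quantized_data` packs two 4-bit codes into each uint8, a minimal pure-Python sketch of the pack/unpack idea may help. The nibble layout chosen here (low nibble holds the even-indexed element) is an assumption for illustration; the library's actual packing layout may differ.

```python
def pack_fp4_pairs(codes):
    """Pack pairs of 4-bit codes (0..15) into single bytes.

    Layout assumption: low nibble = even-indexed element. The
    library's real uint8 layout may differ.
    """
    assert len(codes) % 2 == 0
    return bytes((codes[i] & 0xF) | ((codes[i + 1] & 0xF) << 4)
                 for i in range(0, len(codes), 2))

def unpack_fp4_pairs(packed):
    """Recover the 4-bit codes from the packed bytes."""
    out = []
    for b in packed:
        out.append(b & 0xF)   # low nibble first, matching the pack order
        out.append(b >> 4)
    return out

codes = [0, 1, 7, 15, 8, 3]
packed = pack_fp4_pairs(codes)   # 6 codes fit in 3 bytes
assert unpack_fp4_pairs(packed) == codes
```

The 2x storage reduction relative to int8 (and 4x relative to fp16) comes directly from this packing.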
- dequantize(dtype=None, **kwarg)
Dequantizes the NVFP4 packed tensor to a target dtype.
- Parameters:
dtype (torch.dtype) – Target dtype for the dequantized tensor.
- e2m1_values_on_device = {}
- classmethod get_activation_scaling_factor(quantizer)
Returns the activation scaling factor for export.
- classmethod get_e2m1_values(device)
Returns the e2m1 values on the device.
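E2M1 (2 exponent bits, 1 mantissa bit) can represent exactly eight non-negative magnitudes, with the sign handled by a separate bit. The table below follows the standard FP4 E2M1 encoding; the exact tensor returned by `get_e2m1_values` is an implementation detail, and the rounding helper here is a pure-Python sketch, not the library's on-device code.

```python
# The 8 non-negative magnitudes representable in E2M1 (sign handled separately).
E2M1_VALUES = [0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0]

def nearest_e2m1(x):
    """Round x to the nearest representable E2M1 value (sketch).

    Saturates at the E2M1 maximum magnitude of 6.0.
    """
    sign = -1.0 if x < 0 else 1.0
    mag = min(abs(x), 6.0)  # saturate at the E2M1 max
    return sign * min(E2M1_VALUES, key=lambda v: abs(v - mag))

assert nearest_e2m1(2.4) == 2.0   # 2.4 is closer to 2.0 than to 3.0
assert nearest_e2m1(-7.0) == -6.0 # saturates at the format maximum
```

Caching these values per device (as `e2m1_values_on_device` suggests) avoids re-creating the lookup tensor on every call.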
- classmethod get_weights_scaling_factor(input, block_size, weights_scaling_factor_2=None, keep_high_precision=False)
Returns the quantized per-block weight scaling factor.
- Parameters:
input (Tensor)
block_size (int)
weights_scaling_factor_2 (Tensor | None)
keep_high_precision (bool)
- classmethod get_weights_scaling_factor_2(input)
Returns the per-tensor weight scaling factor.
- Parameters:
input (Tensor)
- classmethod get_weights_scaling_factor_2_from_quantizer(weight_quantizer)
Returns the per-tensor weight scaling factor from the weight_quantizer amax.
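The per-tensor factor is conventionally chosen so that, after both scaling levels, element values fit the E2M1 range (max magnitude 6) while per-block scales remain representable in FP8 E4M3 (max magnitude 448). A hedged sketch assuming that common NVFP4 convention; the library derives `amax` from the quantizer's calibration statistics:

```python
E2M1_MAX = 6.0    # largest E2M1 magnitude
E4M3_MAX = 448.0  # largest FP8 E4M3 magnitude (format used for per-block scales)

def weights_scaling_factor_2(amax):
    """Per-tensor scale sketch: amax / (E2M1_MAX * E4M3_MAX).

    Assumes the usual NVFP4 two-level scaling convention; the library
    computes this from the weight quantizer's recorded amax.
    """
    return amax / (E2M1_MAX * E4M3_MAX)
```

With this choice, a tensor whose amax is exactly 6 * 448 = 2688 gets a per-tensor factor of 1.0.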
- classmethod quantize(input, block_size, weights_scaling_factor=None, weights_scaling_factor_2=None, keep_high_precision=False, try_tensorrt=False)
Converts a tensor to a quantized format based on NVFP4 quantization.
- Parameters:
input (torch.Tensor) – The input tensor to be quantized.
block_size (int) – The size of each block for quantization.
weights_scaling_factor (torch.Tensor) – The per-block scaling factor for the weights.
weights_scaling_factor_2 (torch.Tensor) – The per-tensor scaling factor for the weights.
keep_high_precision (bool) – Whether to keep output scales at high precision.
try_tensorrt (bool)
Returns: tuple – Contains the quantized data, the quantized per-block scaling factor, and the per-tensor scaling factor.
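Putting the pieces together, the quantize/dequantize flow can be sketched in pure Python. Helper names here are illustrative, not the library API, and the sketch keeps block scales in full precision, whereas the real implementation packs codes into uint8 and stores block scales in FP8 E4M3 against the per-tensor factor:

```python
E2M1_VALUES = [0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0]
E2M1_MAX = 6.0

def _nearest(x):
    """Round x to the nearest E2M1 value (sign handled separately)."""
    sign = -1.0 if x < 0 else 1.0
    return sign * min(E2M1_VALUES, key=lambda v: abs(v - abs(x)))

def quantize_blocks(values, block_size):
    """Blockwise quantize sketch: per-block scale = block_amax / E2M1_MAX,
    then round each scaled element to the nearest E2M1 value."""
    scales, q = [], []
    for i in range(0, len(values), block_size):
        block = values[i:i + block_size]
        scale = max(abs(v) for v in block) / E2M1_MAX or 1.0  # avoid div by 0
        scales.append(scale)
        q.extend(_nearest(v / scale) for v in block)
    return q, scales

def dequantize_blocks(q, scales, block_size):
    """Dequantize sketch: multiply each block back by its scale."""
    return [q[i] * scales[i // block_size] for i in range(len(q))]

# Values already on the scaled E2M1 grid round-trip exactly.
vals = [0.5, 1.0, -3.0, 6.0]
q, scales = quantize_blocks(vals, block_size=4)
assert dequantize_blocks(q, scales, block_size=4) == vals
```

Values that do not land on the scaled E2M1 grid incur rounding error, which is why the block scale tracks each block's amax.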