convert_hf_config
Convert modelopt quantization export config to align with llm-compressor config format.
Functions
convert_hf_quant_config_format(input_config): Converts modelopt quantization config dictionary to align with llm-compressor config format.
- convert_hf_quant_config_format(input_config)
Converts modelopt quantization config dictionary to align with llm-compressor config format.
- Parameters:
input_config (dict) – The original quantization config dictionary.
- Return type:
dict
Example (input):
{
    "producer": {"name": "modelopt", "version": "0.19.0"},
    "quantization": {
        "quant_algo": "FP8",
        "kv_cache_quant_algo": "FP8",
        "exclude_modules": ["lm_head"],
    },
}
- Returns:
A new dictionary in the target format.
Example (for FP8 input):
{
    "config_groups": {
        "group_0": {
            "input_activations": {"dynamic": False, "num_bits": 8, "type": "float"},
            "weights": {"dynamic": False, "num_bits": 8, "type": "float"},
        }
    },
    "ignore": ["lm_head"],
    "quant_algo": "FP8",
    "kv_cache_scheme": "FP8",
    "producer": {"name": "modelopt", "version": "0.29.0"},
}
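The key renaming shown in the FP8 example can be sketched in plain Python. This is a minimal illustration, not modelopt's implementation: `sketch_convert_fp8` is a hypothetical stand-in that covers only the FP8 case documented above, and it assumes the `producer` block is copied over unchanged.

```python
from copy import deepcopy


def sketch_convert_fp8(input_config: dict) -> dict:
    """Hypothetical stand-in that mirrors only the documented FP8 example."""
    quant = input_config.get("quantization", {})
    if quant.get("quant_algo") != "FP8":
        raise ValueError("this sketch only covers the FP8 example")

    # Static (non-dynamic) 8-bit float scheme, as in the documented output.
    fp8_scheme = {"dynamic": False, "num_bits": 8, "type": "float"}

    output = {
        "config_groups": {
            "group_0": {
                "input_activations": dict(fp8_scheme),
                "weights": dict(fp8_scheme),
            }
        },
        # "exclude_modules" in the input appears as "ignore" in the output.
        "ignore": list(quant.get("exclude_modules", [])),
        "quant_algo": quant["quant_algo"],
    }

    # "kv_cache_quant_algo" in the input appears as "kv_cache_scheme".
    if "kv_cache_quant_algo" in quant:
        output["kv_cache_scheme"] = quant["kv_cache_quant_algo"]

    # Assumption: producer metadata is carried over as-is.
    if "producer" in input_config:
        output["producer"] = deepcopy(input_config["producer"])

    return output


example_input = {
    "producer": {"name": "modelopt", "version": "0.19.0"},
    "quantization": {
        "quant_algo": "FP8",
        "kv_cache_quant_algo": "FP8",
        "exclude_modules": ["lm_head"],
    },
}
converted = sketch_convert_fp8(example_input)
```

The sketch flattens the nested `quantization` block into top-level keys and builds a single `group_0` entry. The real function may handle other algorithms and additional fields that this illustration leaves out.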