dflash

Modules

modelopt.torch.speculative.dflash.conversion

DFlash conversion/restore utilities.

modelopt.torch.speculative.dflash.default_config

Default DFlash architecture config.

modelopt.torch.speculative.dflash.dflash_model

DFlash model to support block-wise parallel speculative decoding.

DFlash Optimization Method.