pruning_utils
Classes
- GQAInitMode
- HiddenSizeInitMode
- MlpInitMode
Functions
- resolve_pruning_mixin: Convert pruning_mixin argument to PruningMixIn instance(s).
- class GQAInitMode
Bases: Enum
- AverageKV = 'AverageKV'
- CopyAsIs = 'CopyAsIs'
- Degrouping = 'Degrouping'
- FirstKV = 'FirstKV'
- PruneKVHeads = 'PruneKVHeads'
- RandomBlock = 'RandomBlock'
- RandomKV = 'RandomKV'
- class HiddenSizeInitMode
Bases: Enum
- CopyAsIs = 'CopyAsIs'
- PruneByChannelRanking = 'PruneByChannelRanking'
- Random = 'Random'
- Truncate = 'Truncate'
- class MlpInitMode
Bases: Enum
- ConcatExpertsIntoDenseFFN = 'ConcatExpertsIntoDenseFFN'
- CopyAsIs = 'CopyAsIs'
- ExpertRemoval = 'ExpertRemoval'
- PruneByActivationsLog = 'PruneByActivationsLog'
- Random = 'Random'
- Truncate = 'Truncate'
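
Each member's value is the same string as its name, so a mode can be recovered from a configuration string either by value or by name. The sketch below illustrates this with a local copy of MlpInitMode, re-declared here only so the example runs without the library installed. The helper parse_mlp_init_mode is hypothetical and not part of pruning_utils.

```python
from enum import Enum


class MlpInitMode(Enum):
    # Local mirror of the members listed above, for illustration only.
    ConcatExpertsIntoDenseFFN = "ConcatExpertsIntoDenseFFN"
    CopyAsIs = "CopyAsIs"
    ExpertRemoval = "ExpertRemoval"
    PruneByActivationsLog = "PruneByActivationsLog"
    Random = "Random"
    Truncate = "Truncate"


def parse_mlp_init_mode(text: str) -> MlpInitMode:
    """Hypothetical helper: turn a config string into an MlpInitMode."""
    try:
        return MlpInitMode(text)  # standard Enum lookup by value
    except ValueError:
        valid = ", ".join(member.value for member in MlpInitMode)
        raise ValueError(
            f"Unknown MlpInitMode {text!r}; expected one of: {valid}"
        ) from None


mode = parse_mlp_init_mode("Truncate")
same_mode = MlpInitMode["Truncate"]  # lookup by name yields the same member
```

The same pattern applies to GQAInitMode and HiddenSizeInitMode, since their values also match their names.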
- resolve_pruning_mixin(pruning_mixin, descriptor)
Convert pruning_mixin argument to PruningMixIn instance(s).
- Parameters:
pruning_mixin – A string identifier, a PruningMixIn instance, or a list of strings and/or PruningMixIn instances.
descriptor (Type[ModelDescriptor]) – ModelDescriptor class that provides the pruning_mixins() mapping.
- Returns:
A single PruningMixIn or a List[PruningMixIn], depending on the input type.
- Return type:
PruningMixIn | List[PruningMixIn]
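
The documented behavior can be sketched as follows. This is a minimal illustration, not the library's implementation: PruningMixIn and ModelDescriptor are stand-in classes, FFNPruning, ToyDescriptor and the "ffn" identifier are hypothetical, and both the shape of the pruning_mixins() mapping (string identifier to PruningMixIn instance) and the error handling are assumptions.

```python
from typing import Dict, List, Type, Union


class PruningMixIn:
    """Stand-in for the library's PruningMixIn (assumption)."""


class FFNPruning(PruningMixIn):
    """Hypothetical mixin used only in this example."""


class ModelDescriptor:
    """Stand-in descriptor; only pruning_mixins() comes from the docs above."""

    @classmethod
    def pruning_mixins(cls) -> Dict[str, PruningMixIn]:
        return {}


class ToyDescriptor(ModelDescriptor):
    @classmethod
    def pruning_mixins(cls) -> Dict[str, PruningMixIn]:
        # Assumed shape: string identifier -> PruningMixIn instance.
        return {"ffn": FFNPruning()}


MixinArg = Union[str, PruningMixIn, List[Union[str, PruningMixIn]]]


def resolve_pruning_mixin_sketch(
    pruning_mixin: MixinArg, descriptor: Type[ModelDescriptor]
) -> Union[PruningMixIn, List[PruningMixIn]]:
    """Illustrative re-implementation of the documented contract."""
    if isinstance(pruning_mixin, list):
        # A list input yields a list output, resolved element by element.
        return [resolve_pruning_mixin_sketch(m, descriptor) for m in pruning_mixin]
    if isinstance(pruning_mixin, PruningMixIn):
        # Instances pass through unchanged.
        return pruning_mixin
    if isinstance(pruning_mixin, str):
        mapping = descriptor.pruning_mixins()
        if pruning_mixin not in mapping:
            raise ValueError(
                f"Unknown pruning mixin {pruning_mixin!r}; "
                f"available: {sorted(mapping)}"
            )
        return mapping[pruning_mixin]
    raise TypeError(f"Unsupported pruning_mixin type: {type(pruning_mixin).__name__}")


single = resolve_pruning_mixin_sketch("ffn", ToyDescriptor)
many = resolve_pruning_mixin_sketch(["ffn", FFNPruning()], ToyDescriptor)
```

Here single is one mixin instance and many is a two-element list, mirroring the "depending on input type" wording of the return description.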