quant_embedding
Quantized Embedding.
nn.Embedding quantization is weight-only: only the lookup table (weight) is
fake-quantized. Embedding inputs are integer indices, so the input_quantizer is
still registered (letting config entries like "*input_quantizer" target it) but is
disabled by default, and integer tensors pass through untouched.
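The behavior described above can be illustrated with a minimal sketch in plain PyTorch. This is not the library's _QuantEmbedding: the names SketchQuantizer, SketchQuantEmbedding, and the enabled flag are hypothetical, and symmetric per-tensor 8-bit fake quantization is an assumed scheme chosen only for illustration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class SketchQuantizer(nn.Module):
    """Hypothetical quantizer: symmetric per-tensor fake quantization."""

    def __init__(self, num_bits: int = 8, enabled: bool = True):
        super().__init__()
        self.num_bits = num_bits
        self.enabled = enabled

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        if not self.enabled:
            return x  # disabled: the tensor passes through untouched
        qmax = 2 ** (self.num_bits - 1) - 1
        scale = x.abs().max().clamp(min=1e-12) / qmax
        q = torch.clamp(torch.round(x / scale), -qmax, qmax) * scale
        # Straight-through estimator: forward uses q, backward sees identity.
        return x + (q - x).detach()


class SketchQuantEmbedding(nn.Embedding):
    """Hypothetical weight-only quantized embedding (illustration only)."""

    def __init__(self, *args, **kwargs):
        super().__init__(*args, **kwargs)
        # Registered so it can be targeted by name, but off by default
        # because the inputs are integer indices.
        self.input_quantizer = SketchQuantizer(enabled=False)
        self.weight_quantizer = SketchQuantizer(enabled=True)

    def forward(self, indices: torch.Tensor) -> torch.Tensor:
        indices = self.input_quantizer(indices)
        weight = self.weight_quantizer(self.weight)
        return F.embedding(
            indices, weight, self.padding_idx, self.max_norm,
            self.norm_type, self.scale_grad_by_freq, self.sparse,
        )


torch.manual_seed(0)
emb = SketchQuantEmbedding(10, 4)
idx = torch.tensor([[1, 2], [3, 9]])
out = emb(idx)  # rows of the fake-quantized lookup table
```

In this sketch, enabling the input quantizer would turn the integer indices into floats and break the lookup, which is one plausible reason to keep it disabled by default.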
Classes
- QuantEmbedding: alias of _QuantEmbedding