common
Modules
Composable Triton JIT functions for FP8 (E4M3) fake quantization. |
|
Composable Triton JIT functions for NVFP4 (E2M1) fake quantization. |
Shared composable Triton JIT fake-quantization functions.
Format-level building blocks (FP8 E4M3, NVFP4/E2M1) reused across the gemm and attention kernel packages.