kernels
Modules
Common (non-domain-specific) kernels. |
|
Quantization kernels: conv (implicit GEMM) and gemm (tensor_quant + Triton FP4/FP8). |
|
Sparsity kernels: attention (Triton skip-softmax backends) and gemm (placeholder). |
ModelOpt kernel library: common, quantization (conv, gemm), sparsity (attention, gemm).