utils

DeepSeek V3.

Classes

DeepseekV3RMSNorm

Deepseek V3 RMSNorm implementation.

class DeepseekV3RMSNorm

Bases: Module

Deepseek V3 RMSNorm implementation.

__init__(hidden_size, eps=1e-06)

DeepseekV3RMSNorm is equivalent to T5LayerNorm.

forward(hidden_states)

Forward function of DeepseekV3RMSNorm.