Type Alias AttentionSoftmax<T>

AttentionSoftmax: SoftmaxLastDim<T>

Softmax for attention, typically applied over the last dimension (the key sequence) to turn attention scores into attention weights.
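
To make "softmax over the last dimension" concrete, here is a minimal runtime sketch on a plain `number[][]` of attention scores, where each row is a query position and each column is a key position. The function name `softmaxLastDim` and the nested-array representation are assumptions made for illustration; they are not the library's `SoftmaxLastDim<T>` type, whose definition is not shown here.

```typescript
// Hypothetical helper for illustration only; not part of the documented API.
// Applies softmax independently to each row, i.e. over the last (key) dimension.
function softmaxLastDim(scores: number[][]): number[][] {
  return scores.map((row) => {
    // Subtract the row maximum before exponentiating for numerical stability.
    const max = Math.max(...row);
    const exps = row.map((x) => Math.exp(x - max));
    const sum = exps.reduce((acc, e) => acc + e, 0);
    // Normalize so the weights for this query sum to 1.
    return exps.map((e) => e / sum);
  });
}

// Two query positions attending over three key positions.
const weights = softmaxLastDim([
  [1, 2, 3],
  [0, 0, 0],
]);
```

Each row of `weights` sums to 1, so every query position gets its own probability distribution over the keys. Equal scores, as in the second row, yield uniform weights.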