NLP coding
New experimental CAI self attention
MulWeights is now a function
fixes TNNetSigmoid.Backpropagate()
InitHeUniformForAllDenseLayers
Adding InitGlorotBengioUniformForAllConvLayers
CAI is again using He for convolutional layers
Convolutional layers got the same initialization as Keras
Updating CAI Transformer