Coding transformers
Adding experimental layer TNNetMovingScale
Coding transformers
Coding multi-head attention