updating SGD optimizer
embedding now uses uniform initialization
fixing broken softmax
in testing/replication we trust
coding adam
fixing introduced bug and speeing up
experimenting delta norm