Ross Wightman 20d66beead Move RMSpropTF another step closer to Tensorflow impl
* init square_avg with one instead of zero as per TF
* match TF order of ops for square_avg accumulation
* move LR scaling to momentum buffer accumulator as per TF
* add decoupled weight decay flag (not in TF)
2019-05-14 18:35:43 -07:00
..