D0x0000FF@tg says to YSITD
譬如說你weight_1調整1,loss可以降低1,然後你weight _2 調整1 loss卻降了100,這時候你會比較傾向讓weight_2調整多一點,這樣會比較快達到min