A Theoretically Grounded Application of Dropout in Recurrent Neural Networks

深層学習による自然言語処理ゼロから作るDeep Learning

かの有名なDropout。ノードをランダムに消して学習を行うといいよーってな話。大量に解説ある olanleed.hatenablog.com

2018-09-24

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

深層学習による自然言語処理ゼロから作るDeep Learning

入力データの値が非常に飛び飛びだと学習に影響を及ぼすので、正規化しようねという話。日本語の解説は大量に見つかる yusuke-ujitoko.hatenablog.com

2018-09-24

Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification

深層学習による自然言語処理ゼロから作るDeep Learning

所謂Heの初期値。 ReLU関数を使うときの、重みの初期値をXXXXの範囲内にするといいよってなことが書かれている。 [内容] ・PReRUの紹介・Heの初期値 https://speakerdeck.com/satuma777/lun-wen-shao-jie-delving-deep-into-rectifiers-surpassing-human-l…

鬼城論

プロフェッショナルを目指すため日々の格闘をつづる

ゼロから作るDeep Learning

A Theoretically Grounded Application of Dropout in Recurrent Neural Networks

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification