Multi-Way, Multilingual Neural Machine Translation with a Shared Attention Mechanism

[1601.01073] Multi-Way, Multilingual Neural Machine Translation with a Shared Attention Mechanism 日本語の解説なし

2018-09-24

Using the Output Embedding to Improve Language Models

[1608.05859] Using the Output Embedding to Improve Language Models 以下の論文と一緒に言及されることが多い Tying Word Vectors and Word Classifiers: A Loss Framework for Language Modeling

2018-09-24

Curriculum Learning

深層学習による自然言語処理

日本語の解説も多少ある Curriculum Learning （関東CV勉強会） from 祥孝牛久 www.slideshare.net

2018-09-24

TYING WORD VECTORS AND WORD CLASSIFIERS: A LOSS FRAMEWORK FOR LANGUAGE MODELING

深層学習による自然言語処理

https://www.slideshare.net/takahirokubo7792/onehot-to-distribution-in-language-modeling

2018-09-24

Layer Normalization

深層学習による自然言語処理

解説はBatch Normとあわせる形でちらほら。 Layer Normalization@NIPS+読み会・関西 from Keigo Nishida www.slideshare.net

2018-09-24

A Theoretically Grounded Application of Dropout in Recurrent Neural Networks

深層学習による自然言語処理ゼロから作るDeep Learning

かの有名なDropout。ノードをランダムに消して学習を行うといいよーってな話。大量に解説ある olanleed.hatenablog.com

2018-09-24

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

深層学習による自然言語処理ゼロから作るDeep Learning

入力データの値が非常に飛び飛びだと学習に影響を及ぼすので、正規化しようねという話。日本語の解説は大量に見つかる yusuke-ujitoko.hatenablog.com

2018-09-24

Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification

深層学習による自然言語処理ゼロから作るDeep Learning

所謂Heの初期値。 ReLU関数を使うときの、重みの初期値をXXXXの範囲内にするといいよってなことが書かれている。 [内容] ・PReRUの紹介・Heの初期値 https://speakerdeck.com/satuma777/lun-wen-shao-jie-delving-deep-into-rectifiers-surpassing-human-l…

2018-09-24

Google’s Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation

深層学習による自然言語処理

英語を読まなくても、図をみてるだけで興味深い。 Google翻訳のAIは独自の「中間言語」を習得して「学習してない言語間の翻訳」すら可能な段階に突入 - GIGAZINE www.yasuhisay.info

2018-09-24

ADAM: A METHOD FOR STOCHASTIC OPTIMIZATION

深層学習による自然言語処理

postd.cc

2018-09-24

PRACTICAL BAYESIAN OPTIMIZATION OF MACHINE LEARNING ALGORITHMS

深層学習による自然言語処理

https://arxiv.org/abs/1206.2944 日本語解説ベイズ的最適化(Bayesian Optimization)の入門とその応用 from issei_sato www.slideshare.net 論文紹介:Practical bayesian optimization of machine learning algorithms(nips2012) from Keisuke Uto www.slid…

鬼城論

プロフェッショナルを目指すため日々の格闘をつづる

深層学習による自然言語処理

Multi-Way, Multilingual Neural Machine Translation with a Shared Attention Mechanism

Using the Output Embedding to Improve Language Models

Curriculum Learning

TYING WORD VECTORS AND WORD CLASSIFIERS: A LOSS FRAMEWORK FOR LANGUAGE MODELING

Layer Normalization

A Theoretically Grounded Application of Dropout in Recurrent Neural Networks

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification

Google’s Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation

ADAM: A METHOD FOR STOCHASTIC OPTIMIZATION

PRACTICAL BAYESIAN OPTIMIZATION OF MACHINE LEARNING ALGORITHMS