Mixture-of-Experts-Survey
Visual Relationship Detection  with Language Priors
LSTM
On the Properties of Neural Machine Translation= Encoder–Decoder Approaches
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling
Attention Is All You Need