Mixture-of-Experts-Survey

Visual Relationship Detection with Language Priors

LSTM

On the Properties of Neural Machine Translation: Encoder–Decoder Approaches

Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling

Attention Is All You Need
