Tensor2Tensor for Neural Machine Translation

A Vaswani, S Bengio, E Brevdo, F Chollet… - arXiv preprint arXiv …, 2018 - arxiv.org
arXiv preprint arXiv:1803.07416, 2018
… Abstract: Tensor2Tensor is a library for deep learning models that is well-suited for neural
machine translation and includes the reference implementation of the state-of-the-art
Transformer model. … n is the sequence length, d is the representation dimension, k is the
kernel size of convolutions and r the size of the neighborhood in restricted self-attention.
Layer Type … In Tensor2Tensor, we can visualize attention distributions from our models
for each individual layer and head. Observing them closely, we see that the models learn to …