A distribution-based model to learn bilingual word embeddings

H Cao, T Zhao, S Zhang, Y Meng - Proceedings of COLING 2016 …, 2016 - aclanthology.org
H Cao, T Zhao, S Zhang, Y Meng
Proceedings of COLING 2016, the 26th International Conference on …, 2016aclanthology.org
We introduce a distribution based model to learn bilingual word embeddings from
monolingual data. It is simple, effective and does not require any parallel data or any seed
lexicon. We take advantage of the fact that word embeddings are usually in form of dense
real-valued low-dimensional vector and therefore the distribution of them can be accurately
estimated. A novel cross-lingual learning objective is proposed which directly matches the
distributions of word embeddings in one language with that in the other language. During …
Abstract
We introduce a distribution based model to learn bilingual word embeddings from monolingual data. It is simple, effective and does not require any parallel data or any seed lexicon. We take advantage of the fact that word embeddings are usually in form of dense real-valued low-dimensional vector and therefore the distribution of them can be accurately estimated. A novel cross-lingual learning objective is proposed which directly matches the distributions of word embeddings in one language with that in the other language. During the joint learning process, we dynamically estimate the distributions of word embeddings in two languages respectively and minimize the dissimilarity between them through standard back propagation algorithm. Our learned bilingual word embeddings allow to group each word and its translations together in the shared vector space. We demonstrate the utility of the learned embeddings on the task of finding word-to-word translations from monolingual corpora. Our model achieved encouraging performance on data in both related languages and substantially different languages.
aclanthology.org
Showing the best result for this search. See all results