Context Vectors Are Reflections of Word Vectors in Half the Dimensions
dc.contributor.author | Assylbekov, Zhenisbek | |
dc.contributor.author | Takhanov, Rustem | |
dc.date.accessioned | 2019-12-11T08:03:54Z | |
dc.date.available | 2019-12-11T08:03:54Z | |
dc.date.issued | 2019-09 | |
dc.description | https://arxiv.org/pdf/1902.09859.pdf | en_US |
dc.description.abstract | This paper takes a step towards theoretical analysis of the relationship between word embeddings and context embeddings in models such as word2vec. We start from basic probabilistic assumptions on the nature of word vectors, context vectors, and text generation. These assumptions are supported either empirically or theoretically by the existing literature. Next, we show that under these assumptions the widely-used word-word PMI matrix is approximately a random symmetric Gaussian ensemble. This, in turn, implies that context vectors are reflections of word vectors in approximately half the dimensions. As a direct application of our result, we suggest a theoretically grounded way of tying weights in the SGNS model. | en_US |
dc.identifier.citation | Assylbekov, Z., & Takhanov, R. (2019). Context Vectors are Reflections of Word Vectors in Half the Dimensions. Journal of Artificial Intelligence Research, 66, 225–242. https://doi.org/10.1613/jair.1.11368 | en_US |
dc.identifier.other | 10.1613/jair.1.11368 | |
dc.identifier.uri | http://nur.nu.edu.kz/handle/123456789/4369 | |
dc.language.iso | en | en_US |
dc.publisher | AI ACCESS FOUNDATION | en_US |
dc.rights | Attribution-NonCommercial-ShareAlike 3.0 United States | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/3.0/us/ | * |
dc.subject | Context Vectors | en_US |
dc.subject | Reflections of Word Vectors | en_US |
dc.subject | Word Vectors | en_US |
dc.subject | Euclidean norm | en_US |
dc.subject | word2vec | en_US |
dc.title | Context Vectors Are Reflections of Word Vectors in Half the Dimensions | en_US |
dc.type | Article | en_US |
workflow.import.source | science |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Context Vectors are Reflections of Word Vectors in Half the.pdf
- Size:
- 422.9 KB
- Format:
- Adobe Portable Document Format
- Description:
- Article
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 6 KB
- Format:
- Item-specific license agreed upon to submission
- Description: