This pre-trained Word2Vec model has 300-dimensional vectors for more than 0.5 million Nepali words and phrases. A separate Nepali language text corpus was created using the news contents freely available in the public domain. The text corpus contained more than 90 million running words. The "Nepali Text Corpus" can be accessed freely fromĀ http://dx.doi.org/10.21227/jxrd-d245.

Dataset Files

You must be an IEEE Dataport Subscriber to access these files. Subscribe now or login.

Documentation: 
[1] Rabindra Lamsal, "300-Dimensional Word Embeddings for Nepali Language", IEEE Dataport, 2019. [Online]. Available: http://dx.doi.org/10.21227/dz6s-my90. Accessed: Jan. 15, 2025.
@data{dz6s-my90-19,
doi = {10.21227/dz6s-my90},
url = {http://dx.doi.org/10.21227/dz6s-my90},
author = {Rabindra Lamsal },
publisher = {IEEE Dataport},
title = {300-Dimensional Word Embeddings for Nepali Language},
year = {2019} }
TY - DATA
T1 - 300-Dimensional Word Embeddings for Nepali Language
AU - Rabindra Lamsal
PY - 2019
PB - IEEE Dataport
UR - 10.21227/dz6s-my90
ER -
Rabindra Lamsal. (2019). 300-Dimensional Word Embeddings for Nepali Language. IEEE Dataport. http://dx.doi.org/10.21227/dz6s-my90
Rabindra Lamsal, 2019. 300-Dimensional Word Embeddings for Nepali Language. Available at: http://dx.doi.org/10.21227/dz6s-my90.
Rabindra Lamsal. (2019). "300-Dimensional Word Embeddings for Nepali Language." Web.
1. Rabindra Lamsal. 300-Dimensional Word Embeddings for Nepali Language [Internet]. IEEE Dataport; 2019. Available from : http://dx.doi.org/10.21227/dz6s-my90
Rabindra Lamsal. "300-Dimensional Word Embeddings for Nepali Language." doi: 10.21227/dz6s-my90