Hotel Reviews from around the world with Sentiment Values and Review Ratings in different Categories for Natural Language Processing
The dataset consists of reviews of hotels around the world. Its columns range from Location and Trip Type to several review parameters, each with an individual review score. After preprocessing, the data can be used for tasks such as review categorization, topic extraction, sentiment analysis and location-based quality calculation. Trustworthy real-world data is valuable and hard to come by, so this dataset should be useful to researchers and professionals alike.
The dataset consists of 69308 instances containing all columns. A separate file containing only sentiment values in the range [-1, 1] has also been added.
The data headings are:
· Sleep Quality
· Check in / front desk
· Business service (e.g., internet access)
From the Value column onward, every column is a rating given by the reviewer. A value of -1 indicates a missing rating.
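Because -1 is a placeholder and not a real score, it should be left out of any aggregate. The following is a minimal sketch of one way to do that, assuming the rating columns have already been read into a list of dictionaries. The function name `mean_rating` and the sample rows are made up for illustration and are not part of the dataset.

```python
import csv
import io


def mean_rating(rows, column):
    """Average one rating column, skipping rows where the rating is -1 (missing)."""
    ratings = (float(row[column]) for row in rows)
    present = [value for value in ratings if value != -1]
    # Return None when every rating in the column is missing.
    return sum(present) / len(present) if present else None


# Made-up sample in the same tab-separated layout; real files have more columns.
sample = (
    "Sleep Quality\tCheck in / front desk\n"
    "4\t-1\n"
    "-1\t5\n"
    "2\t3\n"
)
rows = list(csv.DictReader(io.StringIO(sample), delimiter="\t"))

print(mean_rating(rows, "Sleep Quality"))
print(mean_rating(rows, "Check in / front desk"))
```

Averaging without this filter would pull every mean downward, since each missing rating would count as a score of -1.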
The sentiment data file consists of Id, Text and Sentiment Value.
The sentiment values were extracted from the main dataset using preprocessing and a semi-supervised algorithm.
All values are tab-separated.
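Since the data is tab-separated, the sentiment file can be read with Python's standard `csv` module. The sketch below assumes the file starts with a header row naming the columns Id, Text and Sentiment Value; that header, the function name `read_sentiment` and the sample rows are assumptions for illustration only.

```python
import csv
import io


def read_sentiment(stream):
    """Yield (id, text, sentiment) tuples from a tab-separated stream.

    Assumes a header row with the columns Id, Text and Sentiment Value.
    """
    # QUOTE_NONE keeps quotation marks inside review text as literal characters.
    reader = csv.DictReader(stream, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        score = float(row["Sentiment Value"])
        # The description states that sentiment values lie in [-1, 1].
        if not -1.0 <= score <= 1.0:
            raise ValueError(f"sentiment out of range for Id {row['Id']}: {score}")
        yield row["Id"], row["Text"], score


# Made-up sample rows in the assumed layout.
sample = (
    "Id\tText\tSentiment Value\n"
    "1\tGreat stay, friendly staff\t0.8\n"
    "2\tNoisy room\t-0.5\n"
)
records = list(read_sentiment(io.StringIO(sample)))
print(records)
```

For a real file, the same function can be given an open file handle, for example `open(path, newline="", encoding="utf-8")`, where `path` points at the downloaded sentiment file. If the file turns out to have no header row, `csv.reader` with positional indexing would be the simpler choice.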