Skip to main content

Datasets

Standard Dataset

Hotel Reviews from around the world with Sentiment Values and Review Ratings in different Categories for Natural Language Processing

Average: 5 (1 vote)

Abstract

The dataset consists of reviews for various hotels throughout the world and data columns range from Location, Trip Type to various parameters of reviewing with individual review score. The data can be preprocessed and used for various purposes ranging from review categorization, topic extraction, sentiment analysis, location based quality calculation etc. Trustworthy real world data comes handy now-a-days and is tough to get a grasp on. So this dataset will be a good contribution for the researcher community as well as professionals. 

 

Instructions:

The dataset consists of 69308 instances containing all columns. A seperate file containing only sentiment values ranging from

[-1,1] have also been added. 

The various data headings are :

·         ReviewId       

·         UserLocation

·         ReviewedDate

·         HotelName

·         DateOfStay

·         ReviewText   

·         TripType

·         Value    

·         Cleanliness   

·         Service 

·         Location

·         Sleep Quality 

·         Rooms  

·         Check in / front desk    

·         Business service (e.g., internet access)         



Starting from Value all columns are ratings given by reviewers. Presence of -1 depicts missing values.

The sentiment data file consists of Id, Text and Sentiment Value .

The sentiment value was extracted from given dataset using some preprocessing and semi-supervised algorithm.

All the values are Tab separated Values.

 

i am unable to download the dataset actually i want to do extend this but unable to download the dataset. if the dataset available then send me i want to reproduce this paper and my mail is "200320064@fzu.edu.cn".
Jin Cai Mon, 02/20/2023 - 11:56 Permalink
For selecting Bibtex contents, double click on IEEE contents. Then use Ctrl+C to copy. It's a bug and we need to wait till its fixed. Till then this is how you can cite.
Avishek Garain Tue, 10/06/2020 - 07:47 Permalink
i am unable to download the dataset actually i want to do extend this but unable to download the dataset.
Muhammad Awais Thu, 06/17/2021 - 17:14 Permalink
After reading paper "An ensemble-based hotel recommender system using sentiment analysis and aspect categorization of hotel reviews" i like this paper and i want to reproduce the paper. So, please mail me this dataset my mail address is "iamahsanulhaq041@gmail.com".
AHSAN UL HAQ Wed, 05/25/2022 - 06:26 Permalink
if the dataset available then send me i want to reproduce this paper and my mail is "rkmishra6300@gmail.com".
Rohit Mishra Tue, 10/03/2023 - 07:19 Permalink
I am unable to download the dataset. Is there any any way I an download the dataset?
Tirath Savasaiya Wed, 11/15/2023 - 23:35 Permalink