Structured Zomato Restaurant Metadata and Review Corpus for Sentiment and Recommendation Analysis

Citation Author(s):
Harsh
Mishra
GLA
Ramya
Sharma
GLA
Submitted by:
Harsh Mishra
Last updated:
Wed, 04/09/2025 - 15:31
DOI:
10.21227/bfca-ft23
Data Format:
License:
0
0 ratings - Please login to submit your rating.

Abstract 

This dataset provides detailed information and customer reviews for restaurants listed on Zomato in Bangalore, with a focus on The Nest - The Den Bengaluru, located on ITPL Main Road, Whitefield. It includes key attributes such as location, contact details, rating, cuisines offered, average cost, and detailed user-generated reviews. The dataset is ideal for sentiment analysis, customer feedback mining, restaurant recommendation systems, and hospitality service quality studies.

Instructions: 
  1. Open the Dataset:

    • Use spreadsheet software like Excel, Google Sheets, or load using Python libraries such as Pandas.

    • Example (Python):

      import pandas as pd
      df = pd.read_csv("zomato_sample.csv")

  2. Understand the Columns:

    • Key columns include:

      • Restaurant Name, Location, Cuisines, Rating, Cost, Reviews, etc.

      • The Reviews column contains a list of tuples with both ratings and review text.

  3. Review Parsing (Python):

    • To extract the review data (rating + text), use:

       import ast
      df['parsed_reviews'] = df['Reviews'].apply(lambda x: ast.literal_eval(x))

  4. Sentiment Analysis (Optional):

    • Use NLP libraries like TextBlob or VADER to analyze customer sentiments.

       from textblob import TextBlob
      df['sentiment'] = df['parsed_reviews'].apply(lambda reviews: TextBlob(reviews[0][1]).sentiment.polarity)

  5. Filter by Category or Cuisine:

    • Example:

      df[df['Cuisines'].str.contains("North Indian", na=False)]
      df[df['Zone'].str.contains("Whitefield", na=False)]

  6. Visualization (Optional):

    • Use Power BI, Tableau, or Python libraries (Matplotlib, Seaborn, Plotly) for visual exploration and storytelling.