You have likely figured it out — this site is in the making. Please check back regularly to watch as I build it bit by bit.
This project focuses on applying Natural Language Processing (NLP) techniques to perform sentiment analysis on customer reviews from the Yelp Open Dataset. The goal is to classify individual reviews as positive, neutral, or negative based on customer sentiment, and then aggregate these sentiments at the business or store level. By analyzing sentiment trends, the project aims to identify key areas for improvement and predict potential impacts on sales and customer satisfaction.
You have likely figured it out — this site is in the making. Please check back regularly to watch as I build it bit by bit.
In industries where customer feedback holds significant influence —such as dining, hospitality, and retail— sentiment analysis has proven being a powerful tool for targeted improvements to stay competitive by increasing customer satisfaction.
Meet Lorena Castillo, a fictitious Regional Manager of First Watch restaurants in the Tampa, Florida area. As many other regional managers —maybe just like you— she saw the potential to increase customer value and revenue for her locations through professional sentiment analyses. Follow her journey to discover how her data analyst transformed raw customer feedback into business insights by classifying reviews as positive, negative, or neutral and aggregating the data at the store level. Learn how sentiment analysis can support evidence-based decisions that drive business performance and create customer value.
Lorena Castillo took on the role of Regional Manager
for First Watch Restaurants around Tampa, Florida in October 2019.
After taking over operations, she realized that the restaurant performance metrics across various locations were poorly tracked and inconsistantly reported. Additionally,
customer satisfaction analyses were sparse and outdated, making it difficult to gauge how well stores were performing.
To develop a strong strategy for enhancing restaurant operations, improving customer experience and increasing profitability, Lorena needed a clear understanding
of the current situation at each location. While reviewing customer feedback from various online sources, she quickly became overwhelmed by the sheer volume of
unorganized data and conflicting opinions.
Lorena decided to seek professional support for the analysis from a reliable service provider. Understanding the urgency of the project, she reached out to four
consulting firms for proposals, looking for an analyses delivering clear insights on customer feedback and recommendations to address potential issues in a timely manner.
Lorena reviewed several proposals, and selected the proposal with me as data analyst since it vontained a detailed, transparent, and problem-focused approach.
This proposal exactly provided her with the analyses she needed to make informed decisions for the entire region.
After working closely with Lorena and understanding her needs, we identified the following outcomes to enhance customer satisfaction and business success:
In this project, I am using customer review data provided by Yelp. Yelp offers two methods of data access: Yelp Open Dataset and Yelp Fusion.
Yelp Fusion API provides real-time data but imposes several restrictions. Customer reviews are limited in both the number of requests that can be made per day and the number of characters in each review. Additionally, data retrieved through API can only be cached for 24 hours, and the process of retrieving data can be time-consuming. These limitations can complicate data analysis, especially when it is important to use the same dataset for maintaining reproducibility throughout the project.
For these reasons, I have chosen to use Yelp’s Open Dataset, which is publicly and freely accessible through Yelp Open Data Download site. Yelp generously provides the Open Dataset for non-commercial personal, educational, and academic purposes and rightfully prohibits the uncontrolled redistribution of their data. To comply with Yelp's Terms of Service, I will not save the Open Dataset or derivates of it in my repository. Instead, please download the Open Dataset directly from Yelp's Open Dataset website to run the script.
Near a great forest, there lived a poor woodcutter and his wife and his two children; the boy's name was Hansel, and the girl's name was Gretel. He had little to bite and to break, and once, when great scarcity fell on the land, he could no longer procure even daily bread. Now when he thought over this by night in his bed, and tossed about in his anxiety, he groaned and said to his wife: "What is to become of us? How are we to feed our poor children, when we no longer have anything even for ourselves?"
"I'll tell you what, husband," answered the woman, "early tomorrow morning we will take the children out into the forest to where it is the thickest; there we will light a fire for them,
and give each of them one more piece of bread, and then we will go to our work and leave them alone. They will not find the way home again, and we shall be rid of them."
"No, wife," said the man, "I will not do that; how can I bear to leave my children alone in the forest?—the wild animals would soon come and tear them to pieces."
"Oh, you fool!" said she, "then we must all four die of hunger; you may as well plane the planks for our coffins," and she left him no peace until he consented.
"But I feel very sorry for the poor children, all the same," said the man.