-
Notifications
You must be signed in to change notification settings - Fork 0
Home
Hai To edited this page Dec 14, 2018
·
2 revisions
In this repo we will explore the Yelp data-set from Kaggle a little bit.
- excerpt of Yelp's businesses, reviews and user data
- for 11 metropolitan areas across 4 countries
- 5,200,000 user reviews on 174,000 businesses
Market / Businesses
- Can we identify "trending" areas / neighbourhoods?
- hip areas like Neukoeln, Kreuzberg etc.
- Are the clusters of areas homogeneous in e.g. price level / categories / styles?
- think of posh (Friedrichstrasse) vs. trashy (Warschauerstr.)
- Would a retailer's store fit more into certain areas?
- given a list of retailer venues, can we predict checkins (or sales even?)
- Are there businesses that synergize?
Users
- Do user affect / influence their friends (to test / visit venues)?
- can we identify inluencers
- Can we classify users into archetypes?
- e.g. foodies, hipsters, fashionistas, parents, yuppies, etc.
- Can we recommend businesses / venues to users?
- predict user rating for biz
- Can we predict whether / which businesses a user will visit next?
- base on behaviour, preference, friends etc.