By GrandMingLakeのSummerRainLotus
Daodao Wang Weiye Deng Data Scientist
[email protected]
MS. Analytics - Data Science
Georgetown UniversityData Scientist
[email protected]
MS. Analytics - Data Science
Georgetown University
The goal of this project is to build a multi classification recommendation model to classify the predefined hotel clusters according to the log of customer behavior data (search and other attributes associated with the user events) provided by Expedia. More details could be found on the Kaggle Expedia Hotel Recommendation Page.
Install Spark and sparklyr (Open RStudio, type in the following code in the RStudio Console):
install.packages("sparklyr")
library(sparklyr)
spark_install(version = "1.6.2")
Install other necessary packages (Open RStudio, type in the following code in the RStudio Console:):
install.packages('ggplot2')
install.packages('readr')
install.packages('lubridate')
install.packages('utils')
install.packages('chron')
install.packages('glmnet')
install.packages('sparklyr')
install.packages('dplyr')
install.packages('reshape2')
install.packages('caret')
install.packages('h2o')