Skip to content

dwy904/RainDrop_ExpediaRecommendation

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

66 Commits
 
 
 
 

Repository files navigation

Expedia Hotel Recommendation

By GrandMingLakeのSummerRainLotus

Daodao Wang Weiye Deng
Data Scientist
[email protected]
MS. Analytics - Data Science
Georgetown University
Data Scientist
[email protected]
MS. Analytics - Data Science
Georgetown University

Introduction

The goal of this project is to build a multi classification recommendation model to classify the predefined hotel clusters according to the log of customer behavior data (search and other attributes associated with the user events) provided by Expedia. More details could be found on the Kaggle Expedia Hotel Recommendation Page.


Environment Setup

Install Spark and sparklyr (Open RStudio, type in the following code in the RStudio Console):

install.packages("sparklyr")
library(sparklyr)
spark_install(version = "1.6.2")

Install other necessary packages (Open RStudio, type in the following code in the RStudio Console:):

install.packages('ggplot2')
install.packages('readr')
install.packages('lubridate')
install.packages('utils')
install.packages('chron')
install.packages('glmnet')
install.packages('sparklyr')
install.packages('dplyr')
install.packages('reshape2')
install.packages('caret')
install.packages('h2o')

Data Cleaning and Feature Engineering

Data Partitioning

Model Initialization

Parameter Tuning and Cross Validation

Model Combination

Model Finalization

About

Expedia Recommender Kaggle

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published