Skip to content

Latest commit

 

History

History
24 lines (22 loc) · 1.09 KB

README.md

File metadata and controls

24 lines (22 loc) · 1.09 KB

House Price Prediction

Predict house prices in Ames, Iowa using various machine learning models

Iowa Housing Price Prediction Kaggle competiton: https://www.kaggle.com/c/house-prices-advanced-regression-techniques/overview

Project Flow:

  • Import Libraries and Read Data
  • Look at each column in the training data - understand its purpose and importance
  • Expectation: What is my expectation about the variable influence on the 'SalePrice'
  • Separate into numerical and categorical variables
  • Data Visualization of Numerical variables as Scatterplots
  • Data Visualization of Categorical variables as Boxplots
  • Heatmaps -- find multicollinearity
  • top10 Heatmap - highly correlated with 'SalePrice' - remove multicollinear variables
  • Combine Train and Test
  • Handle Missingness
  • Target ('SalePrice') Analysis and log-Transformation
  • Feature Engineer (generate new features)
  • Dummify
  • Split back into Train and Test and Export Data into csv files
  • Model Testing And Running: Lasso, Ridge, ElasticNet, Huber, RandomForest, GradientBoost, XGBoost
  • Prediction
  • Model Selection
  • Future Improvements