This project is focused on 1] Develop an algorithm to reliably estimate the value of residential houses based on fixed characteristics. 2] Identify characteristics of houses that the company can cost-effectively change/renovate with their construction team. 3] Evaluate the mean dollar value of different renovations.
The dataset was obtained from the publicily available Ames housing data recently made available on kaggle.
Perform any cleaning, feature engineering, and EDA you deem necessary.
Be sure to remove any houses that are not residential from the dataset.
Identify fixed features that can predict price.
Train a model on pre-2010 data and evaluate its performance on the 2010 houses.
Characterize your model. How well does it perform? What are the best estimates of price?
In order to analyze the data and generate insights out of it, we followed the a process that looked at the data using the following few of many visualizations:
Visualizations
Correlations
Linear Regression