Predicting House Price in Hong Kong #4

Date: 26 July 2020 In #3, I faced difficulty in having a lot of missing values in the property transaction data. After searching the web, I found that Centaline claimed they have spent 10 million HKD to fill the missing values. Maybe, I should try scraping data from Centaline. Luckily, I have scrapped Centaline data …

Predicting House Price in Hong Kong #2

In part 1, we have talked about scraping transactional data of apartments. In part 2, let’s talk about splitting data into a training set and testing set. Why we need to split? The short answer is we want to prevent overfitting/memorizing and hope that the model trained can generalize into unknown cases. For example, if …