House Prices Prediction : Multiple Linear Regression vs Ridge vs Polynomial

JASMAN PARDEDE, RAYYAN RAYYAN

Sari


Abstract

The phenomenon of falling or rising house prices has attracted the interest of researchers as well as many other interested parties. The house not only be used as a place to live, it is also used as an investment instrument. Errors in determining the price of the house can result in losses. However, with data from developers, machine learning models can be applied for price predictive analysis. Several methods are used such as multiple linear regression, ridge, and polynomial. Model performance was measured using evaluation matrices such as Mean Squared Error (MSE), Root Mean Squared Error (RMSE), and also Mean Absolute Error (MAE). Multiple linear regression models yielding values of 0.0952, 0.3086, 0.2452, ridge yielding values of 0.0952, 0.3086, 0.2453, and polynomial yielding values of 0.0874, 0.2955, 0.2344. These results prove that the polynomial regression model with a value of degree = 2, coupled with a regularization technique using ridge regression with a value of Alpha = 100 can produces the best performance judging from the value of the error matrix it produces, the model will also be used to predict house prices in a web-based applications.

Keywords: Multiple Linear Regression, Ridge Regression, Polynomial Regression

Abstrak

Fenomena turun atau naiknya harga rumah telah menarik minat dari peneliti juga banyak pihak lain yang berkepentingan. Rumah tidak hanya dijadikan sebagai tempat tinggal, rumah juga digunakan sebagai instrumen investasi. Kesalahan menentukan harga rumah dapat mengakibatkan kerugian. Namun, dengan adanya data – data dari pengembang, pembuatan model machine learning dapat diaplikasikan guna keperluan analisis prediktif harga. Beberapa metode yang digunakan seperti regresi linear berganda, ridge, dan polinomial. Performa model diukur menggunakan matriks evaluasi seperti Mean Squared Error (MSE), Root Mean Squared Error (RMSE), dan juga Mean Absolute Error (MAE). Model regresi linear berganda menghasilkan nilai 0.0952, 0.3086, 0.2452, ridge menghasilkan nilai 0.0952, 0.3086, 0.2453, dan polinomial menghasilkan nilai 0.0874, 0.2955, 0.2344. Hasil tersebut membuktikan bahwa model regresi polinomial dengan nilai degree = 2, ditambah dengan teknik regularisasi regresi ridge dengan nilai Alpha = 100 dapat menghasilkan performansi terbaik dilhat dari nilai matriks error yang dihasilkannya, model tersebut juga akan digunakan untuk melakukan prediksi harga rumah pada aplikasi yang berbasis website.

Kata kunci: Regresi Linear Berganda, Regresi Ridge, Regresi Polinomial 


Teks Lengkap:

PDF (English)

Referensi


Deb, C., Zhang, F., Yang, J., Lee, S. E., & Shah, K. W. (2017). A review on time series forecasting techniques for building energy consumption. In

Renewable and Sustainable Energy Reviews (Vol. 74, pp. 902–924).

Febyanti, F. (2022). Pemodelan Faktor-Faktor yang Mempengaruhi Harga Rumah di Jabodetabek Menggunakan Metode Regresi Probit.

Hoerl, R. W. (2020). Ridge Regression: A Historical Context. Technometrics, 62(4), 420–425.

Mahesh, B. (2018). Machine Learning Algorithms-A Review Machine Learning Algorithms-A Review View project Six Stroke Engine View project Batta Mahesh Independent Researcher Machine Learning Algorithms-A Review.

Mcleod, S. (2018). Maslow’s Hierarchy of Needs.

Naser, M. Z., & Alavi, A. H. (2020). Insights into Performance Fitness and Error Metrics for Machine Learning.

Ranstam, J., & Cook, J. A. (2018). LASSO regression. British Journal of Surgery, 105(10), 1348.

Ray, S. (2019). A Quick Review of Machine Learning Algorithms

Reza Fahlepi, M., Widjaja, A., & Surya Sumantri No, J. (2019). Penerapan Metode Multiple Linear Regression Untuk Prediksi Harga Sewa Kamar Kost.

Roihan, A., Abas Sunarya, P., & Rafika, A. S. (2020). IJCIT (Indonesian Journal on Computer and Information Technology) Pemanfaatan Machine Learning dalam Berbagai Bidang: Review paper. In IJCIT (Indonesian Journal on Computer and Information Technology) (Vol. 5, Issue 1).

Sarstedt, M., & Mooi, E. (2019). Regression Analysis (pp. 209–256).

Taylor, S. J., & Letham, B. (2018). Forecasting at Scale. American Statistician, 72 (1), 37–45.

Thamarai, M., & Malarvizhi, S. P. (2020). House Price Prediction Modeling Using Machine Learning. International Journal of Information Engineering and Electronic Business, 12 (2), 15–20.

Tranmer, M., Murphy, J., Elliot, M., & Pampaka, M. (2020). Multiple Linear Regression (2nd Edition).

Wahyuni, E. D., Arifiyanti, A. A., & Kustyani, M. (2019). Exploratory Data Analysis dalam Konteks Klasifikasi Data Mining. 263–269.

Zhou, C. (2021). House price prediction using polynomial regression with Particle Swarm Optimization. IOP Conference Series: Earth and Environmental Science, 1802 (3).




DOI: https://doi.org/10.26760/mindjournal.v8i1.14-26

Refbacks

  • Saat ini tidak ada refbacks.


____________________________________________________________

ISSN (cetak) : 2338-8323   |  ISSN (elektronik) :   2528-0902 

diterbitkan oleh :

Informatika Institut Teknologi Nasional Bandung

Alamat : Gedung 2 Jl. PHH. Mustofa 23 Bandung 40124

Kontak : Tel. 7272215 (ext. 181)  Fax. 7202892

Email : mind.journal@itenas.ac.id

____________________________________________________________

Statistik Pengunjung :

  Flag Counter

  Web
Analytics Statistik Pengunjung

 Jurnal ini terlisensi oleh Creative Commons Attribution-ShareAlike 4.0 International License.

Creative Commons License