Research Article

Price Prediction Using Web Scraping and Machine Learning Algorithms in the Used Car Market

Volume: 6 Number: 2 August 31, 2023

Abstract

Advances in technology increase data traffic and data volume day by day, which makes collecting and interpreting data increasingly important. This study aims to analyze car sales data collected with web scraping techniques using machine learning algorithms and to build a price estimation model. The data were collected using Selenium and BeautifulSoup and prepared for analysis through several data preprocessing steps. Lasso regression and principal component analysis (PCA) were used for feature selection and dimensionality reduction, and the GridSearchCV method was used for hyperparameter tuning. Random Forest, K-Nearest Neighbor, Gradient Boost, AdaBoost, Support Vector and XGBoost regression algorithms were used in the analysis. The results were evaluated using Mean Square Error (MSE), Root Mean Square Error (RMSE) and the Coefficient of Determination (R-square, R2). For data set 1, the best-performing model was XGBoost Regression, with an R2 of 0.973, an MSE of 0.026 and an RMSE of 0.161. For data set 2, the best-performing model was K-Nearest Neighbor Regression, with an R2 of 0.978, an MSE of 0.021 and an RMSE of 0.145.
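
The three evaluation metrics named in the abstract can be illustrated with a minimal sketch. It uses only the Python standard library and the textbook definitions of MSE, RMSE and R2. The function names and the toy values are hypothetical and are not taken from the study. The last lines only check that the reported MSE and RMSE pairs are mutually consistent, since RMSE is the square root of MSE.

```python
import math


def mse(y_true, y_pred):
    """Mean squared error: the average of the squared residuals."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean squared error: the square root of the MSE."""
    return math.sqrt(mse(y_true, y_pred))


def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Hypothetical toy targets and predictions (illustration only, not study data).
y_true = [1.0, 2.0, 3.0, 4.0]
y_pred = [1.1, 1.9, 3.2, 3.8]

print(f"MSE  = {mse(y_true, y_pred):.3f}")
print(f"RMSE = {rmse(y_true, y_pred):.3f}")
print(f"R2   = {r2(y_true, y_pred):.3f}")

# Consistency check on the values reported in the abstract: RMSE = sqrt(MSE).
reported = {"data set 1": (0.026, 0.161), "data set 2": (0.021, 0.145)}
for name, (reported_mse, reported_rmse) in reported.items():
    consistent = round(math.sqrt(reported_mse), 3) == reported_rmse
    print(name, "consistent:", consistent)
```

A lower MSE or RMSE and an R2 closer to 1 indicate a better fit, which is how the abstract ranks the models on each data set.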

Details

Primary Language

English

Subjects

Software Engineering (Other)

Journal Section

Research Article

Early Pub Date

August 30, 2023

Publication Date

August 31, 2023

Submission Date

June 2, 2023

Acceptance Date

August 28, 2023

Published in Issue

Year 2023 Volume: 6 Number: 2

APA
Yılmaz, S., & Selvi, İ. H. (2023). Price Prediction Using Web Scraping and Machine Learning Algorithms in the Used Car Market. Sakarya University Journal of Computer and Information Sciences, 6(2), 140-148. https://doi.org/10.35377/saucis...1309103
AMA
1. Yılmaz S, Selvi İH. Price Prediction Using Web Scraping and Machine Learning Algorithms in the Used Car Market. SAUCIS. 2023;6(2):140-148. doi:10.35377/saucis.1309103
Chicago
Yılmaz, Seda, and İhsan Hakan Selvi. 2023. “Price Prediction Using Web Scraping and Machine Learning Algorithms in the Used Car Market”. Sakarya University Journal of Computer and Information Sciences 6 (2): 140-48. https://doi.org/10.35377/saucis. 1309103.
EndNote
Yılmaz S, Selvi İH (August 1, 2023) Price Prediction Using Web Scraping and Machine Learning Algorithms in the Used Car Market. Sakarya University Journal of Computer and Information Sciences 6 2 140–148.
IEEE
[1] S. Yılmaz and İ. H. Selvi, “Price Prediction Using Web Scraping and Machine Learning Algorithms in the Used Car Market”, SAUCIS, vol. 6, no. 2, pp. 140–148, Aug. 2023, doi: 10.35377/saucis...1309103.
ISNAD
Yılmaz, Seda - Selvi, İhsan Hakan. “Price Prediction Using Web Scraping and Machine Learning Algorithms in the Used Car Market”. Sakarya University Journal of Computer and Information Sciences 6/2 (August 1, 2023): 140-148. https://doi.org/10.35377/saucis. 1309103.
JAMA
1. Yılmaz S, Selvi İH. Price Prediction Using Web Scraping and Machine Learning Algorithms in the Used Car Market. SAUCIS. 2023;6:140–148.
MLA
Yılmaz, Seda, and İhsan Hakan Selvi. “Price Prediction Using Web Scraping and Machine Learning Algorithms in the Used Car Market”. Sakarya University Journal of Computer and Information Sciences, vol. 6, no. 2, Aug. 2023, pp. 140-8, doi:10.35377/saucis. 1309103.
Vancouver
1. Seda Yılmaz, İhsan Hakan Selvi. Price Prediction Using Web Scraping and Machine Learning Algorithms in the Used Car Market. SAUCIS. 2023 Aug. 1;6(2):140-8. doi:10.35377/saucis. 1309103

The papers in this journal are licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.