
Journal of Information and Communications Technology: Algorithms, Systems And Applications

A single-blind peer-reviewed, quarterly, open-access journal committed to advancing cutting-edge research across the full spectrum of ICT.
Advanced Feature Engineering for Residential Property Valuation: A Case Study on King County Housing Data
J. Inf. Commun. Technol. Algorithms Syst. Appl., 2025, 1(3), 25313 https://doi.org/10.64189/ict.25313
Received: 02 October 2025 | Revised: 12 December 2025 | Accepted: 13 December 2025
Cite article
A. Nagayach, A. Samadhiya, Advanced feature engineering for residential property valuation: a case study on King County housing data, Journal of Information and Communications Technology: Algorithms, Systems and Applications, 2025, 1(3), 25313, doi: . https://doi.org/10.64189/ict.25313
Rights and permissions
(c) The Author(s) 2025.

Open Access
This article is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License, which permits the non-commercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as appropriate credit is given and changes are indicated. https://creativecommons.org/licenses/by-nc/4.0/
Abstract
Accurate property valuation is critical for real estate markets, financial institutions, and urban planning. Traditional appraisal methods are time-intensive and subjective, while complex machine learning models often lack interpretability. This study addresses these challenges by developing an advanced linear regression framework that balances predictive accuracy with model transparency through systematic feature engineering. Utilizing the King County House Sales dataset comprising 21,613 transactions from May 2014 to May 2015, we developed 40 engineered features including interaction terms, polynomial features, ratio calculations, and location-based composites. After outlier removal using the interquartile range method, our dataset consisted of 20,467 properties with 55 total features. The optimized linear regression model achieved a test R² of 0.7198 with a normalized root mean square error (NRMSE) of 0.20 (20% of mean property value) and mean absolute error of 82,626. Feature importance analysis revealed that basement-to-living ratio, above-to-living ratio, and geographic coordinates were the most influential predictors. Cross-validation demonstrated model stability with a mean R² of 0.7316 (±0.0101). This research demonstrates that strategic feature engineering can significantly enhance linear regression performance for real estate valuation, achieving an average prediction error within 20% of property values while providing a transparent and interpretable alternative to complex machine learning algorithms.
Graphical Abstract

Novelty Statement
This research introduces a comprehensive feature engineering framework that substantially enhances linear regression performance for residential property valuation.

