Journal of Information and Communications Technology: Algorithms, Systems And Applications Cover
ISSN: 3107-8761

Journal of Information and Communications Technology: Algorithms, Systems And Applications

Dr. Eva Tuba
Editor-in-Chief
Dr. Eva Tuba

A single-blind peer-reviewed, quarterly, open-access journal committed to advancing cutting-edge research across the full spectrum of ICT.

Research Article* Open AccessCCBYNCPublished online: 15 December 2025

Advanced Feature Engineering for Residential Property Valuation: A Case Study on King County Housing Data

Aditi Nagayach, Atul Samadhiya

1 Data Science Institute, Frank J. Guarini School of Business, Saint Peters University, Jersey City, New Jersey, 07306, USA

2 Business Administration, Executive M.B.A. New England College, New Hampshire, 03242, USA

*Email: anagayach@saintpeters.edu

J. Inf. Commun. Technol. Algorithms Syst. Appl., 2025, 1(3), 25313 https://doi.org/10.64189/ict.25313

Received: 02 October 2025 | Revised: 12 December 2025 | Accepted: 13 December 2025

Cite article

A. Nagayach, A. Samadhiya, Advanced feature engineering for residential property valuation: a case study on King County housing data, Journal of Information and Communications Technology: Algorithms, Systems and Applications, 2025, 1(3), 25313, doi: . https://doi.org/10.64189/ict.25313

Rights and permissions

(c) The Author(s) 2025.

CC BY-NC 4.0

Open Access

This article is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License, which permits the non-commercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as appropriate credit is given and changes are indicated. https://creativecommons.org/licenses/by-nc/4.0/

Abstract

Accurate property valuation is critical for real estate markets, financial institutions, and urban planning. Traditional appraisal methods are time-intensive and subjective, while complex machine learning models often lack interpretability. This study addresses these challenges by developing an advanced linear regression framework that balances predictive accuracy with model transparency through systematic feature engineering. Utilizing the King County House Sales dataset comprising 21,613 transactions from May 2014 to May 2015, we developed 40 engineered features including interaction terms, polynomial features, ratio calculations, and location-based composites. After outlier removal using the interquartile range method, our dataset consisted of 20,467 properties with 55 total features. The optimized linear regression model achieved a test R² of 0.7198 with a normalized root mean square error (NRMSE) of 0.20 (20% of mean property value) and mean absolute error of 82,626. Feature importance analysis revealed that basement-to-living ratio, above-to-living ratio, and geographic coordinates were the most influential predictors. Cross-validation demonstrated model stability with a mean R² of 0.7316 (±0.0101). This research demonstrates that strategic feature engineering can significantly enhance linear regression performance for real estate valuation, achieving an average prediction error within 20% of property values while providing a transparent and interpretable alternative to complex machine learning algorithms.

Graphical Abstract

Advanced Feature Engineering for Residential Property Valuation: A Case Study on King County Housing Data graphical abstract

Novelty Statement

This research introduces a comprehensive feature engineering framework that substantially enhances linear regression performance for residential property valuation.