Predicting Mortgage Loan Defaults Using Machine Learning Techniques

Danylo Krasovytskyi; Andriy Stavytskyy

doi:10.15388/Ekon.2024.103.2.8

Articles

Danylo Krasovytskyi

Taras Shevchenko National University of Kyiv

https://orcid.org/0009-0008-7017-0175

Andriy Stavytskyy

Taras Shevchenko National University of Kyiv

https://orcid.org/0000-0002-5645-6758

Published 2024-07-16

https://doi.org/10.15388/Ekon.2024.103.2.8

PDF

HTML

Keywords

machine learning
classification
default prediction
mortgage lending
random forest
extreme gradient-boosting decision tree

How to Cite

Krasovytskyi, D. and Stavytskyy, A. (2024) “Predicting Mortgage Loan Defaults Using Machine Learning Techniques”, Ekonomika, 103(2), pp. 140–160. doi:10.15388/Ekon.2024.103.2.8.

Download Citation

Abstract

Mortgage default prediction is always on the table for financial institutions. Banks are interested in provision planning, while regulators monitor systemic risk, which this sector may possess. This research is focused on predicting defaults on a one-year horizon using data from the Ukrainian credit registry applying machine-learning methods. This research is useful for not only academia but also policymakers since it helps to assess the need for implementation of macroprudential instruments. We tested two data balancing techniques: weighting the original sample and synthetic minority oversampling technique and compared the results. It was found that random forest and extreme gradient-boosting decision trees are better classifiers regarding both accuracy and precision. These models provided an essential balance between actual default precision and minimizing false defaults. We also tested neural networks, linear discriminant analysis, support vector machines with linear kernels, and decision trees, but they showed similar results to logistic regression. The result suggested that real gross domestic product (GDP) growth and debt-service-to-income ratio (DSTI) were good predictors of default. This means that a realistic GDP forecast as well as a proper assessment of the borrower’s DSTI through the loan history can predict default on a one-year horizon. Adding other variables such as the borrower’s age and loan interest rate can also be beneficial. However, the residual maturity of mortgage loans does not contribute to default probability, which means that banks should treat both new borrowers equally and those who nearly repaid the loan.

PDF

HTML

This work is licensed under a Creative Commons Attribution 4.0 International License.

Downloads

Download data is not yet available.

Most read articles by the same author(s)

Andriy Stavytskyy, Oleksandra Prokopenko, Investments in Agricultural Machinery and its Efficiency in Ukraine , Ekonomika: Vol. 96 No. 1 (2017): Ekonomika
Erstida Ulvidienė, Irma Meškauskaitė, Andriy Stavytskyy, Vincentas Rolandas Giedraitis, An Investigation of the Influence of Economic Growth on Taxes in Lithuania , Ekonomika: Vol. 102 No. 1 (2023): Ekonomika
Andriy Stavytskyy, Vincent Giedraitis, Darius Sakalauskas, Maik Huettinger, Economic Crises and Emission of Pollutants: a Historical Review of Select Economies amid Two Economic Recessions , Ekonomika: Vol. 95 No. 1 (2016): Ekonomika
Andriy Stavytskyy, Daria Martynovych, THE ECONOMETRIC MODELING OF UKRAINIAN MACROECONOMIC TENDENCIES , Ekonomika: Vol. 91 No. 1 (2012): Ekonomika
Nataliia Versal, Andriy Stavytskyy, Financial Dollarization: Trojan Horse for Ukraine? , Ekonomika: Vol. 94 No. 3 (2015): Ekonomika