Improved estimation methods of the Kolmogorov-Smirnov statistic, weight of evidence and information value indicators in the credit scoring

Authors

  • O. M. Soloshenko (Oleksandr Mykolaiovych Soloshenko), postgraduate student, National Technical University of Ukraine "KPI", Kyiv, Ukraine

Abstract

Improved methods are developed for estimating the Kolmogorov-Smirnov statistic and the Weight of Evidence and Information Value indicators without explicitly splitting the original sample into two subsets. The corresponding formulas support analysis of the predictive (forecasting) power of categorical variables in credit scoring and in other practical applications of binary classification methods. The classical formulas for the Kolmogorov-Smirnov statistic, Weight of Evidence and Information Value are generalized by transforming the aggregate expressions for discrete distributions and cumulative distribution functions using the inner product of two vectors, projection operators, and a conditional substitution operator. The improved estimation formulas are proposed and described in general form in terms of the discrete unconditional distribution of the input variable and the conditional distribution of the binary target variable.
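The idea of working from the unconditional distribution of the input variable and the conditional distribution of the target, rather than from two explicitly separated subsamples, can be illustrated with a minimal sketch. This is not the author's set of formulas, which this page does not reproduce. It is a plain Bayes-rule construction under stated assumptions: the function name `ks_woe_iv`, the argument names, the coding of the target (1 = event, for example default), the sign convention WoE = ln(non-event share / event share), and the example numbers are all hypothetical.

```python
import math
from itertools import accumulate


def ks_woe_iv(p_x, event_rate):
    """Return (KS, WoE per category, IV) for a categorical input variable.

    p_x[i]        -- P(X = x_i), unconditional distribution of the input
    event_rate[i] -- P(Y = 1 | X = x_i), conditional distribution of the target
    Categories are assumed to be listed in the order used for the cumulative
    distribution functions. Every event rate must lie strictly between 0 and 1,
    otherwise a logarithm or a division below is undefined.
    """
    # Inner products of the two vectors give the overall class probabilities.
    p_event = sum(p * r for p, r in zip(p_x, event_rate))
    p_nonevent = sum(p * (1.0 - r) for p, r in zip(p_x, event_rate))

    # Bayes' rule: class-conditional distributions of X, with no sample split.
    dist_event = [p * r / p_event for p, r in zip(p_x, event_rate)]
    dist_nonevent = [p * (1.0 - r) / p_nonevent for p, r in zip(p_x, event_rate)]

    # Weight of Evidence per category (assumed convention: non-event over event).
    woe = [math.log(g / b) for g, b in zip(dist_nonevent, dist_event)]

    # Information Value: WoE weighted by the difference of the two distributions.
    iv = sum((g - b) * w for g, b, w in zip(dist_nonevent, dist_event, woe))

    # KS statistic: largest gap between the two cumulative distributions.
    ks = max(
        abs(g - b)
        for g, b in zip(accumulate(dist_nonevent), accumulate(dist_event))
    )
    return ks, woe, iv


if __name__ == "__main__":
    # Hypothetical three-category variable.
    ks, woe, iv = ks_woe_iv([0.5, 0.3, 0.2], [0.05, 0.10, 0.30])
    print(f"KS = {ks:.4f}, IV = {iv:.4f}")
    print("WoE =", [round(w, 4) for w in woe])
```

In practice `p_x` and `event_rate` would be estimated from category frequencies and per-category event frequencies, so the same three quantities follow from a single pass over the unsplit sample.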

Author Biography

O. M. Soloshenko (Oleksandr Mykolaiovych Soloshenko), postgraduate student, National Technical University of Ukraine "KPI", Kyiv, Ukraine

Published

2015-12-15

Section

Mathematical methods, models, problems and technologies for complex systems research