Generates comprehensive summary statistics for optimal binning results, including predictive power classification based on established IV thresholds (Siddiqi, 2006), aggregate metrics, and feature-level diagnostics.
Usage
# S3 method for class 'obwoe'
summary(object, sort_by = "iv", decreasing = TRUE, ...)Value
An S3 object of class "summary.obwoe" containing:
feature_summaryData frame with per-feature statistics including IV classification (Unpredictive/Weak/Medium/Strong/Suspicious)
aggregateNamed list of aggregate statistics:
n_featuresTotal features processed
n_successfulFeatures without errors
n_errorsFeatures with errors
total_iv_sumSum of all feature IVs
mean_ivMean IV across features
median_ivMedian IV across features
mean_binsMean number of bins
iv_rangeMin and max IV values
iv_distributionTable of IV classification counts
targetTarget column name
target_typeTarget type (binary/multinomial)
Details
IV Classification Thresholds
Following Siddiqi (2006), features are classified by predictive power:
| Classification | IV Range |
| Unpredictive | < 0.02 |
| Weak | 0.02 - 0.10 |
| Medium | 0.10 - 0.30 |
| Strong | 0.30 - 0.50 |
| Suspicious | > 0.50 |
Features with IV > 0.50 should be examined for data leakage or overfitting, as such high values are rarely observed in practice.
References
Siddiqi, N. (2006). Credit Risk Scorecards: Developing and Implementing Intelligent Credit Scoring. John Wiley & Sons. doi:10.1002/9781119201731
See also
obwoe for the main binning function,
print.obwoe, plot.obwoe.
Examples
# \donttest{
set.seed(42)
df <- data.frame(
x1 = rnorm(500), x2 = rnorm(500), x3 = rnorm(500),
target = rbinom(500, 1, 0.2)
)
result <- obwoe(df, target = "target")
summary(result)
#> Summary: Optimal Binning Weight of Evidence
#> ============================================
#>
#> Target: target ( binary )
#>
#> Aggregate Statistics:
#> Features: 3 total, 3 successful, 0 errors
#> Total IV: 0.0930
#> Mean IV: 0.0310 (SD: 0.0423)
#> Median IV: 0.0109
#> IV Range: [0.0025, 0.0796]
#> Mean Bins: 5.0
#>
#> IV Classification (Siddiqi, 2006):
#> Unpredictive: 2 features
#> Weak : 1 features
#>
#> Feature Details:
#> feature type n_bins total_iv iv_class
#> x3 numerical 5 0.079623601 Weak
#> x1 numerical 5 0.010895822 Unpredictive
#> x2 numerical 5 0.002463832 Unpredictive
# }
