Publication:
Confidence intervals for the random forest generalization error

Organizational Units

Abstract

We show that the byproducts of the standard training process of a random forest yield not only the well known and almost computationally free out-of-bag point estimate of the model generalization error, but also open a direct path to compute confidence intervals for the generalization error which avoids processes of data splitting and model retraining. Besides the low computational cost involved in their construction, these confidence intervals are shown through simulations to have good coverage and appropriate shrinking rate of their width in terms of the training sample size.

Keywords

Random forests; Generalization error; Out-of-bag estimation; Confidence interval; Bootstrapping
item.page.isbound

Journal Title

Pattern Recognition Letters
item.page.doi

Book's title

item.page.scopusurl

Main language

Inglês

Notes

Examination board

Subject Area - CNPq Classification

CIENCIAS SOCIAIS APLICADAS

Citation

Endorsement

Review

Supplemented By

Referenced By