Random Forest: Out of bag error

01-19-2017 10:21 AM
DanaPeterson1
New Contributor II

I am running the Random Forest Classifier in ArcGIS 10.4. How do you assess the error of the tree ensemble, which is typically done using the out-of-bag (OOB) estimate? I opened the .ecd file and I see "CrossValidateRate" with a value. What does this rate refer to, and is it equivalent to the OOB estimate? Are bootstrap samples used to derive "CrossValidateRate"?

According to Breiman:  

"In random forests, there is no need for cross-validation or a separate test set to get an unbiased estimate of the test set error. It is estimated internally, during the run, as follows:

Each tree is constructed using a different bootstrap sample from the original data. About one-third of the cases are left out of the bootstrap sample and not used in the construction of the kth tree".
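
For anyone wondering where the "about one-third" in that quote comes from: when n cases are drawn with replacement, the chance that a given case is never drawn is (1 - 1/n)^n, which approaches 1/e (roughly 0.368) as n grows. Below is a minimal sketch in plain Python that checks this by simulation. It is only an illustration of bootstrap sampling, not ArcGIS code, and it says nothing about how "CrossValidateRate" is computed; the function name `oob_fraction` and its parameters are made up for this example.

```python
import math
import random


def oob_fraction(n_cases, n_trees, seed=0):
    """Average share of cases left out of a bootstrap sample.

    Hypothetical helper for illustration only; not part of any library.
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_trees):
        # Bootstrap sample: draw n_cases indices with replacement.
        in_bag = {rng.randrange(n_cases) for _ in range(n_cases)}
        # Cases that were never drawn are "out of bag" for this tree.
        total += (n_cases - len(in_bag)) / n_cases
    return total / n_trees


if __name__ == "__main__":
    print(f"simulated OOB share:     {oob_fraction(1000, 200):.3f}")
    print(f"theoretical limit (1/e): {math.exp(-1):.3f}")
```

In an actual OOB error estimate, each case would then be classified using only the trees for which it was out of bag, and the share of misclassified cases would be reported as the error.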

1 Reply
JohnGallop
New Contributor II

I have the same question, six years later, and it doesn't look like yours was ever answered. Do you know the answer?

 
