Uncertainty random forest regression

LinaTeck · April 3, 2022, 11:49pm

Hi - I am using the tensorflow random forest regression and I’m wondering whether there is any way to get an uncertainty estimate out of it? I know I can get RMSE outputs but I guess typically uncertainty is expressed in the band of the standard deviation or the confidence interval so I was wondering whether there is any way to get those?

Thanks a lot!

Mathieu · April 4, 2022, 9:41am

Hi Lina,

Short answer, not yet directly. However, the following options are available:

The NGBoost paper presents a method to obtain the uncertainty of a prediction. For example, the method can be used to get the uncertainty as a standard deviation. NGBoost in TF-DF is implemented as we speak, and it should be made available soon.
If your features are compatible, an alternative solution is to use the NGBoost’s author implementation compatible with Scikit learn, and to use the Scikit-learn to TF-DF converter.
Finally, a less efficient but likely correct solution is to train a second model to predict the error of the first one. The main drawback of this solution is that you need more training examples as the two models (the one predicting the value and the one predicting the error) should be trained on different datasets.
The sample efficiently of solution #3 can be improved by running an ensemble scheme: For each ensemble round, randomly split the training dataset in two parts, and use each part to train one of the two models.

Topic		Replies	Views
How to get standard deviation (or other measure of confidence) from tfdf.RandomForestModel regression? General Discussion tfdf , random_forests , help_request	3	1928	July 30, 2021
Tfdf: R2 value, and SHAP value plot? General Discussion tfdf , help_request	2	1432	July 6, 2021
Staged_predict for Boosted Tree classifier #51834 General Discussion tfdf , help_request	0	857	September 5, 2021
Decision Forest module yanked General Discussion decision_forests , tfdf , random_forests	21	4626	March 13, 2023
A new notebook: Structured Data Classification using TensorFlow Decision Forests Show and Tell models , keras , decision_forests , tensorboard	7	3093	October 11, 2021

Uncertainty random forest regression

Related topics