Someone from H2O told me they are using Tree SHAP already in some of their stuff, but I don't use their libraries right now so I don't know where. Is there a specific scenario you want to use SHAP?
They do have shapley values, but only on their enterprise platform. The open version, H2O, does not.
I am currently using H2O because of their automl function. It makes model generation/selection pretty painless. They also handle categorical variables very cleanly. In all, its great for getting models that are ready to predict.
H2O doesn't have much in the way of feature attribution though. SHAP seems to be the king of the hill right now and I would prefer to run SHAP against the actual model and not a sci-kit based model with the same parameters. That way I can dig into odd predictions with certainty that the outliers are directly related to the model from h2o.
Ah, I see. If someone sends in a PR then I am happy to merge it, but I am not going to actively try and reproduce what seems like one of their enterprise features.
Would really appreciate support for h2o.ai models as well!
In case someone missed it:
As of 3.24.0.1, the free/open H2O supports TreeSHAP (GBM and XGBoost) -- see the blog post on their site.
Many thanks to all who worked on this, certainly makes life much easier (and easier to understand).
We also added support for random forest recently - thanks to @navdeep-G.
Most helpful comment
In case someone missed it:
As of 3.24.0.1, the free/open H2O supports TreeSHAP (GBM and XGBoost) -- see the blog post on their site.
Many thanks to all who worked on this, certainly makes life much easier (and easier to understand).