Worth-Advance-1232

Worth-Advance-1232 OP t1_j47drjb wrote

For me this doesn’t really make all that much sense. Stacking is not a tool in itself, it’s rather a modelling approach. Also from what I can tell in the h2o docs it seems that their Stacked Ensemble has only one level to it and no meta-model. Thus rather than using e. g. the outputs (or probability distribution) of each model to train a new model, it will only use the output for any given input directly to return its final prediction, doesn’t it?

1

Worth-Advance-1232 t1_j2nvwv9 wrote

If I understood correctly, rather than actually predicting survival time your goal is to see whether certain parameters have influence on the survival time. Besides the other things already mentioned, one approach might be trying to predict survival time via a black box algorithm and trying to explain the decision of the algorithm. One paper that covers this topic would be this one.

3

Worth-Advance-1232 t1_j0gct0d wrote

Usually it is indeed the case that especially deep learning requires a lot of data. However also with little data neural networks might generate good results using the implementations provided by e. g. scikit learn. Also you might want to look into Boosting Algorithms (specifically XGBoost), as they are usually powerful on tabular data which is a common type of data to have in businesses.

1