Numerous-Carrot3910
Numerous-Carrot3910 t1_j5ka168 wrote
Reply to comment by trnka in [D] Simple Questions Thread by AutoModerator
Thanks for your response! Even after retaining only the top K values of each feature, there are still a large number of features to consider. I haven’t tried the hashing trick yet, so I will look into that.
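For anyone else reading, here is a rough sketch of what the hashing trick mentioned above could look like, using only the standard library. The function name `hash_features`, the example columns, and the bucket count are all made up for illustration. The idea is that each "column=value" pair is hashed into one of a fixed number of buckets, so the output width no longer depends on how many categories exist.

```python
import hashlib


def hash_features(row, n_buckets=16):
    """Map a dict of {column: category} to a fixed-length count vector.

    Hypothetical helper for illustration; n_buckets is an arbitrary choice.
    """
    vec = [0] * n_buckets
    for column, value in row.items():
        # Include the column name so the same value in two columns
        # is treated as two different tokens.
        token = f"{column}={value}".encode("utf-8")
        # Use a stable hash: Python's built-in hash() for strings is
        # salted per process, so it would not be reproducible across runs.
        digest = hashlib.md5(token).digest()
        index = int.from_bytes(digest[:8], "big") % n_buckets
        vec[index] += 1
    return vec


# Made-up example row with three categorical columns.
row = {"city": "Lisbon", "browser": "Firefox", "plan": "pro"}
vec = hash_features(row, n_buckets=16)
print(vec)
```

The trade-off is that different categories can collide in the same bucket, so fewer buckets means a smaller dataset but more collisions. If scikit-learn is available, its `FeatureHasher` class implements the same idea and is probably the better choice in practice.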
Numerous-Carrot3910 t1_j5jhhkg wrote
Reply to [D] Simple Questions Thread by AutoModerator
Hi, I’m trying to build a model with a large number of categorical predictor variables, each of which has a large number of categories. One-hot encoding (OHE) leads to a higher-dimensional dataset than I want to work with. Does anyone have advice for dealing with this other than using subject matter expertise or iteration to perform feature selection? Thanks!
Numerous-Carrot3910 t1_j5kuwul wrote
Reply to comment by trnka in [D] Simple Questions Thread by AutoModerator
Yes, I was referring to recursive feature elimination. Thanks for the recommendations.