Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Automl on data with nulls #16402

Open
matt7salomon opened this issue Sep 27, 2024 · 2 comments
Open

Automl on data with nulls #16402

matt7salomon opened this issue Sep 27, 2024 · 2 comments
Labels

Comments

@matt7salomon
Copy link

I am using Automl to only train Deeplearning algorithms on a dataset which is sparse and has a lot of nulls. I preferably shouldnt impute this data because nulls really have a meaning as the customer had not interacted with that feature. Automl results are terrible on this dataset. It doesnt even predict in ballpark of the output. Any suggestions?

@tomasfryda
Copy link
Contributor

I would suggest trying other algorithms, namely the tree-based ones since they can handle missing values as separate category (e.g. GBM, https://docs.h2o.ai/h2o/latest-stable/h2o-docs/data-science/gbm-faq/missing_values.html). You can also keep AutoML unrestricted to find out which model type suits those data the best.

@matt7salomon
Copy link
Author

We already have a non h2o lightgbm model in place which works just fine and not perfect. We wanted to try deep learning.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants