Feature engineering for image data #17

Open
gustavomodelli opened this issue Dec 2, 2020 · 4 comments

@gustavomodelli

Image Classification with Tidymodels

Suggested recipe step: step_image

Is it possible to create a recipe step that does image processing?

Is it possible to process images in a recipe, for example with something like the array reshaping done in Keras?

tidymodels is an excellent set of packages, but it currently has no options for image processing.

@topepo
Member

topepo commented Dec 2, 2020

It's a great idea and would really improve the modeling environment.

I think that it would require the images to be stored within the recipe in a different format that is more conducive to how the data are usually handled. This could get a little tricky but @EmilHvitfeldt's textrecipes does something very similar for text and it works very well.

If you are interested in starting a side package to do this, let us know. Otherwise, this might be lower down the line in priority unless @dfalbel or @skeydan are interested. They have a better handle on processing tools for those data.

@EmilHvitfeldt
Member

Sounds like an interesting idea. I haven't done a lot of work with images myself. The main issue you might run into is that recipes only allows output to be 2-dimensional, and I don't know if that will be troublesome.
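To make the 2-dimensional constraint concrete, here is a minimal sketch of one common workaround: flattening each image into a single row of pixel values, so a set of same-shaped images becomes a rectangular table. It is written in plain Python purely for illustration. `flatten_image` and `images_to_table` are hypothetical helper names, not part of recipes or any tidymodels package, and the pixel values are made up.

```python
# Hypothetical helpers for illustration only; not part of recipes or tidymodels.

def flatten_image(image):
    """Flatten a height x width x channels nested list into one flat row (row-major)."""
    return [value for row in image for pixel in row for value in pixel]


def images_to_table(images):
    """Turn same-shaped images into a 2-D table: one row per image, one column per pixel value."""
    rows = [flatten_image(image) for image in images]
    # A rectangular table needs every row to have the same number of columns.
    if len({len(row) for row in rows}) > 1:
        raise ValueError("all images must have the same shape to form a rectangular table")
    return rows


# Two tiny 2 x 2 RGB images (made-up values).
red = [[[255, 0, 0], [255, 0, 0]], [[255, 0, 0], [255, 0, 0]]]
blue = [[[0, 0, 255], [0, 0, 255]], [[0, 0, 255], [0, 0, 255]]]

table = images_to_table([red, blue])
print(len(table), len(table[0]))  # 2 rows, 12 columns (2 * 2 * 3)
```

Flattening keeps the output rectangular but discards the spatial layout, which is one reason the constraint could be troublesome for image models that expect height x width x channels input.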

If you need to talk to someone who has spent some time handling non-rectangular data inside a recipe, then I'm happy to talk.

@dfalbel

dfalbel commented Dec 3, 2020

In theory it would be possible to use {tfhub} for that.

We have implemented steps for text embedding here: https://github.com/rstudio/tfhub/blob/master/R/recipe.R and we could have similar functionality for images.

I imagine that images would need to be stored in a list-column where each element is an array.
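As a rough, language-agnostic sketch of the list-column idea: the table stays rectangular (one cell per row), but each cell in the image column holds a whole array, and the arrays may differ in shape. The example below uses plain Python structures as a stand-in for a data frame. `blank_image`, `array_shape`, and the `images_table` layout are hypothetical names chosen for illustration, not an existing API.

```python
# Hypothetical stand-in for a data frame with a list-column; illustration only.

def blank_image(height, width, channels=3):
    """Build a height x width x channels nested list filled with zeros."""
    return [[[0 for _ in range(channels)] for _ in range(width)] for _ in range(height)]


def array_shape(array):
    """Return the dimensions of a regular nested list, e.g. (height, width, channels)."""
    dims = []
    while isinstance(array, list):
        dims.append(len(array))
        array = array[0] if array else None
    return tuple(dims)


# Column-oriented table: "image" is the list-column, one array per row.
images_table = {
    "id": ["a", "b"],
    "image": [blank_image(2, 2), blank_image(4, 3)],
}

shapes = [array_shape(image) for image in images_table["image"]]
print(shapes)  # [(2, 2, 3), (4, 3, 3)]
```

Storing one array per cell keeps the full image structure available to later steps, at the cost of a column that ordinary 2-dimensional tooling cannot use directly.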

@juliasilge
Member

juliasilge commented Apr 27, 2021

I'm moving this to our planning repo since this is more about long term ideas (and probably a separate package) than a feature request here in recipes. 👍

@juliasilge juliasilge transferred this issue from tidymodels/recipes Apr 27, 2021
@juliasilge juliasilge changed the title from "step_image" to "Feature engineering for image data" Apr 27, 2021