The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

Loading each csv file related to images into Deep learning during cross validation

varunm1varunm1 Member Posts: 1,207 Unicorn
edited January 2019 in Help
Hi

I have dataset with 4700 images with dimensions 2000*102. All these images are converted to seperate CSV files (pixel values). I think I can't load all these csv files at a time as this will crash my RAM (32GB). Is it possible for RM to access each image(.csv) file during training and testing from storage location rather than placing it on RAM. This is to train and test CNN in deep learning extension.

@mschmitz @hughesfleming68

Thanks,
Varun
Regards,
Varun
https://www.varunmandalapu.com/

Be Safe. Follow precautions and Maintain Social Distancing

Answers

  • hughesfleming68hughesfleming68 Member Posts: 323 Unicorn
    Hi Varun, interesting question. I have read about progressive loading of image data with Keras with flow from dataframe and flow from directory but I am not sure how that might work in Rapidminer. I have never tried personally. I am wondering if you could store the image data in a database.
  • varunm1varunm1 Member Posts: 1,207 Unicorn
    Hi @hughesfleming68

    Thanks for responding, yes that is what I was wondering as we can load chunks of data from directory using keras in python and it will be useful in RM as well if we have that option (not sure if its there). As sometimes rescaling and downsampling are not good option due to huge loss of data. I will load into database in the form of multiple tables, but I am not aware if it takes table by table when I apply the algorithms.

    Thanks
    Varun
    Regards,
    Varun
    https://www.varunmandalapu.com/

    Be Safe. Follow precautions and Maintain Social Distancing

  • Telcontar120Telcontar120 RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,635 Unicorn
    Seems like a another use case of the old "stream database" operator which is unfortunately now deprecated.  I believe @land has been working on a new streaming extension, but I am not sure whether it is adapted for this particular use case.  
    Brian T.
    Lindon Ventures 
    Data Science Consulting from Certified RapidMiner Experts
Sign In or Register to comment.