Augmentation Randomization/Multiply

Sunnyboy_nh Member Posts: 10 Learner I
Any ideas on how I can do a randomization with Multiply, or any other way to augment my dataset, which has only 170 rows?

The reason is that I need to do a split-test validation and my dataset is not big enough for that purpose!

Answers

  • Telcontar120 RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,635 Unicorn
    I would recommend exploring weighting as a solution to imbalanced, small datasets. The other alternative is to use one of the upsampling operators from one of the free extensions.
    Brian T.
    Lindon Ventures 
    Data Science Consulting from Certified RapidMiner Experts
  • MartinLiebig Administrator, Moderator, Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,533 RM Data Scientist
    Another way is to use the Build Simulation operator, which is part of Operator Toolbox.

    Cheers,
    Martin
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
  • Sunnyboy_nh Member Posts: 10 Learner I
    Thanks, Martin and Telcontar120, for your feedback and suggestions. Meanwhile, I have found a similar operator in RapidMiner called Sample (Bootstrapping). Placed before the Split Data operator, it augments the data by copying existing rows.
    Nevertheless I will try to check your suggestions as well :) 
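
For readers outside RapidMiner, the bootstrapping idea mentioned above can be sketched in plain Python. This is only an illustration, not how Sample (Bootstrapping) or Split Data work internally. The function name `split_then_bootstrap` and its parameters (`test_fraction`, `target_size`, `seed`) are hypothetical. The sketch holds out the test rows first and resamples only the training rows, on the assumption that you want to avoid copies of the same row landing in both partitions, which can happen when the resampling step runs before the split.

```python
import random


def split_then_bootstrap(rows, test_fraction=0.3, target_size=500, seed=42):
    """Hold out a test set first, then bootstrap only the training rows.

    All names and defaults here are illustrative assumptions.
    """
    rng = random.Random(seed)

    # Shuffle a copy so the caller's list is left untouched.
    shuffled = list(rows)
    rng.shuffle(shuffled)

    # Reserve the test rows before any resampling happens.
    n_test = round(len(shuffled) * test_fraction)
    test = shuffled[:n_test]
    train = shuffled[n_test:]

    # Bootstrap: draw target_size rows from train WITH replacement,
    # so individual training rows may appear several times.
    boot_train = rng.choices(train, k=target_size)
    return boot_train, test


# Stand-in for a 170-row dataset: each integer represents one row.
rows = list(range(170))
boot_train, test = split_then_bootstrap(rows)
```

Note that the copies add no new information. They only make the training partition larger, so the held-out test rows remain the only honest measure of performance.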