Filter test data and split it into subsets

moohrash Member Posts: 2 Contributor I
edited March 2022 in Help
Hello,

I am new to RapidMiner. I would like to train a model using cross validation and test it later on subsets (such as female and male users, or users from different regions). How can I split the test set into subsets, and which operators should I use? What should I use to apply the model and measure its performance on those subsets?

Thanks,

Answers

  • kayman Member Posts: 662 Unicorn
    Use the 'Set Role' operator to do so. For instance, set the role of your user column to 'label', and it will automatically create 'subsets' using this column.
  • Telcontar120 RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,635 Unicorn
    You can use the Cross Validation operator once you have a label, and you will have different options for creating the subsets, such as sequential, randomized, or batch (where you specify the subsets manually via another attribute).

    Brian T.
    Lindon Ventures 
    Data Science Consulting from Certified RapidMiner Experts
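
For readers who want to see the underlying idea outside the RapidMiner GUI, here is a minimal Python sketch of what the question asks for: take a held-out test set, group it by an attribute such as gender or region, and compute accuracy per group. It is an illustration only, not a RapidMiner process. The rows, the column names (`gender`, `region`, `score`, `label`), the helper `accuracy_by_group`, and the threshold-based `predict` function standing in for a trained model are all hypothetical. In RapidMiner terms this roughly corresponds to filtering the test ExampleSet per attribute value before applying the model and measuring performance, though the thread itself does not spell that out.

```python
from collections import defaultdict


def accuracy_by_group(rows, predict, group_key, label_key="label"):
    """Return {group value: accuracy} for the rows of a held-out test set."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for row in rows:
        group = row[group_key]
        total[group] += 1
        if predict(row) == row[label_key]:
            correct[group] += 1
    return {group: correct[group] / total[group] for group in total}


# Hypothetical test set; in practice this is data the model never saw in training.
test_rows = [
    {"gender": "female", "region": "north", "score": 0.9, "label": 1},
    {"gender": "female", "region": "south", "score": 0.2, "label": 0},
    {"gender": "male", "region": "north", "score": 0.7, "label": 0},
    {"gender": "male", "region": "south", "score": 0.4, "label": 0},
]


def predict(row):
    # Stand-in for a trained model: predicts class 1 when score >= 0.5.
    return 1 if row["score"] >= 0.5 else 0


by_gender = accuracy_by_group(test_rows, predict, "gender")
by_region = accuracy_by_group(test_rows, predict, "region")
print(by_gender)
print(by_region)
```

The same helper works for any grouping attribute, so gender and region subsets do not need separate code paths. With very small subsets the per-group accuracy is noisy, so it is worth checking how many rows each group contains before comparing the numbers.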