
I want to duplicate data in RapidMiner

Adiletkgz Member Posts: 5 Learner I
Could you help me find an operator that duplicates rows carrying a specific label, so that the number of examples with that label increases?

Answers

  • lionelderkrikor RapidMiner Certified Analyst, Member Posts: 1,195 Unicorn
    Hi @Adiletkgz,

    I am having difficulty understanding. Can you provide an example of what you have and what you want to obtain?

    Regards,

    Lionel
  • varunm1 Member Posts: 1,207 Unicorn
    Hi @Adiletkgz

    Are you asking how to make copies of the same data to feed into different operators? If so, you can use the Multiply operator. It gives you as many copies of the same data as you need.

    Thanks,
    Varun
    https://www.varunmandalapu.com/

    Be Safe. Follow precautions and Maintain Social Distancing

  • Adiletkgz Member Posts: 5 Learner I
    Hello friends!
    I have client data in which each client is labeled either "Bad client" or "Good client". My problem is that I have a lot of "Good clients" and very few "Bad clients". I want to make copies of the "Bad clients" in order to increase prediction accuracy. I plan to create a credit scoring model by identifying predictors of "Bad clients".
    Here is an example of my data:
    att1   att2   att3   att4
    M      No     Yes   Good client
    M      Yes     Yes   Good client
    M      Yes     No   Good client
    M      Yes     No   Good client
    M      No     No   Good client
    M      No     Yes   Good client
    M      Yes     No   Bad client

  • Adiletkgz Member Posts: 5 Learner I
    I want to go from this data, where I have very few "Bad clients":
    att1   att2   att3   att4
    M      No     Yes   Good client
    M      Yes     Yes   Good client
    M      Yes     No   Good client
    M      Yes     No   Good client
    M      No     No   Good client
    M      No     Yes   Good client
    M      Yes     No   Bad client

    to this data, where the "Bad clients" are duplicated:
    att1   att2   att3   att4
    M      No     Yes   Good client
    M      Yes     Yes   Good client
    M      Yes     No   Good client
    M      Yes     No   Good client
    M      No     No   Good client
    M      No     Yes   Good client
    M      Yes     No   Bad client
    M      Yes     No   Bad client
    M      Yes     No   Bad client
    M      Yes     No   Bad client
    M      Yes     No   Bad client

    As a result, I will have approximately equal numbers of "Bad clients" and "Good clients".
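The duplication described above is commonly called random oversampling of the minority class. As a rough illustration outside RapidMiner, here is a minimal Python sketch of the idea. The function name `oversample_minority` and its parameters are hypothetical, chosen for this example, and it balances the classes exactly rather than approximately. It is not a RapidMiner operator.

```python
import random
from collections import Counter


def oversample_minority(rows, label_key, minority_label, seed=0):
    """Return a new list in which minority-class rows are duplicated
    (sampled with replacement) until they match the largest other class.

    Hypothetical helper for illustration only.
    """
    rng = random.Random(seed)  # fixed seed so the result is reproducible
    minority = [r for r in rows if r[label_key] == minority_label]
    others = [r for r in rows if r[label_key] != minority_label]
    if not minority:
        raise ValueError("no rows carry the minority label")

    # Size of the largest class among the non-minority rows.
    target = max(Counter(r[label_key] for r in others).values(), default=0)
    extra_needed = max(0, target - len(minority))

    # Copy each sampled row so later edits do not affect the original.
    extras = [dict(rng.choice(minority)) for _ in range(extra_needed)]
    return rows + extras


# The example table from the post; att4 holds the label.
rows = [
    {"att1": "M", "att2": "No", "att3": "Yes", "att4": "Good client"},
    {"att1": "M", "att2": "Yes", "att3": "Yes", "att4": "Good client"},
    {"att1": "M", "att2": "Yes", "att3": "No", "att4": "Good client"},
    {"att1": "M", "att2": "Yes", "att3": "No", "att4": "Good client"},
    {"att1": "M", "att2": "No", "att3": "No", "att4": "Good client"},
    {"att1": "M", "att2": "No", "att3": "Yes", "att4": "Good client"},
    {"att1": "M", "att2": "Yes", "att3": "No", "att4": "Bad client"},
]

balanced = oversample_minority(rows, "att4", "Bad client")
print(Counter(r["att4"] for r in balanced))
```

If this approach is used for modeling, the duplication would normally be applied to the training portion only, since copies of the same row appearing in both training and test data can make the measured accuracy look better than it is.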
  • Adiletkgz Member Posts: 5 Learner I
    @lionelderkrikor @varunm1
    Sorry for the inconvenience, I am new here =)