The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

Balancing Data Problem: I got zero results

yogipajarilloyogipajarillo Member Posts: 4 Contributor I
edited November 2018 in Help

Hi! I used the credit card fraud dataset from Kaggle. I just want to fix the Class column (where 0 = no fraud and 1 = fraud)

imbalanced dataset spreadsheet.png

 

It's highly imbalanced 

imbalanced dataset bar chart.png

 

I tried this method
sampling.png

Which got me nowhere

result.png

 

What am i doing wrong?

 

 

 

 

Best Answer

Answers

  • lionelderkrikorlionelderkrikor RapidMiner Certified Analyst, Member Posts: 1,195 Unicorn

    Hi @yogipajarillo,

     

    For me, it works with the following settings (chosen arbitrarily) : 

    Balanced_1.pngpmpmmp

    and then the statistics are : 

    Balanced_2.png

    Regards,

     

    Lionel

  • yogipajarilloyogipajarillo Member Posts: 4 Contributor I

    thank you @earmijo and @lionelderkrikor! i could swear i specified the Sample Size Per Class  the first time around, but in any case, i tried again now and the process worked. if you guys had not replied, i would have abandoned the problem and moved on to looking for a different method. thank you again for your time!

    result.png

     

Sign In or Register to comment.