The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
How to balance data based on attribute value?
User222697
Member Posts: 17 Contributor II
in Help
Hi,
i have a dataset with an attribute called "demo".
There are 3 possibles values in "demo" atribute:
"Alpha" which counts 4000 rows
"Beta" which counts 3000 rows
"Omega" which counts 2000 rows
How can i generate a new dataset balanced with the same number of rows?
"Alpha" which counts 2000 rows
"Beta" which counts 2000 rows
"Omega" which counts 2000 rows
Thanks
i have a dataset with an attribute called "demo".
There are 3 possibles values in "demo" atribute:
"Alpha" which counts 4000 rows
"Beta" which counts 3000 rows
"Omega" which counts 2000 rows
How can i generate a new dataset balanced with the same number of rows?
"Alpha" which counts 2000 rows
"Beta" which counts 2000 rows
"Omega" which counts 2000 rows
Thanks
0
Answers
look at the tutorial process for the Sample operator (linked in the help text).
It shows that the "balance data" is being used for this. You need to set the role of your demo attribute to "label", activate "balance data" in Sample, set the sampling method to absolute, and then enter the desired number of examples in the Edit List dialog.
Regards,
Balázs