The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
"Remove Samples With Many Missing Values"
I would like to filter examples that have more than a certain number of missing values, so I can then apply attribute filtering and not loose my entire dataset.
Any way to do this, the filter operator currently filters samples with any missing values.
Thanks,
-Gagi
Any way to do this, the filter operator currently filters samples with any missing values.
Thanks,
-Gagi
Tagged:
0
Answers
you have already posted in another thread that was presenting the solution for this problem. Why did you open just another one?
Greetings,
Sebastian
For example: Filter all examples that have more than 10 missing values for the current set of attributes.
This way we can do data shrinking from both dimensions. I have a lot of samples so this would be useful. ;D
Thanks,
-Gagi
excuse me, I should have read more carefully. Especially after I was surprised, that suddenly you are starting to spam the forum. Well, back to your original questions:
The computer scientist in me wants to answer: Transpose the ExampleSet and solve the old attribute problem, transpose it again and the things are cleared. But it doesn't seem to be appropriate So let's see...
After I tried quite some time, I didn't come up with a really satisfying solution. Here's the ground idea: Count each missing value into a new attribute and then filter the examples accordingly. Here's the process for counting the missings: Why this isn't satisfying? Because it only works on real valued attributes. So if this is a solution depends on what you need...
This again is a valuable feature, we should include in future versions. Please add it to the bug tracker as a feature request.
Greetings,
Sebastian
can you share a word on the if(%{currentAttribute}!=%{currentAttribute}...) statement. Seems to be very helpful. What's behind it?
Thx
ui3o
I was able to remove attributes and then filter examples for a similar effect.
The if statement looks like it follows this syntax: if(statement, true_action, false_action), correct?
Is there a list of supported functions in RM that we can use in expressions?
Thanks Again,
-Gagi
Late night waiting for US calls so I thought I'd take a look at making a missing value count column, and then bolting that on to the original data, like this.. Be wary of the part of you which doubted the appropriateness of transposition
This will be very useful.
The transpose is quite smart and lets you play within the confines of the RM space.
-Gagi