The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

How can I take only the variables with at least 5.000 observations?

ceci_roceci_ro Member Posts: 3 Learner III
Hello folks, 

I need a hand here...
How can I take only the variables with at least 5.000 observations?
I have too many variables, thank you in advance.


Cecilia 



Best Answers

  • BalazsBaranyBalazsBarany Administrator, Moderator, Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert Posts: 955 Unicorn
    Solution Accepted
    Hi @ceci_ro

    one approach would be using the Quality Measures operator. It calculates measures like missing values for each attribute.
    Then "ExampleSet to Weights" from the Converters extension. Here you can select the attribute name and the measure you need (missing values). 
    Then "Select by Weights" with a copy of the original data and the weights you created. Weight relation = less equals, weight = e. g. 0.2 or whatever is appropriate for your data.

    Regards,
    Balázs 
  • ceci_roceci_ro Member Posts: 3 Learner III
    Solution Accepted
    There is an operator that does this function: Toolbox extension, Filter Attributes with Missing Values ​​operator. Simple and beautiful.

Answers

Sign In or Register to comment.