
Newbie Question: removing duplicate entries

shilaski Member Posts: 8 Contributor II
I am just learning how to use this tool. Yesterday I figured out that I could load my spreadsheet directly, and now I am trying to remove duplicate entries from my data (cleaning it). I believe there is an operator for this (I searched for it), but I can't find it. Can anybody help?

Answers

  • TobiasMalbrecht Moderator, Employee-RapidMiner, Member Posts: 295 RM Product Management
    Hi,

    If you type "duplicate" into the operator search field, you should be able to find it. The operator is located in the group Preprocessing -> Data -> Filter and is called [tt]RemoveDuplicates[/tt]. Note that this operator is fairly new; I think it has been available since at least version 4.3.2.

    Regards,
    Tobias
  • shilaski Member Posts: 8 Contributor II
    I still cannot find it.  Is it in the community version?
  • IngoRM Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University Professor Posts: 1,751 RM Founder
    Hi,

    Basically yes, but minor releases such as 4.3.2 are only delivered to Enterprise version customers. The next version of the community edition (4.4, which will be released within the next few days) will also contain this new operator.

    Cheers,
    Ingo
  • shilaski Member Posts: 8 Contributor II
    Perfect! Thanks for the update.
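
For anyone who wants to see what a duplicate filter does to a data set before the operator is available to them, here is a minimal Python sketch. It is not RapidMiner's implementation of RemoveDuplicates; the function name `remove_duplicates`, the `key_columns` parameter, and the sample rows are all made up for illustration. It assumes each row is a dictionary of hashable values and keeps the first occurrence of every row.

```python
def remove_duplicates(rows, key_columns=None):
    """Return rows with duplicates dropped, keeping the first occurrence.

    rows        -- list of dicts, one dict per spreadsheet row (assumed layout)
    key_columns -- hypothetical option: compare only these columns;
                   if None, two rows are duplicates only when all values match
    """
    seen = set()
    unique = []
    for row in rows:
        if key_columns:
            # Compare only the selected columns.
            key = tuple(row[col] for col in key_columns)
        else:
            # Compare every column; sorting makes the key order-independent.
            key = tuple(sorted(row.items()))
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


# Made-up sample data standing in for a loaded spreadsheet.
rows = [
    {"id": 1, "name": "a"},
    {"id": 2, "name": "b"},
    {"id": 1, "name": "a"},  # exact duplicate of the first row
    {"id": 1, "name": "c"},  # same id, different name
]

all_columns = remove_duplicates(rows)
by_id_only = remove_duplicates(rows, key_columns=["id"])
```

Comparing on all columns removes only exact copies, while restricting the comparison to a subset of columns (here just `id`) is stricter and also drops rows that differ elsewhere. Which of the two you want depends on what counts as "the same entry" in your data.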