Data Manipulation

ad2045 Member Posts: 5 Learner I
I have the following data:

User ID  Category  Subcategory
02       A         Sub1
02       B         Sub2
03       C         Syb7

I would like to group the data by User ID as follows:

ID
CategoryA (contains the number of subcategories)
CategoryB (contains the number of subcategories)
CategoryC (contains the number of subcategories)

example output

User ID  CategoryA  CategoryB
022011
03555
04921

Could someone help by providing the process and parameter needed?

Answers

  • Telcontar120 RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,635 Unicorn
    Use the Aggregate operator. It looks like you want to Group By User ID and Category first, with count as the aggregation function (which attribute you count, e.g. User ID, doesn't really matter). That gives you the counts you want in long form, with one row per User ID and Category combination. To get the layout you showed, Pivot that output: Group By User ID (again) and Index on Category.
     
    Brian T.
    Lindon Ventures 
    Data Science Consulting from Certified RapidMiner Experts
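
The two steps described above (Aggregate, then Pivot) can be illustrated outside RapidMiner with a minimal Python sketch. It is not a RapidMiner process and does not use RapidMiner's API; it only mirrors the logic on the sample rows from the question. The function names `aggregate_counts` and `pivot_counts` are hypothetical, and filling absent User ID / Category combinations with 0 is an assumption of this sketch. It also assumes each row is a distinct subcategory, so counting rows equals counting subcategories.

```python
from collections import Counter

# Sample rows from the question: (User ID, Category, Subcategory)
rows = [
    ("02", "A", "Sub1"),
    ("02", "B", "Sub2"),
    ("03", "C", "Syb7"),
]

def aggregate_counts(rows):
    """Step 1 (Aggregate): count rows per (User ID, Category) pair."""
    return Counter((user_id, category) for user_id, category, _ in rows)

def pivot_counts(counts):
    """Step 2 (Pivot): one row per User ID, one column per Category."""
    categories = sorted({category for _, category in counts})
    user_ids = sorted({user_id for user_id, _ in counts})
    table = []
    for user_id in user_ids:
        row = {"User ID": user_id}
        for category in categories:
            # Assumption of this sketch: absent combinations become 0.
            row["Category" + category] = counts.get((user_id, category), 0)
        table.append(row)
    return table

pivoted = pivot_counts(aggregate_counts(rows))
for row in pivoted:
    print(row)
```

Each printed dictionary corresponds to one row of the pivoted table, keyed by User ID with one count column per Category.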