Cutting reviews into phrases while still knowing which video game they belong to
Hi all.
I have a dataset with multiple video game reviews, which I want to cut into phrases. However, I still need to know which video game each phrase belongs to. So let's say we have Game A and Game B. If Game A's review has 4 phrases, I want to chop the document up into those 4 phrases, with the next column showing which game each phrase belongs to.
I have tried some methods, but unfortunately my experience with RapidMiner is too limited to get this done.
Answers
Have a look at this process: The first part only creates fake data for demo purposes. The real meat begins with "Split", which splits each review on either . or ! followed by a whitespace, producing one column per phrase. "De-Pivot" then grabs the resulting columns and converts them to rows, so each phrase ends up in its own row next to the game it belongs to.
Note that I have added a breakpoint before "Split" so you can inspect the input data, which is probably somewhat similar to what you have. When you run the process, it will pause at the breakpoint. After looking at the data, you can press the (now green) run button again to finish the process.
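For readers who want to see the split-then-de-pivot idea outside RapidMiner, here is a minimal Python sketch of the same logic. It is not the RapidMiner process itself: the sample reviews, the `split_reviews` helper, and the choice to keep the trailing punctuation on each phrase are all assumptions made for illustration.

```python
import re

# Hypothetical sample data: (game, review) pairs standing in for the real dataset.
reviews = [
    ("Game A", "Great story. Controls feel clunky! Still worth it."),
    ("Game B", "Too short. Loved the music!"),
]

# Split at whitespace that follows '.' or '!'. The lookbehind keeps the
# punctuation attached to the phrase (an assumption; a plain split would drop it).
SENTENCE_BOUNDARY = re.compile(r"(?<=[.!])\s+")


def split_reviews(rows):
    """Return one (game, phrase) row per phrase, similar to Split followed by De-Pivot."""
    result = []
    for game, text in rows:
        for phrase in SENTENCE_BOUNDARY.split(text.strip()):
            if phrase:  # skip empty pieces
                result.append((game, phrase))
    return result


rows = split_reviews(reviews)
for game, phrase in rows:
    print(f"{game}\t{phrase}")
```

Each output row carries the game name alongside one phrase, which is the long format that the "De-Pivot" step produces from the split columns.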
Cheers,
Marco