The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

Exporting the preprocessed text data into Excel File

lolollolol Member Posts: 5 Learner I

Hi,

I really need your help. I have created a predictive model. The dataset I'm using contains several text attributes that were merged into one text attribute/column during the preprocessing phase. This preprocessing involved steps such as tokenization, stopword removal, token filtering, stemming, and generating n-grams. Could you please guide me on how to unmerged back all those text attributes and  into different columns as they were before? Additionally, I would like to export the data into an Excel file so that I can compare the original unprocessed text with the preprocessed text. Thank you for your assistance.

Answers

  • ruhailaruhaila Member Posts: 48 Guru
    Hi.

    Since there's no .rmp or data files included in your post, I am merely assuming that you could use the SELECT ATTRIBUTES operator to unmerge attributes. Exporting Excel has the WRITE EXCEL operator. hth.
  • BalazsBaranyBalazsBarany Administrator, Moderator, Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert Posts: 955 Unicorn
    Hi!

    Usually the easiest way for getting back the initial data is creating an ID (if you don't have one yet) and multiplying the dataset. Then you do your arbitrary preprocessing, keep the ID attribute, and then join back after the processing.

    Regards,
    Balázs
Sign In or Register to comment.