
"[SOLVED] Text mining on text attribute of a CSV?"

wypee Member Posts: 7 Contributor II
edited June 2019 in Help
Sorry for this rather noob question. I am just getting started with RapidMiner by trying it on one of the Kaggle challenges, which requires categorizing user comments as insulting or not insulting. The input is a CSV with tens of thousands of user comments, but I notice that all of RapidMiner's text mining operators expect documents. So, am I supposed to split each row of the CSV into its own file and then feed those files to RapidMiner as a set of documents? That doesn't seem right. What is the right way of doing it?

Thanks in advance.

Answers

  • awchisholm RapidMiner Certified Expert, Member Posts: 458 Unicorn
    Hello wypee

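    Use the Process Documents from Data operator. It works directly on the example set, treating the text attribute of each row as one document, so there is no need to split the CSV into separate files. Before this, you need to convert the type of the attribute containing the data you want to analyse to text. Use the Nominal to Text operator for this.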

    regards

    Andrew
  • wypee Member Posts: 7 Contributor II
    That's what I needed! Thanks a lot, Andrew.  :)
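
For readers who think in code, the same idea (one CSV row = one document, no splitting into files) can be sketched in plain Python. This is not RapidMiner code and not what the operators do internally; it is only a rough analogue using the standard library. The sample data, the column names "Insult" and "Comment", and the helper names tokenize and rows_to_word_vectors are all made up for illustration.

```python
import csv
import io
import re
from collections import Counter

# Hypothetical sample standing in for the Kaggle CSV; the column names
# "Insult" and "Comment" are assumptions, not taken from the real file.
SAMPLE_CSV = '''Insult,Comment
1,"You are a complete idiot"
0,"Thanks, that was a helpful answer"
0,"I think you are right about that"
'''


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def rows_to_word_vectors(csv_text, text_column):
    """Treat each CSV row as one document and build term-count vectors."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    # Force the column to a plain string: loosely analogous to converting
    # a nominal attribute to text before processing it.
    documents = [str(row[text_column]) for row in rows]
    counts = [Counter(tokenize(doc)) for doc in documents]
    # One vector position per distinct word seen in any document.
    vocabulary = sorted(set().union(*counts))
    vectors = [[c[word] for word in vocabulary] for c in counts]
    return vocabulary, vectors, rows


vocabulary, vectors, rows = rows_to_word_vectors(SAMPLE_CSV, "Comment")
labels = [row["Insult"] for row in rows]
```

Each entry of vectors lines up with one row of the CSV, and labels keeps the matching class for that row, so the pair could be handed to a classifier. Real text processing would likely add steps such as stop-word filtering or TF-IDF weighting, which this sketch leaves out.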