The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Text Data Cleaning, Preprocessing and Text Mining with Rapidminer
I have a large number of English texts (online reviews). I need to do a data cleaning, preprocessing on them and then mine which are the effective high-frequency words? For example, "bathroom", "traffic", etc., are likely to appear as valid high-frequency words. Do you have any specific steps to do so?
0
Answers
Yes, you can use Rapidminer for that and you have specific content in Rapidminer Acafemy for free.
please check this thread
please check this thread
https://community.rapidminer.com/discussion/60355/can-process-documents-calculate-term-occurences-of-all-words-without-having-to-give-it-a-word-list#latest
best,
Cesar