Text analysis & preprocessing
Hi Guys
Really new to Rapidminer and text mining. I have a CSV with 3000+ tweets from one user. I want to remove all the stop words from the dataset. I have tried following examples but I end up with blank results. Could anyone explain the stages of this process in Rapidminer it would be appreciated.
Thanks
Best Answer
-
IngoRM Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University Professor Posts: 1,751 RM Founder
Hi,
Here are some generic hints about how to perform text analytics in RapidMiner.
For all types of text analysis, you will need the Text Mining extension for RapidMiner which you can download for free from our Marketplace. You can find it in the menu “Extensions” – “Marketplace” and type “Text” in the search box (here is also a link directly to our marketplace:https://marketplace.rapidminer.com/UpdateServer/faces/product_details.xhtml?productId=rmx_text). There are also many more extensions on our Marketplace so make sure that you check them out…
There is a community member who created a nice set of tutorials for text analysis with RapidMiner: http://vancouverdata.blogspot.com/2010/11/text-analytics-with-rapidminer-loading.html
Finally, there are two more extensions which might be interesting from our partners (Aylien and Rosette).
Hope this helps,
Ingo
1