top TF-IDF keyword
Hello
I want to extract the five words with the highest TF-IDF values from the output TF-IDF matrix.
How should I do this?
Thanks
Also, how do I remove the '@' and '#' characters and URLs from a sentence in RapidMiner?
Answers
Hi @ahootanha,
To answer your first question, you can find here a process that does what you want:
I hope it helps,
Regards,
Lionel
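The process mentioned above is not reproduced in this thread. As a rough illustration of the same idea outside RapidMiner, here is a minimal Python sketch that builds a small TF-IDF matrix and picks the highest-weighted terms per document. It assumes the textbook weighting tf * log(N / df), which may differ from the normalisation RapidMiner applies, and the names `tfidf_matrix` and `top_terms` and the sample documents are made up for the example.

```python
import math
from collections import Counter

def tfidf_matrix(docs):
    """Return one {term: weight} dict per tokenised document.

    Assumption: weight = (count / doc length) * log(N / document frequency).
    """
    n_docs = len(docs)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for doc in docs for term in set(doc))
    rows = []
    for doc in docs:
        counts = Counter(doc)
        rows.append({
            term: (count / len(doc)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return rows

def top_terms(row, k=5):
    """Return the k highest-weighted (term, weight) pairs of one row."""
    # Sort by descending weight, breaking ties alphabetically.
    return sorted(row.items(), key=lambda item: (-item[1], item[0]))[:k]

# Hypothetical sample documents, already split into words.
docs = [
    "the cat sat on the mat".split(),
    "the dog chased the cat".split(),
    "the bird sang".split(),
]
rows = tfidf_matrix(docs)
for row in rows:
    print([term for term, _ in top_terms(row, k=5)])
```

A word such as "the" that appears in every document gets weight 0 under this formula, so it falls to the bottom of each ranking.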
If you Tokenize on non-letters, all the special characters will be stripped from the resulting words that comprise the word vector.
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts
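To show what splitting on non-letters does to '@', '#' and similar characters, here is a minimal Python sketch using a regular expression. It is only an approximation of the RapidMiner behaviour: it treats ASCII letters only, and it removes URLs in a separate first step (an added assumption, since splitting a URL on non-letters would otherwise leave fragments like "https" and "com" as tokens). The function name and the sample sentence are made up.

```python
import re

# Assumption: a URL is anything starting with http:// or https:// up to the next space.
URL_PATTERN = re.compile(r"https?://\S+")
# Any run of characters that are not ASCII letters acts as a separator.
NON_LETTERS = re.compile(r"[^A-Za-z]+")

def tokenize_non_letters(text):
    """Drop URLs, then split on every run of non-letter characters."""
    text = URL_PATTERN.sub(" ", text)
    # Filter out the empty strings that split() leaves at the edges.
    return [tok for tok in NON_LETTERS.split(text) if tok]

print(tokenize_non_letters("Thanks @user! See https://example.com #rapidminer"))
# prints ['Thanks', 'user', 'See', 'rapidminer']
```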
@ahootanha what @Telcontar120 says is true. My suggestion is to use Specify Characters in the Tokenize operator to select what to split on. I do a lot of Twitter extraction and I don't want #hashtag to get wiped out by default, so I split on stuff like !.?"[ but not on #.
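The idea of splitting only on chosen characters so that hashtags survive can be sketched in Python as follows. This is an illustration, not the Tokenize operator itself: the split set is the characters listed above plus whitespace (whitespace is an assumption), and the function name and sample sentence are made up.

```python
import re

# Split on whitespace and on the characters ! . ? " [ but not on #.
SPLIT_CHARS = re.compile(r'[\s!.?"\[]+')

def tokenize_keep_hashtags(text):
    """Split on the chosen characters only, so '#tag' stays one token."""
    # Filter out the empty strings that split() leaves at the edges.
    return [tok for tok in SPLIT_CHARS.split(text) if tok]

print(tokenize_keep_hashtags('Loving #RapidMiner! Is "text mining" fun?'))
# prints ['Loving', '#RapidMiner', 'Is', 'text', 'mining', 'fun']
```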
Hello
thank you
But
I am a beginner
I do not understand where to use this code.
How do I write a regular expression in the filter token operator?
Please guide
Thanks
Can you give more guidance? And an example
@ahootanha grab the process here: http://www.neuralmarkettrends.com/use-rapidminer-discover-twitter-content/
Hello
thank you very much
But
I do not know where to use this code in my RapidMiner program.
Please guide
Send me a screenshot of the implementation of operators
Thanks
@ahootanha take a look at this thread: https://community.rapidminer.com/t5/RapidMiner-Studio-Forum/Import-XML-code-to-process/m-p/32606#M23194
hello @ahootanha welcome to the community! Some quick recommendations for you (pretty much exactly what @Thomas_Ott was recommending)...
• Post your XML process here in this thread (see https://youtu.be/KkgB5QXWXJ8 and "Read Before Posting" on right when you reply)
• Attach your dataset if possible (use a fictionalized version if there are privacy concerns)
• Make sure you have all dependent extensions installed (see https://youtu.be/pjBqG3xtXx4)
Scott
Hello
I saw links to YouTube
I installed all the packages
But I still cannot
extract the ten most frequent words from the TF-IDF matrix
Please guide
Thanks
Should I run the program after
writing the XML code?
How?
hello @ahootanha I really need to see your data and your XML process in order to help. Can you please post both here in this thread?
Scott
Hello
Thanks
I did not write any code
I just entered the data and used the Process Documents operator (TF-IDF)
Thank you for helping me
Please