Sentiment Anaylsis of Twitter Data
data:image/s3,"s3://crabby-images/e9e37/e9e376f86fc989f8be36462752cae2b4a4f55b06" alt="marlene_boettch"
data:image/s3,"s3://crabby-images/2b985/2b98579ca4abd9381f526177f3044c84ba7cc6a3" alt=""
Hello there,
I would like to conduct a sentiment analysis of Twitter data. I already looked in the forums for a solution to my problem, but all entries I found were very incomprehensible to me. (I really am a desperate beginner with a hardly any talent for anything to do with technology)
My problem is probably quite simple: I have 10000 tweets stored in a CSV file. Now, as already mentioned, I would like to carry out a sentiment analysis. My teacher gave me a process for that. However, in this process the tweets must be read in as separate text files.
Now my question: How do I filter the tweets from the CSV file so that each tweet (and only the text, not the other information like username, ID and so on) is stored in a separate txt file?
As I said, my understanding of RapidMiner is unfortunately very limited, so I would be very grateful if someone could explain it to me as simply as possible.
Thank you very much and have a nice day
Marlene
Answers
Hi @marlene_boettch,
Difficult to help you without your .csv file and your process...
Can you share them in order we better understand your problem ?
Regards,
Lionel
Hi,
Data to Document, Loop Collection, Write CSV is a way to do this. But i think you can just use Read CSV+data to Document and you got it in the format you need for your analysis.
BR,
Martin
Dortmund, Germany
There are also a number of recent threads that have almost exactly the same type of problem. See the ongoing discussions here, for example, both of which have several example processes:
https://community.rapidminer.com/t5/Getting-Started-Forum/Errors-Twitter-data-Suddenly-Attribute-Label-Missing-Inside/m-p/52708#M3254
https://community.rapidminer.com/t5/Getting-Started-Forum/Non-nominal-label-the-lavel-attribute-must-be-nominal/m-p/35581#M263
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts
Hello to all who have answered until now!
From what has been written here so far, I have managed to bring my tweets from the CSV document into an IO Object Collection. This is now in the form of an .md file. Unfortunately I cannot use the tweets in this way because I need them as individual text files to be able to read them into the process and use them further. This is the process:
I'm sorry if I'm being dumb, but my poor understanding of the subject matter and my mediocre English skills just make me a little desperate.
data:image/s3,"s3://crabby-images/a752f/a752fc9865231bdcd920a37a85fbf412d4fb2fc2" alt=":) :)"
Thanks for your help so far!
Kind regards
Marlene
Hi @marlene_boettch,
i think you only need one Process Documents per CSV file you have. Chaining two is a bit odd.
Attached are two processes, one is separating the texts. I do not think you need this. The second is showing you what i think you need to do.
BR,
Martin
Seperating files:
How to tokenize the csv directly:
Dortmund, Germany
Hello @marlene_boettch - have you tried looking at some of our community processes from within RapidMiner Studio? There are now two that do things very similar to what you are looking for:
Scott