"Sentiment Analysis - Choosing the right process"
Dear All,
I am very new to this wonderful world of data mining and I have to say I am more than impressed. I will try to sum up my problem in a few words:
I have an Excel file with two columns: column A contains phrases (text) expressing an opinion on a certain matter, while column B contains the character n or p, depending on whether the sentiment in the phrase is negative or positive respectively. The p and n labels were entered manually by me.
(e.g.: The site is very helpful -> p / All this is awful -> n)
What I want to do is use this file as a training set to learn a model that I can then apply to other data (that is, similar phrases expressing an opinion on a specific matter). What I need to know is which operators to use to build the required process.
Really counting on your support and thanking you in advance,
Kind Regards
Answers
A (EXPRESSION)                     B (POLARITY)
I am sick with this situation      n
They are idiots and incapable      n
this is extremely useful           p
it's getting worse everyday        n
I believe it is a good step        p
......................................
Once again, p stands for a positive and n for a negative attitude reflected in the short phrases of column A.
My question is which operator should be used to create a model that learns from an Excel sheet like the one above.
The model in question will then be applied to an Excel sheet consisting only of phrases, without the sentiment (POLARITY) column.
Anyone with a piece of advice is more than welcome....
Thank you
This is essentially the same as video 5 in the text analytics series on my blog.
You are trying to classify those phrases as negative or positive. This is classification with 2 classes.
You'll want to create a word vector, with a column for each (unique) word or n-gram, then use a classifier such as an SVM to learn the model.
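For anyone who wants to see the idea outside of an operator-based process, here is a rough Python sketch of the two steps described above: build a word vector (one column per unique word or bigram), then learn a linear SVM on it. This is only an illustration under assumptions, not the process from the video. The function names (`tokenize`, `build_vocabulary`, `vectorize`, `train_linear_svm`, `predict`) are made up for this example, the training rows are the five phrases posted earlier in the thread, and the SVM is a plain linear one fitted by subgradient descent on the hinge loss with arbitrarily chosen settings rather than a library solver.

```python
import re
from collections import Counter

# Labelled phrases copied from the example table in this thread
# (p = positive, n = negative).
TRAINING = [
    ("I am sick with this situation", "n"),
    ("They are idiots and incapable", "n"),
    ("this is extremely useful", "p"),
    ("it's getting worse everyday", "n"),
    ("I believe it is a good step", "p"),
]


def tokenize(text):
    """Lowercase the text and return its words plus adjacent-word bigrams."""
    words = re.findall(r"[a-z']+", text.lower())
    bigrams = [f"{a} {b}" for a, b in zip(words, words[1:])]
    return words + bigrams


def build_vocabulary(texts):
    """Map every unique word or bigram in the texts to a column index."""
    terms = sorted({term for text in texts for term in tokenize(text)})
    return {term: i for i, term in enumerate(terms)}


def vectorize(text, vocab):
    """Return a sparse word vector {column index: count}; unseen terms are dropped."""
    counts = Counter(t for t in tokenize(text) if t in vocab)
    return {vocab[t]: float(c) for t, c in counts.items()}


def train_linear_svm(vectors, targets, n_features, epochs=200, lr=0.1, lam=0.01):
    """Fit weights and bias by subgradient descent on the L2-regularised hinge loss.

    The hyperparameters are illustrative assumptions, not tuned values.
    """
    w = [0.0] * n_features
    b = 0.0
    for _ in range(epochs):
        for x, y in zip(vectors, targets):
            margin = y * (sum(w[i] * v for i, v in x.items()) + b)
            # Regularisation step: shrink every weight slightly.
            w = [wi * (1.0 - lr * lam) for wi in w]
            # Hinge-loss step: only examples inside the margin push the weights.
            if margin < 1.0:
                for i, v in x.items():
                    w[i] += lr * y * v
                b += lr * y
    return w, b


def predict(text, vocab, w, b):
    """Classify a phrase as 'p' or 'n' from the sign of the linear score."""
    x = vectorize(text, vocab)
    score = sum(w[i] * v for i, v in x.items()) + b
    return "p" if score >= 0 else "n"


texts = [text for text, _ in TRAINING]
targets = [1 if label == "p" else -1 for _, label in TRAINING]

vocab = build_vocabulary(texts)
vectors = [vectorize(text, vocab) for text in texts]
w, b = train_linear_svm(vectors, targets, n_features=len(vocab))

# Apply the learned model to a phrase that has no polarity label yet.
print(predict("this is a good and useful site", vocab, w, b))
```

With only five training phrases the model has seen very few words, so predictions on new phrases are not reliable; the point is just the shape of the pipeline (text -> word vector -> classifier). Words that never appeared in the training file are ignored at prediction time, which is one reason a larger labelled file helps.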
As you may have seen, I was so keen on finding a solution that I had sent you an email as well.
I have to say that the hints you gave were quite helpful and I am closer to getting where I am aiming to be. I am really thankful.
I will try a few things and come back in case any further question arises.
Of course, anyone else's approach to the matter is welcome and I look forward to seeing it.