The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Polynominal sentiment analysis in SVM
HeikoeWin786
Member Posts: 64 Contributor I
in Help
Dear all,
I am trying to perform SVM on the dataset where customer review as polynominal and sentiment score as bionominal. I had read the tutorials and figured out that SVM can only handle numerical and needed to convert nominal to numerical. However, is it to convert both customer reviews ans sentiment score to numerical? In which steps we need to convert? After processed the data? I am a bit confused of how sentiment analysis work in SVM in rapidminer. The RM tutorial under the sample templates is using text and binominal and not even converting to numerical.
Can anyone suggest me how to fix this issue correctly?
I had attached my process flow for your easy reference.
thanks.
Heikoe
I am trying to perform SVM on the dataset where customer review as polynominal and sentiment score as bionominal. I had read the tutorials and figured out that SVM can only handle numerical and needed to convert nominal to numerical. However, is it to convert both customer reviews ans sentiment score to numerical? In which steps we need to convert? After processed the data? I am a bit confused of how sentiment analysis work in SVM in rapidminer. The RM tutorial under the sample templates is using text and binominal and not even converting to numerical.
Can anyone suggest me how to fix this issue correctly?
I had attached my process flow for your easy reference.
thanks.
Heikoe
0
Best Answer
-
jacobcybulski Member, University Professor Posts: 391 UnicornYou will face here a dillema of accuracy vs efficiency. If your data sample is sufficiently large it is likely that a simple split/holdout validation inside the drid optimisation will be enough. However, if you though that you needed cross-validation in the first place then I'd also place it inside the grid optimisation. What you may decide is to reduce the number of folds. Let's say that you optimise on C and gamma, both in 10 log steps, and you placed SVM inside a 10-fold cross validation, it means you'll be running SVM 10x10x10=1000 times. If you believe that a simple split will result in two representative samples then you can reduce the runs to 100.
5
Answers
Could anyone please help me with this understanding, please?
If you shared your Excel then we could run your process and see what's going on.
Scott
@jacobcybulski
Hello both,
Thanks for your kind input.
Yes, I had changed the label to binominal and processed the data.
I am really not sure how to pick the optimum model (I tried SVM and NBC so far).
I see the sample sentiment analysis is using SVM in cross validation as well.
Let me check the sample repository once more again and explore other models.
And, yes for Sure, I can share with you the file.
Much appreciated for all your input for real!!
thanks and regards,
Heikoe
Thanks much again here also.
Does it mean, I need to place my cross validation process inside Grid Optimizer?
Currently, inside cross validation is SVM process.
So, now, i put corss validation inside grid and run the process, it will return the parameters which best fits. I take that parameter and apply that in the actual SVM process. AM i correct?
thanks and regards,
Heikoe