The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Confusion Matrix for Analyze Sentiment Operator
Hello. I have not used RapidMiner for a while, so please forgive the noob question. I am trying to recreate a RapidMiner model that I had read in an academic journal. The model uses the Aylien "Analyze Sentiment" operator to do sentiment analysis of a movie review dataset. I need to generate a confusion matrix for this model. I tried to insert a performance operator, but it kept giving me an error saying it was expecting a label. Each item in the dataset comes with a "polarity" label, so I don't know what I am not doing correctly. Any suggestions?
0
Answers
Yes, any performance operator for models requires a label to run. The polarity from the operator is the prediction, not the actual label. To generate a confusion matrix, you need to have an independent label, so it can compare the prediction against the actual outcome. Unless you hand label the texts you are running through the Aylien API, you are not going to be able to generate a confusion matrix even if you pretend the Aylien prediction is the actual label, because one half of the matrix will be empty (or the prediction will always match the actual, which is a useless output).
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts
Actually these set of videos related to the validation of the classification model to split the data into training and testing sets.
I am talking about the unsupervised clustering process (but for the sake of evaluating the model the ground truth classes are given. i.e, it will apply after applying the clustering model for evaluation purposes). Moreover, the actual and predicted classes are multi not binary clases.
Any help I will appreciate ...
Regards
It could be easier to help you if you could share here actual dataset on which you want to produce confusion matrix.
Vladimir
http://whatthefraud.wtf