The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Problems for choosing SVR or SVC in LIBSVM?
Dear all,
I want to solve a simple text classification with only two classes. Obviously C-SVC in libsvm is a OK choice. It's just when I went through some relative papers, I found that the SV regression may also be used to solve these kinds of text classification problems and the results may be more accurate. I got confused.
Could the SV regression really be also chosen to solve classification (eg, text classification) problems?
If so, how could choose the proper parameter to do that?
PS: the training set has already been separated in to two folders appropriately.
I know my questions are quite silly, but I'm really confused. :-[
Thanks in advance.
I want to solve a simple text classification with only two classes. Obviously C-SVC in libsvm is a OK choice. It's just when I went through some relative papers, I found that the SV regression may also be used to solve these kinds of text classification problems and the results may be more accurate. I got confused.
Could the SV regression really be also chosen to solve classification (eg, text classification) problems?
If so, how could choose the proper parameter to do that?
PS: the training set has already been separated in to two folders appropriately.
I know my questions are quite silly, but I'm really confused. :-[
Thanks in advance.
0
Answers
basically all regression schemes can also be used for classification tasks, this is nothing specific to SVM. In fact, the classification of a linear regression is directly employing this idea, the same is true for the classification meta learner "Classification by Regression". One of the ideas people often have is to directly model the prediction probability instead of using and / or transforming the confidences.
About the optimization of parameters in this setting: Just use a regression performance criterion or transform the predicted regression values into a class before calculating the performance.
Cheers,
Ingo
Many many thanks, Ingo. That's very helpful. I guess I still have a lot to learn. Thanks, again.
lucky.