The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
"Text Classification with two languages in one model"
Hi,
I hope that my question is a newbie question.
I want to make a text classification tool for two languages. For that I want to create two Textinput-Elements with two different Stemmer-Elemets (one for each language). Is it possible to load both textinput-elements with the same two labels in one LibSVMLeaner?
Thanks for any help,
Anja
I hope that my question is a newbie question.
I want to make a text classification tool for two languages. For that I want to create two Textinput-Elements with two different Stemmer-Elemets (one for each language). Is it possible to load both textinput-elements with the same two labels in one LibSVMLeaner?
Thanks for any help,
Anja
Tagged:
0
Answers
unfortunately I'm not quite sure what you are aiming at. The TextInput operators produce an exampleset containing the informations about each processed text as a single example. The Learning operators (and hence the LibSVM, too) use an exampleset for learning.
If your question is, if the text may occur twice in the example set with the same label and probably slightly changed word values: Yes this works. BUT: The performance will be probably not as good as training two SVM, one for each used stemming.
Greetings,
Sebastian
the question was if you could train one SVM with two examplesets. I decided to train two SVMs, but it's working.
Attached the XML for those who are interested. Best regards,
Anja
yes it works, but it only trains on one ExampleSet. You would have to merge the two sets, in order to train on all data. Use the ExampleSetMerge operator for this.
Greetings,
Sebastian