"using SVM with WVT"
Original message posted on SourceForge forum at http://sourceforge.net/forum/forum.php?thread_id=2041847&forum_id=390413
Hi all,
As part of my thesis, I need to use an SVM to classify some document vectors. To do that, I'm using the Word Vector Tool (WVT) extension.
When I used RapidMiner in the past it was only for clustering, so all I had to do was put all my documents in a single directory and then use the following code:
List<Object[]> textList = new LinkedList<Object[]>();
//adding the directory
textList.add(new Object[] {"test_set","D:/programming/eclipse/Thesis/datasets/documents"});
Operator wvtoolOperator = OperatorService.createOperator("WVTool");
wvtoolOperator.setListParameter("texts", textList);
Now, however, I need to assign different values (class labels) to two groups of documents. So I tried to use the following code:
WVTFileInputList list = new WVTFileInputList(2);
list.addEntry(new WVTDocumentInfo("D:/programming/eclipse/Thesis/datasets/confidential", "txt","", "english", 0));
list.addEntry(new WVTDocumentInfo("D:/programming/eclipse/Thesis/datasets/unconfidential", "txt","", "english", 1));
The problem is that I wasn't able to find out how to get this list into the wvtoolOperator object.
Help will be very much appreciated.
Gil
Answer by Ingo Mierswa:
Hello,
just use
List<Object[]> textList = new LinkedList<Object[]>();
// adding one entry per class: {label, directory}
textList.add(new Object[] {"confidential","D:/programming/eclipse/Thesis/datasets/confidential"});
textList.add(new Object[] {"unconfidential","D:/programming/eclipse/Thesis/datasets/unconfidential"});
Operator wvtoolOperator = OperatorService.createOperator("WVTool");
wvtoolOperator.setListParameter("texts", textList);
Cheers,
Ingo
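As a rough sketch of the same idea for more than two classes: the answer above passes one {label, directory} pair per group, so the list can be built from a map of labels to directories. The class name TextListBuilder and the method buildTextList are hypothetical helpers, not part of RapidMiner or the Word Vector Tool; only the pair layout and the "texts" parameter come from the answer. The RapidMiner calls are left as comments so the snippet compiles without RapidMiner on the classpath.

```java
import java.util.LinkedHashMap;
import java.util.LinkedList;
import java.util.List;
import java.util.Map;

// Hypothetical helper (an assumption for illustration, not a RapidMiner class).
final class TextListBuilder {

    private TextListBuilder() {}

    /**
     * Turns a label -> directory map into the {label, directory} pairs
     * that the answer above passes as the "texts" list parameter.
     * Entries are emitted in the map's iteration order.
     */
    static List<Object[]> buildTextList(Map<String, String> directoriesByLabel) {
        List<Object[]> textList = new LinkedList<Object[]>();
        for (Map.Entry<String, String> entry : directoriesByLabel.entrySet()) {
            textList.add(new Object[] {entry.getKey(), entry.getValue()});
        }
        return textList;
    }

    public static void main(String[] args) {
        // LinkedHashMap keeps insertion order, so the labels stay in a stable order.
        Map<String, String> dirs = new LinkedHashMap<String, String>();
        dirs.put("confidential", "D:/programming/eclipse/Thesis/datasets/confidential");
        dirs.put("unconfidential", "D:/programming/eclipse/Thesis/datasets/unconfidential");

        List<Object[]> textList = buildTextList(dirs);
        for (Object[] pair : textList) {
            System.out.println(pair[0] + " -> " + pair[1]);
        }

        // With RapidMiner available, the list would then be handed over as in the answer:
        // Operator wvtoolOperator = OperatorService.createOperator("WVTool");
        // wvtoolOperator.setListParameter("texts", textList);
    }
}
```

Keeping the labels in a map makes it harder to mistype one of them in only one place, and adding a third class is a single extra put call.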