سلام. وقت بخیر
چگونه میتوان بعضی از ستون های اکسل را در بکارگیری مدل موثرتر و کارآمد تر بیان کرد؟
--------------------------------------------------------------------------------------------------------------------------------------------------------------------
Hello
how we can make some columns more important and effective?
The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
making some columns more important
[Deleted User]
Posts: 0 Newbie
Best Answer
-
rfuentealba RapidMiner Certified Analyst, Member, University Professor Posts: 568 UnicornHello @mbs:
You are connecting these the wrong way.
The line marked with an X and a 1, going between the exa output in the Decision Tree operator and the exa input in the Set Role operator shouldn't be there. Instead, replace it with the black line marked with a 2, because the predicted label is added by the Apply Model operator when you apply a model through the mod input and a set of not labeled data in the unl input.
First, let's fix this and then we can continue with weightlifting weight handling. I am setting up an example for you.
1
Answers
Can you please explain more about your requirement?
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing
I mean that I want make some columns more effective on the algorithm.
some columns are more important Features so i need to make them more effective
thank you
it has label
thank you for your help I will try it and then for weight if I have any problem I will ask
also is weighting good way for making some columns more effective?
mbs
Here is your example on how to Select by Weights. There are some more things you should know, but first:
- I convert everything to Numerical because weighting can't be applied to categories.
- Splitting the data stratifying the examples.
- Applying the weighting by correlation method to the stratified examples. You can select any kind of weighting at this point.
- Selecting the most important weights to train our Decision Tree.
- The rest is standard procedure.
You may also want to use Decision Tree (Weight Based) or DBSCAN (Weight Based), as not all ML algorithms support weight-based operations.Now, this process takes only the most important weighted columns, and discard the others. Here is the XML, in case you wish to experiment:
Hope this helps.
Yes, it's very recommended to use weighting for this. Most of the time, it's even more recommended than upsampling or downsampling.
All the best,
Rodrigo.
thank you very much for your help
With your example I change the data with my dataset but I have to add some more operator in order to process work with my data. please look at these screen shots and also I can not understand the result
Any way thank you for your help
the points that you mentioned in the screen shot works but still i use weighting by information gain because the correlation operator doesnt work with my data and also I changed tree to the ruleinduction and the result is 98.86
thank you
To be clear: my example was a quick one to show the specific ordering of the elements. If you want to do weighting by awesomeness, go ahead, hahaha.
The result is a confusion matrix or something, where you need to see:
- How many predicted positives are in the true positives list?
- How many predicted negatives are in the true negatives list?
- The class precision = how precisely can you select the sampled true positives (or sampled true negatives)
- The class recall = how precisely can you select all the true positives (or true negatives)
In the case you showed, 100% is a perfect score. The problem would be if with that 100%-scored algorithm you can score the 100% of the new phenomena, because if it doesn't, you are causing overfitting to your model.All the best,
Rodrigo.
@rfuentealba
with your example and my data i can not understand the result and it is not clear. but with my example every thing is clear