The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Is it possible to download the Inputs selected by automodel and their corresponding parameters
Hello,
I am working on automodel for my data with 77 attributes. I am trying to get all the details of attributes (Columns) analysis done by automodel (Correlation, ID-ness, Stability and Missing Values). Is it possible to download this data showed by auto model into excel or any other file format?
One more question is what is the "?" in ID-ness column in automodel.
Thanks,
Varun
I am working on automodel for my data with 77 attributes. I am trying to get all the details of attributes (Columns) analysis done by automodel (Correlation, ID-ness, Stability and Missing Values). Is it possible to download this data showed by auto model into excel or any other file format?
One more question is what is the "?" in ID-ness column in automodel.
Thanks,
Varun
Regards,
Varun
https://www.varunmandalapu.com/
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing
Tagged:
0
Best Answer
-
IngoRM Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University Professor Posts: 1,751 RM FounderHi @varunm1,Martin is right, we currently do not have any way to export those numbers since there is no operator for this.> One more question is what is the "?" in ID-ness column in automodel.ID-ness is only calculated for nominal columns and integer columns (real numbers are rarely used as IDs anyways). The rational for this is that most real-valued columns would otherwise be (falsely) identified as IDs since it is very likely that they show 100% ID-ness.We of course could still calculate the ID-ness nevertheless, but found it in some usability tests that people are then confused by the inconsistency that some columns with 100% ID-ness (nominal ones) are excluded by Auto Model while others (real-valued ones) are not. So we decided to not calculate the ID-ness for real-valued columns at all to avoid that.(...and yes, this confusion can still happen for Integer columns but here a 100% ID-ness actually IS more frequently an actual ID and people just accept that correct handling without questioning it )Hope this helps,
Ingo7
Answers
Dortmund, Germany
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing
Dortmund, Germany
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing
Vladimir
http://whatthefraud.wtf
Vladimir
http://whatthefraud.wtf
Scott
Need one clarification. Is there any use to have both stability and ID-ness in the automodel as these look like similar things?
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing
Ingo
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing
Regardless of the selection method used (and I can see the arguments for leaving it the way it is in Automodel), the current Automodel process calculates a value for each attribute for 5 quantities: correlation, id-ness, stability, missing, and text-ness (that's a new one!).
It would be nice to have an operator which generated these same values inside any process and provided the results as a dataset. You could then use that operator to create filtering/weighting/selection rules of your own choosing based on whatever threshold values you wanted. Currently you can do that for things like missing value percentage or correlation (because there are operators that can be used to calculate those) but not for the others (as far as I know). So there is still a gap in the capabilities of Automodel vs non-automodel processes.
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts
Ingo
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing
Ingo
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing
Ingo