The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Auto Model lost data
data:image/s3,"s3://crabby-images/e9e37/e9e376f86fc989f8be36462752cae2b4a4f55b06" alt="DrGintoki2021"
data:image/s3,"s3://crabby-images/5f468/5f4680711dcf5b2bea70da8891109c95c08b4440" alt=""
hi guys,
(1)when I was using auto model in rapidminer 9.8 and trying to predict the values of a column,
(2)I found that it only show 65 rows of data and the confusion matrix only show me less than 50 data——
(1)when I was using auto model in rapidminer 9.8 and trying to predict the values of a column,
(2)I found that it only show 65 rows of data and the confusion matrix only show me less than 50 data——
(3)actually I have 162 rows.
so... why? how to show me the whole 162 rows predictions performance and confusion matrix?
thanks !!!data:image/s3,"s3://crabby-images/42d88/42d88020db5245bd6d26803f3a1989b6bedd4004" alt="Image: https://us.v-cdn.net/6030995/uploads/editor/0o/21glvilxv5xh.png"
data:image/s3,"s3://crabby-images/d52f0/d52f03a8dcaf97c497bf5b1bccca4107d3542b49" alt="Image: https://us.v-cdn.net/6030995/uploads/editor/bg/90ccbqtzu8pp.png"
data:image/s3,"s3://crabby-images/99f9e/99f9e7f9f48d3c9bfa3e6e3bf9f7808ddfc0d6b1" alt="Image: https://us.v-cdn.net/6030995/uploads/editor/h5/o5d8fykyn2t8.png"
so... why? how to show me the whole 162 rows predictions performance and confusion matrix?
thanks !!!
data:image/s3,"s3://crabby-images/42d88/42d88020db5245bd6d26803f3a1989b6bedd4004" alt="Image: https://us.v-cdn.net/6030995/uploads/editor/0o/21glvilxv5xh.png"
data:image/s3,"s3://crabby-images/d52f0/d52f03a8dcaf97c497bf5b1bccca4107d3542b49" alt="Image: https://us.v-cdn.net/6030995/uploads/editor/bg/90ccbqtzu8pp.png"
data:image/s3,"s3://crabby-images/99f9e/99f9e7f9f48d3c9bfa3e6e3bf9f7808ddfc0d6b1" alt="Image: https://us.v-cdn.net/6030995/uploads/editor/h5/o5d8fykyn2t8.png"
Tagged:
0
Best Answer
-
lionelderkrikor RapidMiner Certified Analyst, Member Posts: 1,195
Unicorn
Hi @DrGintoki2021,
It is because AutoModel is using a multi hold out validation method.
AM is using 40 % of your initial dataset to test/evaluate the performance of your model.(and the remaining 60 % of the initial dataset to train your model)
For that it split the 40% of your initial training set into 7 folds .
Then he calculates the performances for each of the 7 folds.
then he remove the max performance and the minimum performances of the 7 performances
and thus he keep 5 performances and display the confusion matrix for this remaining 5 folds.
In other words you have in your confusion matrix : 162 data points x 0,4 (40%) x 5/7 = 46 data points
It matches with what you are displaying in the picture you shared, we have 46 = 20+21+5 in your confusion matrix...
EDIT :
Take a look at the "information" panel in the results screen of AutoModel : look at Model -> Performance to have a description of how is calculated the performance in AutoModel.
Regards,
Lionel1