classification models
I have a data set with several variables describing a disease, plus a yes/no variable telling whether the patient has the disease or not. I have to create two classification models (decision tree and k-NN) and make a diagram that shows the performance of both methods in a single combined chart. How do I do it? What process do I have to follow? I watched the videos, but as a beginner in RapidMiner I got confused.
Answers
What performance metric do you want to calculate?
For a binominal problem (like yours), you can use the Compare ROCs operator.
Here is a sample process using this operator:
Regards,
Lionel
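For readers wondering what a ROC comparison actually plots, here is a minimal Python sketch. It runs outside RapidMiner and is not how the Compare ROCs operator is implemented; the function name `roc_points`, the 0/1 label encoding and the example scores are all made up for illustration.

```python
def roc_points(y_true, scores):
    """Return the (false positive rate, true positive rate) points of a ROC curve.

    y_true: list of 0/1 labels (1 = has the disease, an assumed encoding).
    scores: the model's confidence that each example belongs to class 1.
    """
    n_pos = sum(1 for y in y_true if y == 1)
    n_neg = len(y_true) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("a ROC curve needs examples of both classes")

    # Rank examples from the most to the least confident prediction.
    ranked = sorted(zip(scores, y_true), key=lambda pair: -pair[0])

    points = [(0.0, 0.0)]
    tp = fp = 0
    for i, (score, label) in enumerate(ranked):
        if label == 1:
            tp += 1
        else:
            fp += 1
        # Emit a point only after the last example of a tied score.
        if i + 1 == len(ranked) or ranked[i + 1][0] != score:
            points.append((fp / n_neg, tp / n_pos))
    return points


# Made-up labels and confidences for one model.
y_true = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.3, 0.2]
points = roc_points(y_true, scores)
```

Plotting the points of each model on the same axes gives one chart for both models; the curve that stays closer to the top-left corner belongs to the better classifier.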
@lionelderkrikor gave a perfect example for comparing two models. If you want performance indicators such as AUC, Kappa, etc. using cross-validation (recommended), you can check the code below. You then need to note the performance of each model and check which one worked better.
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing
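To make the two indicators mentioned above (AUC and Kappa) a bit more concrete, here is a small Python sketch of how they can be computed by hand. It is only an illustration under assumptions: the function names, the 0/1 label encoding, the 0.5 cut-off and the example numbers are invented, and RapidMiner's own performance operators may handle ties and edge cases differently.

```python
def roc_auc(y_true, scores):
    """AUC as the share of (positive, negative) pairs ranked correctly.

    A tie between a positive and a negative score counts as half a win.
    """
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs examples of both classes")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def cohen_kappa(y_true, y_pred):
    """Cohen's kappa: agreement between labels and predictions beyond chance."""
    n = len(y_true)
    labels = set(y_true) | set(y_pred)
    observed = sum(1 for t, p in zip(y_true, y_pred) if t == p) / n
    expected = sum((y_true.count(c) / n) * (y_pred.count(c) / n) for c in labels)
    if expected == 1.0:
        # Only one class appears anywhere, so kappa is undefined.
        raise ValueError("kappa is undefined when only one class is present")
    return (observed - expected) / (1.0 - expected)


# Made-up labels and confidences for one model.
y_true = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.5, 0.2, 0.1]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed 0.5 cut-off

auc = roc_auc(y_true, scores)        # 8 of 9 pairs ranked correctly
kappa = cohen_kappa(y_true, y_pred)  # observed 4/6, expected by chance 0.5
```

An AUC of 0.5 or a Kappa of 0 means the model is no better than guessing; both reach 1 for a perfect model, so the model with the higher values is the one to prefer.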
The "code" you see is the XML of the process. You have to import it into RapidMiner.
Step by step:
To import such an XML description of a process, e.g. to use a process someone else has posted here in the forum, follow these steps:
- Create a new process and go to the XML panel.
- Clear the view and paste the XML code you got into that panel.
- Then press the green checkmark icon at the top of the panel.
- Switch back to the Process panel.
Regards, Lionel
@varunm1 you gave me a "Generate Data" operator. What data does your code work on? How do I make it work with my data? Where do I insert it?
They don't tell us which kind of diagram to use; since we are beginners, they obviously want us to find one by searching. Which diagram would you recommend for a beginner?
As we don't have your dataset, I just generated random data. In your case, you need to delete the Generate Data operator, import your data into RapidMiner, specifying the label column in the column attributes, and connect your data set to the Multiply operator. The Multiply operator just creates copies of the data set, so that the same data goes to the two algorithms, which sit inside the Cross Validation operators. You can double-click a Cross Validation operator to see which model is placed inside. See the RapidMiner tutorial below to learn how to import data.
https://www.youtube.com/watch?v=eLR0IiBT76w
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing
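If it helps to see the same idea outside RapidMiner, the flow described above (one data set, copied to two learners, each evaluated with cross-validation) can be sketched in plain Python. Everything here is an assumption made for illustration: the tiny data set is invented, the helper names are hypothetical, and a one-split "decision stump" stands in for a full decision tree to keep the example short.

```python
import random
from collections import Counter


def majority(labels):
    return Counter(labels).most_common(1)[0][0]


def fit_stump(X, y):
    """One-split tree: pick the feature/threshold with the fewest training errors."""
    best = None
    for f in range(len(X[0])):
        values = sorted(set(row[f] for row in X))
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2  # midpoint between neighbouring values
            left = [lab for row, lab in zip(X, y) if row[f] <= t]
            right = [lab for row, lab in zip(X, y) if row[f] > t]
            l_lab, r_lab = majority(left), majority(right)
            errors = sum(v != l_lab for v in left) + sum(v != r_lab for v in right)
            if best is None or errors < best[0]:
                best = (errors, f, t, l_lab, r_lab)
    if best is None:  # no usable split: always predict the majority class
        return (0, float("inf"), majority(y), majority(y))
    return best[1:]


def predict_stump(model, x):
    f, t, l_lab, r_lab = model
    return l_lab if x[f] <= t else r_lab


def fit_knn(X, y):
    return (X, y)  # k-NN just stores the training data


def predict_knn(model, x, k=3):
    X, y = model
    nearest = sorted(zip(X, y),
                     key=lambda pair: sum((a - b) ** 2 for a, b in zip(pair[0], x)))[:k]
    return majority([lab for _, lab in nearest])


def cross_validate(X, y, fit, predict, k_folds=5, seed=0):
    """Accuracy over k folds: every row is used for testing exactly once."""
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    correct = 0
    for fold in range(k_folds):
        held_out = set(idx[fold::k_folds])
        train_X = [X[i] for i in idx if i not in held_out]
        train_y = [y[i] for i in idx if i not in held_out]
        model = fit(train_X, train_y)
        correct += sum(predict(model, X[i]) == y[i] for i in held_out)
    return correct / len(X)


# Invented, clearly separated data: two features and a yes/no label.
X = [(2 + 0.1 * i, 2 - 0.1 * i) for i in range(10)]
X += [(-2 - 0.1 * i, -2 + 0.1 * i) for i in range(10)]
y = ["yes"] * 10 + ["no"] * 10

# Both learners get the same data, like the two outputs of Multiply.
results = {
    "decision stump": cross_validate(X, y, fit_stump, predict_stump),
    "k-NN": cross_validate(X, y, fit_knn, predict_knn),
}
print(results)
```

On real data the two accuracies will usually differ, and those two numbers (or the ROC curves of the two models) are what goes into the comparison chart.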
I am looking at the two XML codes, and I want to see which one best shows the difference between the two models, decision tree and k-NN. I will look up the curves by googling!