The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
How to increase the accuracy of the classifiers?
haziqros_97
Member Posts: 5 Learner II
in Help
Hye everyone!
I really need your help. I am doing my thesis research using RM. My supervisor asked me to use Turbo Prep. The issue I faced now is the accuracy of the classifiers is below 70%. My supervisor wanted me to get above 80%, if possible. Can you guys give me some tips on how to increase the accuracy?
Thanks in advance.
I really need your help. I am doing my thesis research using RM. My supervisor asked me to use Turbo Prep. The issue I faced now is the accuracy of the classifiers is below 70%. My supervisor wanted me to get above 80%, if possible. Can you guys give me some tips on how to increase the accuracy?
Thanks in advance.
0
Best Answer
-
lionelderkrikor RapidMiner Certified Analyst, Member Posts: 1,195 Unicorn@haziqros_97,
You can find here a process with a performance strictly greater to 70 % using :
- Smote upsampling
- feature engineering
Feel free to increase the time limit associated to the Automatic Feature Engineering operator (currently se to 180 seconds).
I think that to reach 80 % accuracy will be extremely difficult, maybe impossible...
Keep in mind that the accuracy is not the only criteria (the "Kaggle syndrom") to choose a model, you have to take into account :
- the simplicity of the model
- the interpretability of the model
- the capacity to "generalize" of the model...
Hope this helps,
Regards,
Lionel0
Answers
wide subject !!!
You can :
- preprocess /clean your data : impute missing value, replace rares values etc.
- perform feature engineering (feature selection/feature generation)
- try different models (NB, neural networks, Decision tree)
- optimize the parameters of the best model (using Optimize Parameters operator)
and ideally perform a Cross Validation to evaluate the accuracy of your models. It is considered as a best practice, representative of the performance of your model(s) on future unseen data.
I hope it helps !
Regards,
Lionel
PS : Note that all the 4 steps described above can be automatically performed by AutoModel inside RapidMiner...
Please help me, guys.
Thanks in advance.
What is the label in your dataset ?
Lionel
And an other process using the same preprocessing steps described above, but using the deep-learning library....
Hope this helps,
Regards,
Lionel