The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

Precision problem

kotsounikikotsouniki Member Posts: 2 Learner III
edited November 2018 in Help

Hello, I have a problem concerning my project and I would really like to know what I am doing wrong..
I have an example set where there is a column named activities that include walking, sitting, joking, standing etc and I want to evaluate the performance of decision tree model in this set. The problem is when I get to performance matrix all the values in activites are compaired only to walking...and I get no precision for the other values (sitting, joking etc). Can someone help? I'm providing the photos of my project.  Thank you in advance!Capture.PNGCapture2.PNG

Answers

  • Thomas_OttThomas_Ott RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,761 Unicorn
    First off, I would use a Cross Validation operator instead of using a Split operator to make training and testing set.

    Next I would look at how balanced your classes are. From the initial results it appears that the model thinks everything is walking. Go back to this first step and see if there’s anything off with your data.

    I would also try other algos like maybe a Naive Bayes.
  • Telcontar120Telcontar120 RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,635 Unicorn

    Great suggestions from @Thomas_Ott as usual.  From your confusion matrix, it looks like the classes are not balanced, so sampling before modeling might help that.  We also can't tell here how many predictive attributes you have and what type of power they have.  You might start by looking at some of the EDA operators like Weight by Information Gain to understand that better. Finally you could consider transforming your label and building a few binary models for some of the other outcomes (like standing vs not standing or jogging vs not jogging) to see what types of patterns there are. Without a data sample it will be difficult to provide more specific suggestions.

     

    Brian T.
    Lindon Ventures 
    Data Science Consulting from Certified RapidMiner Experts
  • kotsounikikotsouniki Member Posts: 2 Learner III

    Thank you all so much for taking time and answering my question! I am really new to this so I have watched the videos in the site but rapidminer is a really powerful and ,thus, complex tool so I will try your suggestions and let you know if they worked!!

Sign In or Register to comment.