The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
"[SOLVED] can't get decision tree to work"
Hello,
I'm new to rapid miner and I'm trying to create a decision tree which ultimately shows variables that lead people to churn. I got the data from here:
http://www.dataminingconsultant.com/data/churn.txt
I first saved it as a csv file. Imported into rapidminer and set everything as nominal. phone# as ID, churn as label, and everything else as attribute.
from there I put the data linked to to a x-validation operator, and when I run it, the tree only shows up one box (leaf, node?) sorry it might be a stupid question, but is the first process I do in rapid miner.
any help greatly appreciated.
I'm new to rapid miner and I'm trying to create a decision tree which ultimately shows variables that lead people to churn. I got the data from here:
http://www.dataminingconsultant.com/data/churn.txt
I first saved it as a csv file. Imported into rapidminer and set everything as nominal. phone# as ID, churn as label, and everything else as attribute.
from there I put the data linked to to a x-validation operator, and when I run it, the tree only shows up one box (leaf, node?) sorry it might be a stupid question, but is the first process I do in rapid miner.
any help greatly appreciated.
Tagged:
0
Answers
I use that dataset to teach DataMining and was having the same problem you are having. So I was going to post a comment about the dataset but decided to give a try one more time with version 5.2. It turns out that the problem has been fixed.
if I may ask.what attributes did you give to the data when you imported it? I'm thinking that might be it.
label: churn
id: phone number
attributes: all other ,
I also set the attribute type of area code to polynominal
One comment: I typically use the gini index as criterion (I learned my trees from Breiman ). If you use the default criterion you get a tree with a single node unless you play with (lower) the minimal gain to, say, 0.01.
Hope this helps,
\E.
For some reason it seems it is not clustering the data points but rather it analyzes each one and it includes each data point in the decision tree. if that makes any sense. Even reducing the dimensions I know don't have any weight doesnt make it work. I'll be posting a picture of the results soon
https://docs.google.com/open?id=0B2yDHlsQxOyabEdrWnk2NUg2Qk0
I played around with the variables and set ID + label = nominal. Everything else was set to real and it gave me a tree. Hooray.
thanks for the help