The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Convert categorical variables into dummy variables
aisyahwahyuna
Member Posts: 2 Learner I
in Help
Hi, I want to perform a regression task to predict continuous response. I have 4 categorical variables, others are numerical.
Categorical variables are:
age=(≤20, 21-35, 36-50, ≥51)
gender=(Female, Male)
income level=(1=insufficient, 2=sufficient)
BMI range=(1=<25, 2=>25)
*Income level & BMI are keyed in as numerical code in my dataset
Let's say I want to perform SVM, RF, Decision Tree, MLR, and KNN;
1. Should I convert all categorical variables into dummy variables?
2. If using numerical coding is more suitable, should I change the data type to nominal (binominal/polynominal) or retain it as integer?
Tagged:
0
Answers
Where you do want to use a model that can't support categorical variables, I'd personally be very careful in using numerical coding and recommend dummy encoding as a preferred method - here the nominal to numerical operator should work well. It can be appropriate in some instances, especially when it's binominal, but I use it sparingly as it can cause biasing of the output of your model. Hope this helps!