The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Can we encode categorical data to numerical and then find the correlation in Rapidminer
Can we encode categorical data to numerical and then find the correlation in Rapidminer? if so please let me know the process
Tagged:
0
Answers
For example, if the data is actually nominal in nature, meaning it is not inherently ordered (think of things like colors or names) then a simple numerical replacement (where each nominal category is given a successive integer value) is actually very misleading. That type of numerical conversion is only appropriate when the nominal categories correspond to some kind of ordered scale (similar to a Likert scale). For other nominal data, you would want to do dummy coding conversion, which takes each nominal value and turns it into a zero/one variable (called a dummy code) and then you can run a correlation analysis on those attributes.
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts
This is BTW what the correlation matrix in RapidMiner's Auto Model is doing. You can open the process and see how it is done on your data #noblackboxes
Best,
Ingo