The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Finding correlations between outputs and inputs for large number of data
Hi everyone!
I'm very new at Rapidminer and I have a question (I just installed it and started toying around with the "auto-model").
What I'm trying to achieve is : I did various tests by variating some inputs, and I have an excel file with for each tests the used inputs, the outputs (temperature, forces...). Since I have a large number of tests, I would like an analysis using a software like rapidminer. I would like to find correlation between inputs and outputs (like I have lower forces for this kind of tests... things like that).
I'm not quite sure if rapidminer is suitable for this? If this kind of analysis is achievable through rapidminer, I would really appreciate if you could indicate me some tutoriel to achieve this or give me some advices here (english is not my first language as you may have noticed and I have difficulties to find something that match my problem. So far on the forum I just found some posts suggesting using auto-model).
Have a good day.
Tagged:
0
Answers
If you are trying to find a correlation between attributes(including output labels), You can use Correlation matrix operator in RapidMiner which provides you with a correlation matrix. In the below scenario I selected Titanic training dataset from samples which have an output label "Survived". I included this so that I can find the correlation between inputs and output. I also provided XML code below for your understanding. You can also observe which of these are highly correlated based on their coloring.
Thanks
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing