The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Using Coordinate Data in predictive models
davidnealbrown
Member Posts: 4 Contributor I
I have data that includes coordinates X,Y,Z + condition and a label of interest (outcome). How do I build a predictive model using these coordinates as a factor?
Tagged:
0
Answers
Hi,
Is this data in tabular format, i.e. can you read it into RapidMiner and end up with 4 columns for x,y,z, condition and then a 5th one with the outcome information? If yes, all you need to do is to define the outcome column as label using the "Set Role" operator and using any of the included machine learning algorithms to build the model.
And if this does not tell you anything, I highly recommend to go through the tutorials first which are available at the help screen which comes up if you press the icon in the top right corner of RapidMiner.
Hope this helps,
Ingo
Thank you Ingo. This does help and yes the data sets are in tis format. So if I understand you correctly you are saying that the rapid miner models will automatically consider the x,y, and z coordinate in the example rather than considering an individual axis. Is that correct?
Just to clarify further, I am working with fMRI data - brain activation coordinates in Tailarach standardized sapce. here is a sample spread sheet. I know there is not enough data for a model it is for illustration purposes only.
Thanks!
DNB
What @IngoRM is saying is that the coordinates will just be input in your model and won't carry with it the implied information of it being in some 3D space tied to a reference point. A set of coordinates tied to no reference point (i.e. the Earth) would be meaningless.
In your case I would look at transforming the coordinates to a new feature, maybe a new distance feature. Without knowing more about your particulary task but you could calculate the distance between the coordinates to each other, or your reference point. That might be more useful than just coordinates. Something to think about.
hello @davidnealbrown thanks for the csv file. That's a very interesting use case! Some thoughts/questions...
- so as @Thomas_Ott said, RapidMiner has no idea what x, y, and z coordinates are. They are just numerical attributes. But I cannot see any reason why that would be a problem in your use case.
- what are the cluster #s? Are these the results of a segmentation process or some medical terminology?
- I assume you're trying to predict Label via decision tree, right? So your process looks something like this?
Very cool. Keep us posted.
Scott
[EDIT: and thank you - I just googled "Talairach standardized space" and learned a whole new thing.
Thanks!