The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Discover hidden rules among data
Hello, I'm playing a bit with RapidMiner and I'm doing some experiments.
I would like to know if is possible to discover hidden rules among related data.
Here an example:
Here the hidden rule is: RESULT= (P1 * P2) + A1 + A2
How can I achieve this with RapidMiner? Can I discover the rule if it exists?
And if a precise rule doesn't exist, how can I predict (the near) in the last row RESULT value based on the previous examples?
Thanks
Luciano
I would like to know if is possible to discover hidden rules among related data.
Here an example:
How can I achieve this with RapidMiner? Can I discover the rule if it exists?
And if a precise rule doesn't exist, how can I predict (the near) in the last row RESULT value based on the previous examples?
Thanks
Luciano
0
Answers
What an interesting question! Although this doesn't look like standard RM fare it is because of a little used lovely that creates new attributes by mathematically combining existing attributes, and remembering their construction. So the following code repeatedly and recursively generates new attributes and skims off the best. If you click on the top right of the meta data view you can display the constructions There remains a little glitch about renaming attributes with the same construction, but a start at least.
Doubtless better results can be achieved by tweaking things with an optimiser, and using more operators but I'm quite surprised at how well a two sign zombie can get on four examples, and you get to understand the output..
Anyways, a really nice puzzle, thanks. PS. There is a more succinct function which does not use P2 !
Result=(P1*P1)+P1+ A1+A2
In my example I used only 4 rows of data only for simplicity, but you can create as much as you want. Also the numbers used to calculate the result can (and should) be random.
Is there a system to resolve with neural nets?
I learned a lot from your code. If you have other ideas please post them.
Thanks.
Meta data is underspecified. Cannot check precondition.
when I attempt to run the program. Help needed by this relative newbie to rm.
Thanks
It looks like you've got the data sorted out ( I seem to remember filling in the last question mark as 383 and just pasting into a csv ), so in fact you're good to go. I also get that message, it is a warning which means that RM cannot validate the setup, it does not necessarily mean that it cannot run the process.
In this process that sounds about right, because it has something of a dynamic approach, as it recursively generates and tests. So ignore the warning and press the start button, you have nothing to lose but your sanity