Create Model from Rule
I have a known set of rules from an industry engineering document, expressed as a series of if-then-else statements and some engineering calculations. The output of these rules are similar to classification results in RM.
We do not have data underlying for these rules so we are unable to create models following the normal RM process of learning and training. Is there a way to input these explicit rules into a model and as we gather data over time apply this model to this data to validate the model? Normally, we start with data to create the model, but in this case we want to use RM to somehow create a model with known rules and then validate and improve over time. In summary, it'd be great if there was an easy way to enter the rules in a model independent of underlying data.
Any ideas are appreciated!
Best Answer
-
michaelgloven RapidMiner Certified Analyst, Member Posts: 46 Guru
thanks for the creative suggestion and example @earmijo @Thomas_Ott ! This meets my requirements.
1
Answers
@michaelgloven maybe I'm not understanding you compleletly but couldn't you just input these if-then statements into a Generate Attribute operator? Then as you gather they data, they execute your rules and you get your results? Or do you want to use the if-then statements as a starting point and then as your data comes in, you adjust the if-then statements to fit the data?
more of your second point....for example, the generate attribute set of rules will not, as far as I know, allow me to create performance vectors to allow me to see how well my model (rules) are performing. I'm trying to bridge from potentially unreliable "fixed rules" to "data driven rules", but I don't have any data yet. So, if I was for example able to "import" induction or decision-tree rules into a blank model, I would then be able to input data over time and use cross validation to determine performance. This may tell me the historical rules are wrong and I need to modify my features or methods to make predictions. I suspect there is bias and over conservatism in some of these original rule-sets, and believe there is a value prop in using RM to support my assertion.
You could. Following @Thomas_Ott suggestion, you could generate a new attribute, change its role from "regular" to "prediction" and compare it to the label (once you have data of course).
Take a look at the following simple example.