The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

How to save an optimized predictive model and use in a NEW process with NEW data

adamfadamf Member Posts: 34 Contributor II
edited November 2018 in Help

Hello,

 

After creating a process that uses cross-validation and optimization to train and generate an optimized predictive model, I would like to SAVE the model (to the repository?) so that I can use it in a NEW process to analyze and make predictions with NEW data.  What is the procedure to do this?

 

Thank you,

Adam

Answers

  • Thomas_OttThomas_Ott RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,761 Unicorn

    @adamf Yes, you need to save the model in a repository and then get your scoring data.  Load in your scoring data, use an Apply Model operator, and connect in your Model. Then run it and the output will have the predictions. 

  • adamfadamf Member Posts: 34 Contributor II

    Thanks Thomas.  To save the model and then use it in another process, I would use the Store and Retrieve operators, respectively, taking the "mod" output port from the model operator (or in my case, the Cross Validation operator)?

     

    I'm not clear what you mean by "get your scoring data".  When I save the model, am I saving the specifically trained and optimized model based on my original process (which included nested Optimization and Cross Validation operators)?

     

    Regards,

    Adam

  • BalazsBaranyBalazsBarany Administrator, Moderator, Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert Posts: 955 Unicorn

    Hi!

     

    "Scoring data" is the new data. That's the table that you'd like to score with your prepared model. 

    Usually the scoring data don't have labels - you get the predictions by applying the model on them.

     

    Regards,

    Balázs

Sign In or Register to comment.