The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

Deployment of models with dummy encoded attributes

sectynsectyn Member Posts: 25 Maven
I have a categorical attribute for which I am doing dummy encoding and this generates a few extra columns. Now supposing my score data does not have some of those columns, during scoring of the data, it mentions that the columns which are in the train data but not the score data will be augmented by using mean or mode values. Where can I specify that it should be mean or mode? Or if I do not want either the mean or mode, but just 0, how can I do that?

Best Answer

  • MartinLiebigMartinLiebig Administrator, Moderator, Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,533 RM Data Scientist
    Solution Accepted
    Hi,
    what you want to use is the preprocessing model of Nominal to Numerical. If you apply this, then you get exactly the same columns as in training.

    Best,
    Martin
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany

Answers

Sign In or Register to comment.