The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

Attributes do not match, apply model to the classifikation data not possible

linchenlinchen Member Posts: 2 Learner I
edited August 2023 in Help
Hi together, I have some issues with my project. My task is to generate target variable training data  (Retouren_Training.txt),  
which  determines the class membership according  to  the case  description for  the  return rate.
In concrete terms, all customers whose forecast return quota(=  RETOUREN_MENGE/LIEFER_MENGE) is a maximum of 0.18 (ie a  
maximum  of  18%)  are  considered  to  be  low returners. On the other hand, all  customers whose predicted return  rate  is  greater  
than 0.4 (ie greater  than 40%) are considered high returners.  All  other  customers  are  neutral, ie neither low nor high returners.
Furthermore, a data mining model  is to  be created,  which  is  to  be applied  to  the  9,900  customers to be classified as an example (Retouren_Klassigung.txt)  and a  class assignment in neutral, low or high returns.

I created the model, calculated the variable, but the Apply model still doesn't work. Where is the error? Would be very grateful for your support.

Best wishes Lina

Best Answer

  • BalazsBaranyBalazsBarany Administrator, Moderator, Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert Posts: 955 Unicorn
    Solution Accepted
    Hi!

    You are doing some preprocessing inside the cross validation, including generation of new attributes (columns). 

    Then you build models on this altered data.

    Models are built upon the attributes that go into them. Unless the model is doing its own attribute selection, it will expect ALL incoming attributes (with the same name and type) upon predicting with Apply Model. 

    You need to do the same preprocessing on the training data as you will apply later on the testing or prediction data. The easiest way to achieve this is to put the preprocessing into a separate process and then use this process as a subprocess twice in the main process, once for the modeling and then for the model application. 

    Regards,
    Balázs

Answers

Sign In or Register to comment.