word list

hasnoooooor Member Posts: 2 Learner I
edited December 2019 in Help
I have a model that has been trained and tested with dataset A. Now I am feeding a new dataset into the model and trying to get the word list output. The problem is that if I do not connect the word list output from dataset A, a missing attribute problem appears. But when I connect the old word list along with the new dataset, I get the word list for dataset A and not for the new dataset. What can I do if I want the word list for the new example set?

Answers

  • MartinLiebig Administrator, Moderator, Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,533 RM Data Scientist
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
  • hasnoooooor Member Posts: 2 Learner I
    Hi,
    Yes, I have already read it. That is what I am currently doing. I would like to get a word list from the new input example set, but the result keeps giving me the word list from training.
  • kayman Member Posts: 662 Unicorn
    Think of the word list as a filter: new words are ignored because they were not included in the training and are therefore not relevant to the model.

    Since your new data will be labeled using only the words found during training, it is logical that the same words are returned. If you want to include new words, you need to retrain.
  • Telcontar120 RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,635 Unicorn
    You could also convert your word list to data using the Toolbox extension and then use the normal Join operators with another dataset that you create, containing only the words you want.

    Brian T.
    Lindon Ventures 
    Data Science Consulting from Certified RapidMiner Experts
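
To make the two answers above concrete, here is a minimal sketch in plain Python. It is not RapidMiner code and does not use any RapidMiner operators; the function names (`tokenize`, `build_word_list`, `apply_word_list`) and the sample documents are hypothetical, chosen only for illustration. It shows why applying a training word list to new data returns only training words, and how building a separate word list from the new data and comparing the two (a join on the word column, in spirit) exposes the words the model has never seen.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def build_word_list(documents):
    """Count every token across the documents (a fresh word list)."""
    counts = Counter()
    for doc in documents:
        counts.update(tokenize(doc))
    return counts


def apply_word_list(documents, word_list):
    """Vectorize documents using ONLY the words in an existing word list.

    Words that are not in the list are silently dropped, which is the
    "filter" behaviour described in the thread.
    """
    vocabulary = sorted(word_list)
    return [[tokenize(doc).count(word) for word in vocabulary]
            for doc in documents]


# Hypothetical sample data.
training_docs = ["good service", "bad service"]
new_docs = ["good price", "terrible price"]

training_words = build_word_list(training_docs)

# Scoring path: the new data is described with training words only.
vectors = apply_word_list(new_docs, training_words)

# Separate path: a word list built from the new data alone.
new_words = build_word_list(new_docs)

# Comparing the two lists, similar in spirit to joining them on the word.
shared = sorted(set(new_words) & set(training_words))
unseen = sorted(set(new_words) - set(training_words))

print(vectors)
print(shared)
print(unseen)
```

In this sketch the word list for the new data comes from a second, independent pass over that data, while the model still receives vectors built from the training vocabulary, so no attributes go missing. Whether the same split is convenient in a given RapidMiner process depends on how the process is wired, which the thread does not show.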