CorpusBasedWeighing class_to_characterize

jaskiemr Member Posts: 8 Contributor II
I'm trying to do some text mining and I'm using the CorpusBasedWeighing. What does the class_to_characterize input parameter represent?
Thanks,
        mj

Answers

  • land RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 2,531 Unicorn
    Hi,
    this parameter defines which class should be characterized by the weights. Please take a look at the operator documentation for more information. The tooltip of the parameter should also help.

    Greetings,
      Sebastian
  • jaskiemr Member Posts: 8 Contributor II
    class to characterize: The target class for which to find characteristic feature weights.

    So if I generate weights for the word vector, does the word list get labeled as a class, like "hurricanes" or something like that?

    What can you do later with these classes?

    Thank you for your help Sebastian,
              mj

  • land RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 2,531 Unicorn
    Hi,
    no, that's not what I meant. The word list just contains the words and their frequencies found in the documents of each class, and it isn't changed at all.
    However, attributes are created from the word list, and what you get is a weighting of these attributes: one weight per attribute. This weight should reflect how important the word is for discriminating the chosen class from the other classes.

    Greetings,
      Sebastian
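
To make the "one weight per attribute" idea concrete, here is a minimal Python sketch. It is not RapidMiner code and not a confirmed description of the operator's internals: it simply assumes that a word's weight is the average of its attribute value (for example a TF-IDF score) over the documents of the class to characterize. The function name `class_weights`, the sample data, and the "hurricanes" / "politics" labels are all made up for illustration.

```python
from collections import defaultdict


def class_weights(examples, labels, class_to_characterize):
    """Return one weight per attribute (word) for a single target class.

    Assumption for illustration only: the weight is the mean attribute
    value over the examples labeled with the target class.
    """
    totals = defaultdict(float)
    count = 0
    for vector, label in zip(examples, labels):
        if label != class_to_characterize:
            continue  # examples of other classes do not contribute
        count += 1
        for word, value in vector.items():
            totals[word] += value
    if count == 0:
        raise ValueError(f"no examples with label {class_to_characterize!r}")
    # Words missing from a document count as 0 for that document,
    # because we divide by the number of class examples, not by occurrences.
    return {word: total / count for word, total in totals.items()}


# Hypothetical word vectors: one dict of word -> value per document.
examples = [
    {"storm": 0.75, "wind": 0.5},
    {"storm": 0.25, "coast": 0.25},
    {"election": 1.0, "vote": 0.5},
]
labels = ["hurricanes", "hurricanes", "politics"]

weights = class_weights(examples, labels, "hurricanes")
for word, weight in sorted(weights.items(), key=lambda item: -item[1]):
    print(word, weight)
```

The result is a plain mapping from word to weight. The word list and the class labels are untouched, which matches the point above: the class is an input that selects which documents are looked at, not something attached to the word list afterwards.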