Input/Output Word List Question

mdc Member Posts: 58 Maven
edited November 2019 in Help
1. What is the meaning of the last 2 numbers after each attribute? Are they significant? I tried loading my own keywords (just a plain list) from other sources, and it did not seem to work.

@number_of_documents 29
@number_of_classes 1
erasable,5,5
array,10,10
including,9,9
plurality_memory,6,6
floating_gate,10,10
transistor_floating,5,5
memory_string,5,5
string,7,7

2. What exactly are the uses of the input and output word list?

3. Is RM 4.4 out yet?

thanks,
Matthew

Answers

  • IngoRM Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University Professor Posts: 1,751 RM Founder
    Hello,

    ad 1) the numbers in the word list state
    • the number of documents in which the word occurred (the first number)
    • the number of documents in which the word occurred for each available class (second and following numbers, here you seem to have only one class)
    ad 2) the input and output word lists ensure that the TFIDF calculations are comparable between the training and application phases, which is important for document classification

    ad 3) RM 4.4 will be released tomorrow during the day
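
To make points 1 and 2 concrete, here is a minimal Python sketch (not RapidMiner code) that reads a word list laid out like the sample in the question and derives an inverse document frequency from the stored counts. The file layout is inferred only from that sample. The function names `parse_word_list` and `idf` are hypothetical. The `log(N / df)` formula is the textbook IDF and is an assumption here; RapidMiner's exact weighting may differ. It also suggests why a plain keyword list would probably not load: each entry is expected to carry its document counts.

```python
import math

# Shortened copy of the sample word list from the question.
WORD_LIST = """\
@number_of_documents 29
@number_of_classes 1
erasable,5,5
array,10,10
floating_gate,10,10
string,7,7
"""

def parse_word_list(text):
    """Split a word list into header values and per-word document counts."""
    header = {}
    words = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith("@"):
            # Header line, e.g. "@number_of_documents 29"
            key, value = line[1:].split(None, 1)
            header[key] = int(value)
        else:
            # Word line: word, total document count, then one count per class
            term, *counts = line.split(",")
            counts = [int(c) for c in counts]
            words[term] = {"doc_freq": counts[0], "class_doc_freq": counts[1:]}
    return header, words

def idf(header, words, term):
    """Textbook IDF, log(N / df). Assumed formula, not necessarily RapidMiner's."""
    return math.log(header["number_of_documents"] / words[term]["doc_freq"])

header, words = parse_word_list(WORD_LIST)
print(words["array"], idf(header, words, "array"))
```

Because the document counts come from the stored list rather than from whatever documents are being scored, reusing the same list in the application phase yields the same IDF values as in training, which is the comparability described in ad 2.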

    Cheers,
    Ingo