"creating a Word List"
Hi,
I would like to create a word list for a list of documents.
I read in the Word Vector Tool tutorial that the following chain of operators can help: TextInput, CorpusBasedWeighting and InteractiveAttributeWeighting.
I tried it but am at a loss as to what to fill in for the parameter class_to_characterize of CorpusBasedWeighting. I have no class label specified, as there is no class involved. My input has the document name as ID and the content as an attribute.
I would appreciate any advice and guidance.
Answers
You cannot use the weighting if you don't have labels. That's because the weighting expresses how important each word is for distinguishing documents with different labels. If you don't have labels, there's nothing to distinguish and hence no weighting...
What do you need the word list for? Perhaps you can simply generate the standard word list automatically by processing the documents using something like this:
Greetings,
Sebastian
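
To illustrate what a plain, unweighted word list looks like when no labels are involved, here is a minimal sketch in Python. It is not RapidMiner and it is not the process referred to in the answer above. The function name build_word_list, the simple letters-only tokenization, the min_length parameter and the two sample documents are all assumptions made for illustration.

```python
import re
from collections import Counter


def build_word_list(documents, min_length=2):
    """Build a word list from {document_id: text}.

    Returns two Counters:
      term_counts      total occurrences of each word across all documents
      document_counts  number of documents in which each word appears
    """
    term_counts = Counter()
    document_counts = Counter()
    for text in documents.values():
        # Hypothetical tokenization: lowercase runs of ASCII letters only.
        tokens = [t for t in re.findall(r"[a-z]+", text.lower())
                  if len(t) >= min_length]
        term_counts.update(tokens)
        # A set counts each word at most once per document.
        document_counts.update(set(tokens))
    return term_counts, document_counts


# Made-up sample input: document name as ID, content as text.
docs = {
    "doc1": "The cat sat on the mat.",
    "doc2": "The dog chased the cat.",
}

term_counts, document_counts = build_word_list(docs)
for word, count in term_counts.most_common():
    print(word, count, document_counts[word])
```

No class label is needed here because the counts only describe which words occur and how often. Nothing is being compared between groups of documents.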