Clustering for database of texts

Mary61 Member Posts: 2 Learner II
edited June 2020 in Help
Hi, I have the same problem, and thank you for your answer. I also have text data, and I used the "Process Documents" operator to split each text into words. I have 100 texts that I need to classify. Can anyone tell me how to choose one word for each row as a class so that I can use the clustering operator?


Answers

  • Telcontar120 RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,635 Unicorn
    You don't need to select only one word to do clustering.  The point of clustering is to operate in multi-dimensional space.  Just create your word vectors using text processing and then use an algorithm like k-means to separate your documents into clusters based on overall similarity.
    Brian T.
    Lindon Ventures 
    Data Science Consulting from Certified RapidMiner Experts
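
To make the suggestion above concrete outside of RapidMiner, here is a minimal Python sketch of the same idea: turn every text into a word vector, then let k-means group the texts by overall similarity, with no single "class word" per row. It is not the RapidMiner process itself. The function names, the tiny stop-word list, the four sample texts, and the deterministic farthest-point initialisation are all assumptions made for illustration; a real process would use the tokenizing, stop-word filtering, and k-means operators (or a library) instead.

```python
import math
import re
from collections import Counter

# Illustrative stop-word list (an assumption); real pipelines use a full list.
STOPWORDS = {"the", "a", "and", "on", "as"}


def tokenize(text):
    """Lowercase a text, split it into words, and drop stop words."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS]


def tfidf_vectors(texts):
    """Turn each text into a length-normalised TF-IDF vector."""
    docs = [Counter(tokenize(t)) for t in texts]
    vocab = sorted({w for d in docs for w in d})
    n = len(docs)
    # Document frequency: the number of texts in which each word occurs.
    df = {w: sum(1 for d in docs if w in d) for w in vocab}
    vectors = []
    for d in docs:
        # Raw counts times a smoothed inverse document frequency.
        vec = [d[w] * (math.log((1 + n) / (1 + df[w])) + 1) for w in vocab]
        # Scale to unit length so that long and short texts are comparable.
        norm = math.sqrt(sum(x * x for x in vec)) or 1.0
        vectors.append([x / norm for x in vec])
    return vectors


def squared_distance(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(vectors, k, max_iter=100):
    """Plain k-means; returns one cluster label per input vector."""
    # Deterministic farthest-point start (an assumption, chosen so the
    # example is reproducible; k-means usually starts from random points).
    centroids = [vectors[0]]
    while len(centroids) < k:
        centroids.append(
            max(vectors, key=lambda v: min(squared_distance(v, c) for c in centroids))
        )
    labels = None
    for _ in range(max_iter):
        # Assignment step: each vector joins its nearest centroid.
        new_labels = [
            min(range(k), key=lambda j: squared_distance(v, centroids[j]))
            for v in vectors
        ]
        if new_labels == labels:
            break  # no label changed, so the clustering has converged
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == j]
            if members:
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels


# Made-up sample texts: two about animals, two about finance.
texts = [
    "the cat chased the mouse",
    "a cat and a kitten sleep",
    "stock prices fell on the market",
    "the market rallied as stock prices rose",
]
labels = kmeans(tfidf_vectors(texts), k=2)
print(labels)
```

With these sample texts, the two animal texts should end up sharing one label and the two finance texts the other. Every word contributes a dimension to the vector, which is why there is no need to pick one word per document before clustering.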