The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Filter clustering sentences
Hi dear professors
I want to cluster in a data packet containing 1000 sentences in the rapidshare, and then save the clusters in separate data and do the next job.
I entered the data for this. I used the process document operator and then clustered
But words are separated in clusters. And not sentences. How to save clustered sentences in a file?
Thankful
0
Answers
When you use the Tokenize operator, select "linguistic sentences" rather than "non-word characters" as your token definition.
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts
Hello dear engineer
I mean, I have data twitter data and I want to pre-process tweets. Then put the pre-processed tweaks into an Excel file. Then cluster them up. And I can tell which tweets are in what clusters. Is it possible? How?
I used the process document operator. But as a result, it pre-processed the words and the matrix tf-idf. I want to have preprocessed tweets (in sentence) and I can cluster my tweets ...
??
Can you post your process so we can see what you are doing and more easily suggest corrections? This is a community of volunteers and sometimes it is difficult to understand exactly where you are getting stuck, and other times our attempts to answer questions or supply solutions verbally get lost in translation.
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts
Hello
thanks a lot
I want to preform the sentences of tweets (without stop words and symbols, etc.) into an Excel file, and then I can cluster the sentences.
is it possible?
And then I can put the sentences of each cluster in another Excel file
??
@jabra post a sample of your process, please. It's hard to visualize what you really want to do. Search the Community, there are sample processes that do this that can help you get started.