The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
"Dynamic Text Weights"
Hi,
I'm experimenting a bit with the text and webmining features around and was wondering if someone is able to provide some suggestions for the following issue.
I would like Rapidminer to read the text and create relationships between the words. For example lets say:
"Right now it is Friday evening and I am typing this in the library."
Now it should give words closer to another a higher relationship factor than those farer apart. Lets say after stopwords for the word Friday {evening, now} would have 1, {now, I} would have 0.5 and so on. At the same time it should ideally adjust the relationship factor based on how frequent words appear at all and in each others context. Thus it would become clear between which words stronger relationships exist. In a next step one could maybe even define a positive impact and negative impact list and determine how a subject is seen and maybe even devleop a timeline perspective based on this.
I had worked in the past with CatPac (http://www.galileoco.com/N_catpac.asp) which has this ability to create these distance vectors between words and I was hoping something like this could get reproduced with RapidMiner.
I'm experimenting a bit with the text and webmining features around and was wondering if someone is able to provide some suggestions for the following issue.
I would like Rapidminer to read the text and create relationships between the words. For example lets say:
"Right now it is Friday evening and I am typing this in the library."
Now it should give words closer to another a higher relationship factor than those farer apart. Lets say after stopwords for the word Friday {evening, now} would have 1, {now, I} would have 0.5 and so on. At the same time it should ideally adjust the relationship factor based on how frequent words appear at all and in each others context. Thus it would become clear between which words stronger relationships exist. In a next step one could maybe even define a positive impact and negative impact list and determine how a subject is seen and maybe even devleop a timeline perspective based on this.
I had worked in the past with CatPac (http://www.galileoco.com/N_catpac.asp) which has this ability to create these distance vectors between words and I was hoping something like this could get reproduced with RapidMiner.
Tagged:
0