The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Time based term frequency analysis
dawidprozesky
Member Posts: 1 Learner I
in Help
Hi, I explored rapidminer a while ago, and have now returned with a specific analysis which I hope to achieve. I have a data set in Excel with the following columns:
Date (dd/mm/yyyy format) | Body of Text (text) | Publisher (name)
So each record in the data set relates to a specific body of text published at a specific date, and the name of the publisher.
My end goal is to identify words/terms in the texts which started occurring after a given date (i.e. after 1 January 2010), as well as see the word/term frequencies of these identified words/terms over time (can be per year) after the given date.
My current config is: Read Excel - Nominal to Text - Process Documents from Data (tokenizing, filtering and transforming) - Wordlist to Data
I am very new to rapidminer, so any assistance would be really appreciated!!
Date (dd/mm/yyyy format) | Body of Text (text) | Publisher (name)
So each record in the data set relates to a specific body of text published at a specific date, and the name of the publisher.
My end goal is to identify words/terms in the texts which started occurring after a given date (i.e. after 1 January 2010), as well as see the word/term frequencies of these identified words/terms over time (can be per year) after the given date.
My current config is: Read Excel - Nominal to Text - Process Documents from Data (tokenizing, filtering and transforming) - Wordlist to Data
I am very new to rapidminer, so any assistance would be really appreciated!!
0
Answers
As far as looking for occurrences after a specific date, a simple Filter Examples should suffice to handle that.
This should get you started.
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts