The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Maintaining Data Resolution with in-equal sampling frequencies
Hello, I was given a data set in which the label which I want to predict is sampled between 1-6 hours but the inputs which effect it are sampled every hour. My first thought would be to average the inputs to match the sampling frequency of the output but I was wondering if there was any way to retain all the data incase something like variation rather than just the average value effects the output. I have attached a scrubbed version of the data set if you would like to take a look. Thanks!
0
Best Answer
-
Telcontar120 RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,635 UnicornYou could retain all the data but you'll need to pivot it so you still end up with only one row per prediction instance (e.g., every 6 hours). Basically you would be creating extra attributes for the additional sample points. Once you have done that you could look at doing additional feature generation to capture things like min/max in the sample window, range, std deviation, or other measures of dispersion in the additional sample points. Take a look at some of the function options in Generate Aggregation for additional ideas.
5