The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
I need help with predicting an attribute
Hallo all,
I have a dataset that contains the yearly gas use 2010- 2018, temperature from 2010 - 2018, the provinces in the country that i want to analyze, and the speed of wind. What i want to do know is to predict what the usage of gas will be in the future. When i observe the data set it is clearly that the usage of gas is getting less by year and that there is correlation between temperature and gas usage. I tried to get an decision tree but it wont work for some reason.
Can someone help me with predicting the gas usage?
Thanks in advance
I have a dataset that contains the yearly gas use 2010- 2018, temperature from 2010 - 2018, the provinces in the country that i want to analyze, and the speed of wind. What i want to do know is to predict what the usage of gas will be in the future. When i observe the data set it is clearly that the usage of gas is getting less by year and that there is correlation between temperature and gas usage. I tried to get an decision tree but it wont work for some reason.
Can someone help me with predicting the gas usage?
Thanks in advance
0
Best Answers
-
lionelderkrikor RapidMiner Certified Analyst, Member Posts: 1,195 UnicornHi @ramos213,
Have you tried to connect the in port to the Generate Macro operator input port ?
If the issue is still present after doing this connection, please share your data and your process in order we can reproduce and understand your issue.
Regards,
Lionel0
Answers
This seems like you are trying to forecast future usage of gas. If you don't have any future data without labels then you need to go for forecasting related to time series data. Here is a link that helps you understand how time series analysis works.
https://rapidminer.com/resource/time-series-analysis/
Do let us know if this helps. If not, please clearly inform, how your data looks and do you have any unlabelled data to predict and the way you are building models.
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing
I watched the video and it is what i need, but i keep getting an error when i try to forecast. I did everything exactly like the video shows but for some reason the macro doesnt give input to cross validation.
The input of cross-validation is an "example set". I guess you are using a wrong input to cross-validation operator.
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing
I forgot to connect it indeed, but know i have another problem.
Can you provide us with the data and your process? You can download your process by going to FILE --> Export Process and then attach here in the thread with data set.
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing
You have only 9 examples in your initial example set and 4 examples after windowing !!!
Thus you can not perform a 10-folds cross -validation (there are no enough examples).
You can not build a reliable model with so few data : You have to increase significantly the size of your training set....
Regards,
Lionel
PS : if you want absolutly a working process, set the k-folds parameters of CV to k = 4.
You have 27 examples in your initial dataset for your 3 provinces, thus you have 9 examples for each province after splitting according the provinces :
That's you did and you did the right thing but ...I have to insist : 9 examples is no enough to build a relevant and reliable model.
Try to increase the size of your dataset by finding the variables values before 2010 (for example) ....
Thanks you for your understanding,
Regards,
Lionel