Use dates from Data A to indicate events in Data B
Hi,
Apologies if this has already been discussed. I am sure it has, but I just can't find it. If so, please leave a link to the discussion.
I would like to use the dates in Data A (Excel sheet) to highlight the corresponding dates in Data B (Excel sheet) so as to indicate significant events, for use in a time series graphic.
I have tried the Append, Join, and Union operators numerous times with no luck.
Each data set, when run on its own, succeeds and produces its own table.
Thank you for any assistance on the matter.
Best Answer
lionelderkrikor RapidMiner Certified Analyst, Member Posts: 1,195 Unicorn
Hi @SHSguy,
I assumed that you needed Python for another task, but
you don't need Python, Anaconda, or Execute Script to run the process that I shared (which you will need to adapt to your dataset). You need to:
1. Activate the XML panel:
2. Copy the code that I shared and paste it into the XML panel:
3. Click on the "Check" button:
4. Normally, the process then appears in the main window.
I hope it will be helpful
Regards,
Lionel
Answers
Hi @SHSguy,
What do you mean by "highlight"? Can you post an example of what you want to do? Could you also share your dataset(s), please?
Regards,
Lionel
Hi Lionel,
Thank you for the reply. I don't have an example but maybe I can explain it better:
Data A (Events) states the dates of interest:
2014-01-10
2015-03-11
2016-05-28
Data B (Stock prices) provides the stock prices from 2013 - 2017.
I am trying to use the dates in Data A (red stars) to highlight/indicate the corresponding dates in Data B.
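For readers who want to see the idea outside RapidMiner, the matching described above can be sketched in pandas as a minimal illustration. This is not the process shared in this thread. The column names `Date`, `Close`, and `Event`, the helper `flag_events`, and the price values are all hypothetical; only the three event dates come from the post above.

```python
import pandas as pd

# Stand-in for Data A: the three event dates listed above.
events = pd.DataFrame(
    {"Date": pd.to_datetime(["2014-01-10", "2015-03-11", "2016-05-28"])}
)

# Stand-in for Data B: invented prices, for illustration only.
prices = pd.DataFrame(
    {
        "Date": pd.to_datetime(
            ["2014-01-09", "2014-01-10", "2014-01-13", "2015-03-11"]
        ),
        "Close": [10.0, 10.5, 10.2, 12.1],
    }
)


def flag_events(prices, events, date_col="Date"):
    """Return a copy of `prices` with a boolean 'Event' column.

    A row is flagged when its date also appears in `events`.
    Event dates that have no row in `prices` are simply ignored.
    """
    out = prices.copy()
    out["Event"] = out[date_col].isin(events[date_col])
    return out


flagged = flag_events(prices, events)
print(flagged)
```

With real data, the two sheets could be loaded with `pd.read_excel` and the result saved with `DataFrame.to_excel`. The flagged rows are then the points to mark on the time series plot.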
Hi again @SHSguy,
I think I understand:
Does this process meet your needs?
Here is the link to download the fictitious example sets used in the process above:
https://drive.google.com/open?id=1gZ3gtvU_E760tjWUpKD75ElLHW06rNxO
Don't hesitate to reply if it's not what you are looking for.
Regards,
Lionel
Hi Lionel,
Thank you for the information, it looks very impressive. Unfortunately, I have no idea how to use the code in RapidMiner. I downloaded the Python 3 extension for RapidMiner and installed Pandas for Python on the Mac (I tried searching for it in the extensions, but it does not seem to be there).
Cheers,
Hi @SHSguy,
Pandas is a Python library that you have to install yourself on your Mac.
To install Python 3 and all the associated libraries, I recommend using Anaconda:
https://www.anaconda.com/download/#macos
Can you adapt and run the code to get what you want for the stock prices?
Tip: you can add a Write Excel operator at the end of the process to save your example set as an Excel file (in order to create your curves, etc.).
Regards,
Lionel
Hi Lionel,
I installed both Anaconda and Pandas and checked that both are running. Unfortunately, I am getting a script error. If possible, could you attach a screenshot of the process setup in RapidMiner?
The Execute Script operator is only for use with Groovy script code. Do what @lionelderkrikor says above to load in the XML code.
Thank you, that helped a lot. I appreciate the effort.
You're welcome @SHSguy
Don't forget to add the Write Excel operator at the end of your process to save your resulting ExampleSet as an Excel file.
That way you'll be able to build the curves you showed in a previous post.
Regards,
Lionel